Scheduling Jupyter Notebook Jobs with Zepl
Mitsu Wada
April 10, 2021
What is the Zepl Job Scheduler?
Zepl’s provides a cloud-hosted scheduler that spins up containers or clustered resources to execute your Jupyter or Zeppelin notebooks on a user-defined schedule. Users can monitor their job statuses to never miss a failed job again.
Why Use the Zepl Job Scheduler?
If you maintain open source tools or cron schedules on a shared server to scheduler your notebooks or Python scripts, this feature is for you. This makes training your models, keeping data visualizations up to date, or even just running a SQL job on a regular interval easy. You can set up a scheduled job to run your notebook in seconds.
If you already have an existing Jupyter or Zeppelin notebook, you can easily import your existing notebook into Zepl and run it on a schedule. With Scheduled Jobs, you can spin up a Spark cluster and run your job on a schedule. There’s no need to wait around for the cluster to spin up and you can even set your Spark cluster to shutdown automatically once the job completes.
How to Schedule your First Job:
- Create a notebook in Zepl
- In the notebook menu, select ‘Set Schedule’
- Name the job and select your Run Type, Start and End time. If you’re familiar with cron jobs, you also have the option to enter a cron syntax.
Manage Your Team’s Jobs
It’s great that your team can easily set their notebooks to run on a schedule; however, it’s also easy for them to forget about jobs once they’re set. Zepl gives admins visibility to see all the scheduled jobs that users have set in their organization with the Scheduled Jobs Monitoring feature.
Users with appropriate permissions can view the Scheduled Jobs by clicking on the Monitor tab of the sidebar and then Scheduled Jobs. This page lists all the Scheduled Jobs that are scheduled to run in the next month or have executed in the previous month for the entire organization. You can also adjust the date range and the table can be sorted on any column. This table contains all the information you need to maintain your scheduled jobs.
- Job name: What is the name of the job & of the notebook that the job executes upon
- Compute resource: What resources powers the notebook’s scheduled job?
- Created by: Who created the job?
- Schedule: When does this job run?
- Next Run: When will this job run next?
- Last Run: When was the last time this job ran?
- Status: Is this job done executing, paused, canceled, or scheduled to execute soon?
You can also edit, pause, resume, or cancel a job from this page. To see when the jobs ran, click on the scheduled job from the list and you will see the date and time of when the job was executed in the past.
Soon you’ll also be able to receive notifications when a scheduled job completes execution. So stay tuned and let us know if you have any feedback. We love to hear from our customers!
For more information, check out our documentation on scheduled jobs.