Monitoring Pipelines
You had your first look at the Monitoring Page in the Getting Started section. Now let's take a deeper dive and look at each section individually.
Pipeline Runs
This section lists all your pipeline runs, including Review pipelines. Each pipeline is represented by a row. The following information is available for each one.
Name
The name of the pipeline.
Run ID
The ID of the last pipeline run. It consists of the timestamp at which the run is scheduled.
State
The status of the pipeline, which is one of the following:
- Running - the pipeline is currently running
- Success - the pipeline run was successful
- Failed - the pipeline run failed
Start Time
The time the last pipeline run started.
End Time
The time the last pipeline run ended.
CLEAR ALL
The CLEAR ALL button clears the entire pipeline and reruns it. Use this if you want the pipeline to run before its next scheduled run. Clearing does not replace the schedule: the pipeline will still run at the next scheduled time.
REFRESH
The REFRESH button refreshes the data on the entire page. Click it to see whether pipeline runs have updated.
Tasks
Clicking on a pipeline opens all the current model task runs associated with that pipeline in the Tasks section. For each model task, the following information is available along the row:
Name
The name of the model task.
State
The status of the model task, which is one of the following:
- up_for_reschedule - the model task run is being rescheduled for a run
- queued - the model task run is queued to run next
- running - the model task is currently running
- success - the model task run was successful
- up_for_retry - the model task run was not successful and is up for a retry
- upstream_failed - the model task run failed due to a dependency failing
- failed - the model task run failed
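If you handle these state strings in your own scripts, it can help to treat them as a fixed set of values. The following is a minimal sketch, not part of the product: the `TaskState` enum and `is_finished` helper are hypothetical names, and the choice of which states count as "finished" is an assumption based on the descriptions above.

```python
from enum import Enum


class TaskState(str, Enum):
    """Model task states as listed in the Tasks section."""
    UP_FOR_RESCHEDULE = "up_for_reschedule"
    QUEUED = "queued"
    RUNNING = "running"
    SUCCESS = "success"
    UP_FOR_RETRY = "up_for_retry"
    UPSTREAM_FAILED = "upstream_failed"
    FAILED = "failed"


# Assumption: in these states the current run has stopped and will not
# continue on its own; every other state is still pending or in progress.
FINISHED_STATES = {
    TaskState.SUCCESS,
    TaskState.UPSTREAM_FAILED,
    TaskState.FAILED,
}


def is_finished(state: str) -> bool:
    """Return True if the state string names a finished model task run."""
    return TaskState(state) in FINISHED_STATES
```

Passing an unknown string raises a `ValueError`, which makes a misspelled state name visible right away instead of silently treating it as unfinished.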
Start Time
The time the last model task run started.
End Time
The time the last model task run ended.
Duration
The duration of the last model task run in seconds.
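As a rough illustration of how this value relates to the two preceding columns, the sketch below assumes the duration is simply the end time minus the start time. The function name `duration_seconds` is hypothetical and the timestamps are made up for the example.

```python
from datetime import datetime


def duration_seconds(start_time: datetime, end_time: datetime) -> float:
    """Elapsed time between start and end, in seconds (assumed definition)."""
    return (end_time - start_time).total_seconds()


# Example: a task that started at 12:00:00 and ended at 12:02:30.
start = datetime(2024, 1, 1, 12, 0, 0)
end = datetime(2024, 1, 1, 12, 2, 30)
print(duration_seconds(start, end))  # 150.0
```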
Max Tries
The maximum number of tries during the last model task run.
Try Number
The number of the current attempt to run the model task.
Queued When
The timestamp at which the model task is next queued to run.
CLEAR
The CLEAR button clears the model task and reruns it. Use this if you want the model task to run before its next scheduled run. Clearing does not replace the schedule: the model task will still run at the next scheduled time.
FAIL
The FAIL button fails the model task and sets its state to failed. Use this if you want to stop the current task run. Failing a task does not remove it from the schedule: the model task will still run at the next scheduled time.
REFRESH
The REFRESH button refreshes the data on the entire page. Click it to see whether pipeline runs have updated.
Each model task row has a dropdown that reveals validation errors and error logs.
Validation Errors
Displays each of the validation runs tied to the model. To find out more about validation runs, visit the Validations section.
Error Logs
Displays the most recent error logs. Use these to debug an issue.
Duration
The duration graph compares the durations of the latest model tasks of the selected pipeline.
Pipeline Monitoring Header
To ensure that the team is aware of any failures, the Pipeline Monitoring Header appears at the top of every page. It has the following three sections:
Running
Production pipelines that are currently running.
Failed
Production pipelines that have failed.
Success
Production pipelines that ran successfully.
It is important to note that these do not include review pipelines.
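To make the exclusion of review pipelines concrete, here is a minimal sketch of how the three header counts could be derived from a list of pipeline records. It is an illustration only: the `header_counts` function and the `state` and `is_review` fields are hypothetical and do not describe the product's actual data model.

```python
from collections import Counter

HEADER_STATES = ("Running", "Failed", "Success")


def header_counts(pipelines: list[dict]) -> dict[str, int]:
    """Count production pipelines per state, skipping review pipelines."""
    counts = Counter(p["state"] for p in pipelines if not p["is_review"])
    # Counter returns 0 for states that never occurred.
    return {state: counts[state] for state in HEADER_STATES}


# Made-up example data: two production pipelines and one review pipeline.
example = [
    {"name": "daily_sales", "state": "Running", "is_review": False},
    {"name": "weekly_report", "state": "Success", "is_review": False},
    {"name": "review_branch_1", "state": "Failed", "is_review": True},
]
print(header_counts(example))  # {'Running': 1, 'Failed': 0, 'Success': 1}
```

The failed review pipeline does not show up in the Failed count, which matches the note above.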