Instance Monitoring
Each time a job is executed, a job instance record is generated. In the navigation pane of the DataArts Factory console, choose Monitor Instance page, you can view the job instance information and perform more operations on instances as required.
. On theYou can search for instances by Job Name, Created By, Owner, CDM Job, Node Type, and Job Tag. Search by CDM job is to search for job instances by node. In addition, you can filter job instances by status or scheduling mode.
Performing Job Instance Operations
- Log in to the DataArts Studio console by following the instructions in Accessing the DataArts Studio Instance Console.
- On the DataArts Studio console, locate a workspace and click DataArts Factory.
- In the navigation pane on the left of the DataArts Factory page, choose
- You can stop, rerun, or continue to execute instances or make instances succeed. For details, see Table 1.
When multiple instances are rerun in batches, the sequence is as follows:
- If a job does not depend on the previous schedule cycle, multiple instances run concurrently.
- If jobs are dependent on their own, multiple instances are executed in serial mode. The instance that first finishes running in the previous schedule cycle is the first one to rerun.
- Table 1 describes the operations that can be performed on the instance.
Table 1 Instance monitoring operations Operation
Description
Searching for jobs by Job Name, Created By, or Owner
If you select Exact search, exact search by job name is supported.
If you do not select Exact search, fuzzy search by job name is supported.
Filtering jobs by CDM Job, Job Tag, or Node Type
N/A
Stop
Stop an instance that is in the Waiting, Running, or Abnormal state.
Rerun
Rerun a subjob instance that is in the Succeed or Canceled state.
For details, see Rerunning Job Instances.
NOTE:- Manually scheduled jobs cannot be rerun.
- In enterprise mode, developers cannot rerun job instances.
If instances need manual confirmation before they are executed, they are in waiting confirmation state when they are being rerun. When you click Execute, the instances are in waiting execution state.
Manual Retry
Retry abnormal instances.
Continue
Continue to run subsequent nodes in instances which are in abnormal state.
Succeed
Change the statuses of instances in Abnormal, Canceled, or Failed state to Forcibly successful.
Confirm Execution
Confirm executing instances in pending confirmation state.
Execute Job Without Dependency
Select job instances that have dependency relationships and execute them.
More > Manual Retry
Retry abnormal instances.
More > View Waiting Job Instance
When the instance is in the waiting state, you can view the waiting job instance. Click Remove Dependency in the Operation column to remove dependency on an upstream instance.
More > Confirm Execution
Confirm executing instances in pending confirmation state.
More > Continue
If an instance is in the Abnormal state, you can click Continue to begin running the subsequent nodes in the instance.
NOTE:This operation can be performed only when Failure Policy is set to Suspend the current job execution plan. To view the current failure policy, click a node and then click Advanced Settings on the Node Properties page.
More > Succeed
Forcibly change the status of an instance from Abnormal, Canceled, or Failed to Succeed.
More > Execute Job Without Dependency
Execute job instances that have dependency relationships.
More > View
Go to the job development page and view job information.
More > History performance
You can view the historical performance of a job instance.
More > View Rerun History
You can view the job instance rerun records.
This is possible only if the job instance has rerun at least once.
More > Execute Preferentially
Preferentially execute job instances.
DAG
Display the DAG so that you can view the dependency between job instances and perform O&M operations on the DAG.
For details, see Viewing the DAG.
Export All Data
Click Export All Data. In the displayed Export All Data dialog box, click OK. After the export is complete, go to the Download Center page to view the exported data.
If the default storage path is not configured, you can set a storage path and select Set as default OBS path in the Export to OBS dialog box.
A maximum of 30 MB data can be exported. If there are more than 30 MB data, the data will be automatically truncated.
The exported job instances map job nodes. You cannot export data by selecting job names. Instead, you can select the data to be exported by setting filter criteria.
- Click in front of an instance. The running records of all nodes in the instance are displayed.
- Table 2 describes the operations that can be performed on the node.
Table 2 Operations (node) Operation
Description
View Log
View the log information of a node.
You can control access to the test run logs. For example, after user A performs a test, user A can view the test run logs on the Monitor Instance page, but user B cannot.
Manual Retry
Retry a failed node.
Retry an abnormal node.
NOTE:This operation can be performed only when Failure Policy is set to Suspend the current job execution plan. To view the current failure policy, click a node and then click Advanced Settings on the Node Properties page.
Succeed
Change the status of a node from Failed to Succeed.
NOTE:This operation can be performed only when Failure Policy is set to Suspend the current job execution plan. To view the current failure policy, click a node and then click Advanced Settings on the Node Properties page.
More > Skip
To skip a node that is to be run or that has been paused, click Skip.
NOTE:Instance with only one node cannot be skipped. Only instances with multiple nodes can be skipped.
More > Pause
When a job instance is in running state and a node is in waiting execution state, you can pause the node. Subsequent nodes will be blocked.
More > Resume
To resume a paused node, click Resume.
More > History performance
You can view the historical performance of a job node.
Rerunning Job Instances
In enterprise mode, developers cannot rerun job instances.
You can rerun a job instance that is successfully executed or fails to be executed by setting its rerun position.
- Log in to the DataArts Studio console by following the instructions in Accessing the DataArts Studio Instance Console.
- On the DataArts Studio console, locate a workspace and click DataArts Factory.
- In the navigation pane on the left of the DataArts Factory page, choose
- Locate a job and click Rerun in the Operation column to rerun a job instance. Alternatively, select the check boxes to the left of job names and click Rerun above the job list to rerun multiple job instances.
Figure 1 Rerunning a job instance
Figure 2 Rerunning job instances
When rerunning multiple job instances, you only need to set Rerun From, Parameters to Use, and Ignore OBS Listening.
Table 3 Parameters for rerunning a job Parameter
Description
Rerun Type
Type of the instance that you want to rerun.
- Rerun selected instance
- Rerun instances of selected job and its upstream and downstream jobs
Start Time
This parameter is required only when Rerun Type is set to Rerun instances of selected job and its upstream and downstream jobs.
After you set the start time and end time, the system will rerun all the job instances in the specified period.
NOTE:If no job instance can be rerun in the specified period, error message "Job xxx have no instances to rerun" will be displayed.
List of Rerun Job Instances
This parameter is required only when Rerun Type is set to Rerun instances of selected job and its upstream and downstream jobs.
You can select Display the current job and its directly connected jobs or Display complete dependency graphs in the Scheduling-State Job Dependency View dialog box.
The job dependency view is displayed. You can enter a job name to query the job dependency.
Figure 3 Job Dependency page
Select the job to rerun and its upstream and downstream jobs. You can select multiple jobs at a time.
NOTE:If you hover over the question mark on the right of Scheduling-State Job Dependency View, the following information is displayed:
- When you hover your cursor on a job, its upstream and downstream jobs will be marked blue and yellow, respectively.
- Drag the blank area to view the complete relationship graph.
- Click a job in the relationship graph to select all the instances within the duration of the job.
Figure 4 Rerunning all instances
- Right-click the job to view its instances, and select and run them again.
Figure 5 Rerunning some instances
- If no job instance is selected, No instance selected is displayed.
Figure 6 No instance selected
For detailed operations on the job dependency graph, see Batch Job Monitoring: Viewing a Job Dependency Graph.
Rerun From
Start position from which the job instance reruns.
- Error node: When a job instance fails to be run, it reruns since the error node of the job instance.
- The first node: When a job instance fails to be run, it reruns since the first node of the job instance.
- Specified node: When a job instance fails to run, it reruns since the node specified in the job instance. This option is available only if Rerun Type is set to Rerun selected instance.
NOTE:A job instance reruns from its first node if either of the following cases occurs:
- The quantity or name of a node in the job changes.
- The job instance has been successfully run.
Parameters to Use
- Parameters of the original job
- Parameters of the latest job
Concurrent Instances
This parameter is required only when Rerun Type is set to Rerun instances of selected job and its upstream and downstream jobs.
It indicates the number of job instances that can be concurrently processed. The value cannot be less than 1. The default value is 1.
Ignore OBS Listening
The default value is Yes.
- Yes: The system does not listen to the OBS path when rerunning the job instance.
- No: The system listens to the OBS path when rerunning the job instance.
NOTE:
If this parameter is not used, ignore it.
Viewing the DAG
You can view the dependency between job instances and perform O&M operations on the DAG.
- Log in to the DataArts Studio console by following the instructions in Accessing the DataArts Studio Instance Console.
- On the DataArts Studio console, locate a workspace and click DataArts Factory.
- In the navigation pane on the left of the DataArts Factory page, choose
- Locate the row that contains a job and click DAG in the Operation column.
Figure 7 DAG
By default, the DAG displays the current job instance and its upstream and downstream job instances. It supports the following operations:
- Click in the upper right corner of the DAG to restore the DAG to the initial state, and click to close the DAG. Drag in the upper left corner of the DAG to change its width.
- Click a job instance to select it.
Figure 8 Selecting a job instance
- When a job instance is selected, the background colors of the job instance and its upstream and downstream instances are darkened.
- Brief information about the instance is displayed in the lower right corner of the DAG. The instance name and ID can be directly copied.
- Click Show Details to open the details panel, which displays information such as the instance attributes, job parameters, node list, and historical instances. You can adjust the height of the panel or close it.
- Click the blank area to deselect the job instance.
- Right-click a job instance to expand its upstream and downstream job instances. You can stop, rerun, continue to execute instances, forcibly make instances succeed, analyze the upstream node, and edit the job.
Figure 9 Performing operations on job instances
Job Instance Statuses
Status |
Description |
---|---|
Waiting |
A job instance is in waiting state if the execution of its dependent job instances is not complete, for example, no instance has been generated, instances are waiting to be executed, or instances fail to be executed. |
Running |
A job is running. All of its dependent jobs have been executed successfully. |
Successful |
The service logic of a job is successfully executed (including the success of retry upon failure). Successful execution statuses include Successful, Forcibly successful, and Failure ignored. |
Forcibly successful |
A job instance in failed or canceled state is made successful. |
Failure ignored |
As shown in the following figure, a failure handling policy is configured to skip node B and continue to execute node C if node B fails. When the job is executed successfully, the job instance is in Failure ignored state.
Figure 10 Failure handling policy – Go to the next node
|
Abnormal |
There are few scenarios where this status is displayed. As shown in the following figure, a failure handling policy is configured to suspend the job instance immediately without continuing to execute node C. In this case, the job instance is in Abnormal state.
Figure 11 Failure handling policy – Suspend current job execution plan
|
Paused |
There are few scenarios where this status is displayed. When a running job instance is suspended by the test personnel, the instance is in Paused state. |
Canceled |
|
Frozen |
If a job instance is expected to be generated in the future, the job instance is in frozen state after being frozen. |
Failed |
A job fails to be executed. If a job fails to be executed, you can view the failure cause, for example, a node of the job fails to be executed. |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot