Help Center/
ModelArts/
ModelArts User Guide (Standard)/
Model Training/
High Model Training Reliability/
Training Job Rescheduling
Updated on 2024-10-29 GMT+08:00
Training Job Rescheduling
When a training job fault occurs (such as process-level recovery, POD-level rescheduling, and job-level rescheduling), the Fault Recovery Details tab appears on the job details page, recording the start and stop details of the training job.
- On the ModelArts console, choose Model Training > Training Jobs from the navigation pane.
- In the training job list, click the name of the target job to go to the training job details page.
- On the training job details page, click the Fault Recovery Details tab to view the fault recovery information.
Figure 1 Viewing fault recovery details
Parent topic: High Model Training Reliability
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
The system is busy. Please try again later.
For any further questions, feel free to contact us through the chatbot.
Chatbot