Obtaining the Runtime Metrics of a Training Job
Sample Code
In ModelArts notebook, you do not need to enter authentication parameters for session authentication. For details about session authentication of other development environments, see Session Authentication.
- Method 1: Use the specified job_id.
from modelarts.session import Session from modelarts.estimatorV2 import Estimator session = Session() estimator = Estimator(session=session, job_id="your job id") info = estimator.get_job_metrics() print(info)
- Method 2: Use the training job created in Creating a Training Job.
info = job_instance.get_job_metrics(task_id="worker-0") print(info)
Parameters
Parameter |
Mandatory |
Type |
Description |
---|---|---|---|
session |
Yes |
Object |
Session object. For details about the initialization method, see Session Authentication. |
job_id |
Yes |
String |
ID of a training job. You can obtain job_id using the training job created in Creating a Training Job, for example, job_instance.job_id, or from the response obtained in Obtaining Training Jobs. |
Parameter |
Mandatory |
Type |
Description |
---|---|---|---|
task_id |
No |
String |
ID of a worker node for obtaining logs. It defaults to worker-0. If train_instance_count is set to 2 when you create a training job, the value of this parameter can be worker-0 or worker-1. |
Parameter |
Type |
Description |
---|---|---|
metrics |
Array of objects |
Runtime metrics |
Parameter |
Type |
Description |
---|---|---|
metric |
String |
Runtime metric. The value can be cpuUsage (CPU usage), memUsage (physical memory usage), gpuUtil (GPU usage), gpuMemUsage (GPU memory usage), npuUtil (NPU usage), or npuMemUsage (NPU memory usage). |
value |
Array of numbers |
Value of a runtime metric. An average value is collected every minute. |
Parameter |
Type |
Description |
---|---|---|
error_msg |
String |
Error message when calling an API failed. This parameter is unavailable if an API is successfully called. |
error_code |
String |
Error code when calling an API failed. For details, see "Error Codes" in ModelArts API Reference. This parameter is unavailable if an API is successfully called. |
error_solution |
String |
Solution to an API calling failure. This parameter is unavailable if an API is successfully called. |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.