Obtaining the Details About a Training Job
Sample Code
In a ModelArts notebook, you do not need to enter authentication parameters for session authentication. For details about session authentication in other development environments, see Session Authentication.
- Method 1: Use the specified job_id.
    from modelarts.session import Session
    from modelarts.estimatorV2 import Estimator
    session = Session()
    estimator = Estimator(session=session, job_id="618222c4-dc2f-4cfe-bc49-72b075b7552f")
    job_info = estimator.get_job_info()
    print(job_info)
- Method 2: Use the training job created in Creating a Training Job.
    job_info = job_instance.get_job_info()
    print(job_info)
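
The response is deeply nested, so a small lookup helper can make it easier to pull out individual fields. The following is a minimal sketch that assumes get_job_info() returns a nested dictionary shaped like the response tables on this page. The dig helper and the sample values are hypothetical and are not part of the ModelArts SDK.

```python
from typing import Any, Optional


def dig(data: Any, *keys: str, default: Optional[Any] = None) -> Any:
    """Follow a chain of keys through nested dicts; return default on any miss."""
    for key in keys:
        if not isinstance(data, dict) or key not in data:
            return default
        data = data[key]
    return data


# Hypothetical response, trimmed to a few documented fields.
sample_job_info = {
    "kind": "job",
    "metadata": {"id": "618222c4-dc2f-4cfe-bc49-72b075b7552f", "name": "demo-job"},
    "status": {"phase": "Running", "duration": 61000},
}

job_summary = {
    "id": dig(sample_job_info, "metadata", "id"),
    "name": dig(sample_job_info, "metadata", "name"),
    "phase": dig(sample_job_info, "status", "phase"),
    # spec is absent from the sample, so the default is used instead of raising KeyError.
    "flavor_id": dig(sample_job_info, "spec", "resource", "flavor_id", default="unknown"),
}
print(job_summary)
```

In real use, pass the job_info value returned by either method above in place of sample_job_info.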
Parameters

| Parameter | Mandatory | Type | Description |
|---|---|---|---|
| session | Yes | Object | Session object. For details about the initialization method, see Session Authentication. |
| job_id | Yes | String | ID of a training job. You can obtain job_id using the training job created in Creating a Training Job, for example, job_instance.job_id, or from the response obtained in Obtaining Training Jobs. |

| Parameter | Type | Description |
|---|---|---|
| kind | String | Training job type, which defaults to job. Options: |
| metadata | JobMetadata object | Metadata of a training job. |
| status | Status object | Status of a training job. When creating a training job, you do not need to set this parameter. |
| algorithm | JobAlgorithmResponse object | Algorithm used by a training job. The following formats are supported: |
| tasks | Array of TaskResponse objects | Tasks of a heterogeneous training job. |
| spec | spec object | Specifications of a training job. |

| Parameter | Type | Description |
|---|---|---|
| id | String | Training job ID, which is generated and returned by ModelArts after a training job is created. |
| name | String | Name of a training job. The value must contain 1 to 64 characters consisting of only digits, letters, underscores (_), and hyphens (-). |
| workspace_id | String | Workspace where a training job is deployed. Default value: 0 |
| description | String | Description of a training job, which defaults to NULL. The value must contain 0 to 256 characters. |
| create_time | Long | Time when a training job was created, in milliseconds. The value is generated and returned by ModelArts after a training job is created. |
| user_name | String | Username for creating a training job. The username is generated and returned by ModelArts after a training job is created. |
| annotations | Map<String,String> | Declaration template of a training job. For heterogeneous jobs, the default value of job_template is Template RL. For other jobs, the default value is Template DL. |
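
The create_time value is a millisecond count, so it usually needs converting before display. The sketch below assumes the value counts milliseconds since the Unix epoch (1970-01-01 UTC), which this page does not state explicitly. The helper name and the sample value are hypothetical.

```python
from datetime import datetime, timedelta, timezone

# Assumption: create_time counts milliseconds since the Unix epoch.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def ms_to_utc(ms: int) -> datetime:
    """Convert a millisecond timestamp to a timezone-aware UTC datetime."""
    # timedelta arithmetic avoids float rounding on the millisecond part.
    return EPOCH + timedelta(milliseconds=ms)


# Hypothetical metadata fragment.
sample_metadata = {"name": "demo-job", "create_time": 1636515222282}

created_at = ms_to_utc(sample_metadata["create_time"])
print(created_at.isoformat())
```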

| Parameter | Type | Description |
|---|---|---|
| phase | String | Level-1 status of a training job. The value will remain unchanged. Options: Creating, Pending, Running, Failed, Completed, Terminating, Terminated, and Abnormal |
| secondary_phase | String | Level-2 status of a training job. The value can be changed. Options: Creating, Queuing, Running, Failed, Completed, Terminating, Terminated, CreateFailed, TerminatedFailed, Unknown, and Lost |
| duration | Long | Running duration of a training job, in milliseconds |
| node_count_metrics | Array<Array<Integer>> | Node count changes during the runtime of a training job |
| tasks | Array of strings | Task of a training job |
| start_time | String | Start time of a training job. The value is in timestamp format. |
| task_statuses | Array of objects | Status of a training job task |
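
A common use of the status object is deciding whether a job has stopped and reporting how long it ran. The sketch below treats Completed, Failed, Terminated, and Abnormal as final phases. That grouping is an assumption drawn from the phase names, not something this page states. The helper names and sample values are hypothetical.

```python
# Assumption: these level-1 phases mean the job will not progress any further.
TERMINAL_PHASES = {"Completed", "Failed", "Terminated", "Abnormal"}


def is_finished(status: dict) -> bool:
    """Return True if the level-1 phase is one of the assumed final phases."""
    return status.get("phase") in TERMINAL_PHASES


def format_duration(ms: int) -> str:
    """Render a millisecond duration as HH:MM:SS, dropping sub-second precision."""
    seconds = ms // 1000
    hours, remainder = divmod(seconds, 3600)
    minutes, secs = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


# Hypothetical status fragment.
sample_status = {"phase": "Running", "secondary_phase": "Running", "duration": 3723000}

print(is_finished(sample_status), format_duration(sample_status["duration"]))
```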

| Parameter | Type | Description |
|---|---|---|
| task | String | Task of a training job |
| exit_code | Integer | Exit code of a training job task |
| message | String | Error message of a training job task |
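
When a job fails, the task_statuses entries show which task exited abnormally. The sketch below assumes the usual process convention that a nonzero exit_code indicates failure. This page does not define the meaning of individual codes. The task names and message text are hypothetical.

```python
from typing import Dict, List


def failed_tasks(task_statuses: List[Dict]) -> List[Dict]:
    """Return the entries whose exit_code is nonzero (assumed to mean failure)."""
    return [entry for entry in task_statuses if entry.get("exit_code", 0) != 0]


# Hypothetical task_statuses array.
sample_task_statuses = [
    {"task": "worker-0", "exit_code": 0, "message": ""},
    {"task": "worker-1", "exit_code": 137, "message": "example failure text"},
]

for entry in failed_tasks(sample_task_statuses):
    print(f"{entry['task']}: exit code {entry['exit_code']} ({entry['message']})")
```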

| Parameter | Type | Description |
|---|---|---|
| id | String | Algorithm ID. Options: |
| name | String | Algorithm name |
| subscription_id | String | Subscription ID of the subscribed algorithm, which must be used with item_version_id |
| item_version_id | String | Version ID of the subscribed algorithm, which must be used with subscription_id |
| code_dir | String | Code directory of a training job, for example, /usr/app/. This parameter must be used with boot_file. Leave this parameter blank if id, or subscription_id and item_version_id are specified. |
| boot_file | String | Boot file of a training job, which must be stored in the code directory, for example, /usr/app/boot.py. This parameter must be used with code_dir. Leave this parameter blank if id, or subscription_id and item_version_id are specified. |
| autosearch_config_path | String | YAML configuration path of an auto search job. An OBS URL is required. |
| autosearch_framework_path | String | Framework code directory of an auto search job. An OBS URL is required. |
| command | String | Boot command for starting the container of the custom image used for creating a training job. The value of this parameter can be the same as the code_dir value. |
| parameters | Array of Parameter objects | Running parameters of a training job. |
| policies | policies object | Policies supported by a training job. |
| inputs | Array of Input objects | Input of a training job. |
| outputs | Array of Output objects | Output of a training job. |
| engine | engine object | Engine of a training job. Leave this parameter blank if the job is created using id of the algorithm in algorithm management, or subscription_id and item_version_id of the subscribed algorithm. |
| environments | Map<String,String> | Environment variables of a training job in the format of "key":"value". Leave this parameter blank. |
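
The algorithm table describes three mutually exclusive ways of identifying the code to run: id alone, subscription_id with item_version_id, or code_dir with boot_file. The sketch below shows one way to tell which of these a returned algorithm object uses. The function name and the category labels are hypothetical, and cases such as a custom image started through command are not covered.

```python
def algorithm_source(algorithm: dict) -> str:
    """Classify an algorithm object by which identifying fields are filled in."""
    if algorithm.get("id"):
        return "managed algorithm"
    if algorithm.get("subscription_id") and algorithm.get("item_version_id"):
        return "subscribed algorithm"
    if algorithm.get("code_dir") and algorithm.get("boot_file"):
        return "custom code"
    # Anything else falls outside the three combinations named in the table.
    return "unknown"


# Hypothetical algorithm fragment using the example paths from the table.
sample_algorithm = {"code_dir": "/usr/app/", "boot_file": "/usr/app/boot.py"}

print(algorithm_source(sample_algorithm))
```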

| Parameter | Type | Description |
|---|---|---|
| name | String | Parameter name |
| value | String | Parameter value |
| description | String | Parameter description |
| constraint | constraint object | Parameter constraint |
| i18n_description | i18n_description object | Internationalization description |

| Parameter | Type | Description |
|---|---|---|
| type | String | Parameter type |
| editable | Boolean | Whether the parameter is editable |
| required | Boolean | Whether the parameter is mandatory |
| sensitive | Boolean | Whether the parameter is sensitive |
| valid_type | String | Valid type |
| valid_range | Array of strings | Valid range |

| Parameter | Type | Description |
|---|---|---|
| language | String | Internationalization language |
| description | String | Description |

| Parameter | Type | Description |
|---|---|---|
| auto_search | auto_search object | Hyperparameter search configuration |

| Parameter | Type | Description |
|---|---|---|
| skip_search_params | String | Hyperparameter parameters that need to be skipped |
| reward_attrs | Array of objects | Search metrics |
| search_params | Array of objects | Search parameters |
| algo_configs | Array of objects | Search algorithm configurations |

| Parameter | Type | Description |
|---|---|---|
| name | String | Metric name |
| mode | String | Search mode |
| regex | String | Regular expression of a metric |

| Parameter | Type | Description |
|---|---|---|
| name | String | Hyperparameter name |
| param_type | String | Parameter type |
| lower_bound | String | Lower bound of the hyperparameter |
| upper_bound | String | Upper bound of the hyperparameter |
| discrete_points_num | String | Number of discrete points of a hyperparameter with continuous values |
| discrete_values | Array of strings | Discrete hyperparameter values |

| Parameter | Type | Description |
|---|---|---|
| name | String | Name of the search algorithm |
| params | Array of AutoSearchAlgoConfigParameter objects | Search algorithm parameters |

| Parameter | Type | Description |
|---|---|---|
| key | String | Parameter key |
| value | String | Parameter value |
| type | String | Parameter type |

| Parameter | Type | Description |
|---|---|---|
| name | String | Name of the data input channel |
| description | String | Description of the data input channel |
| local_dir | String | Local directory of the container to which the data input channel is mapped |
| remote | InputDataInfo object | Information of the data input |
| remote_constraint | Array of objects | Data input constraint |

| Parameter | Type | Description |
|---|---|---|
| dataset | dataset object | Dataset as the data input |
| obs | obs object | OBS in which data input and output are stored |

| Parameter | Type | Description |
|---|---|---|
| id | String | Dataset ID of a training job |
| version_id | String | Dataset version ID of a training job |
| obs_url | String | OBS URL of the dataset for a training job, which is automatically parsed by ModelArts based on the dataset ID and dataset version IDs, for example, /usr/data/ |

| Parameter | Type | Description |
|---|---|---|
| obs_url | String | OBS URL of the dataset for a training job, for example, /usr/data/ |

| Parameter | Type | Description |
|---|---|---|
| data_type | String | Data input type, including the data storage location and dataset |
| attributes | String | Attributes when a dataset functions as the data input. Options: |

| Parameter | Type | Description |
|---|---|---|
| name | String | Name of the data output channel |
| description | String | Description of the data output channel |
| local_dir | String | Local directory of the container to which the data output channel is mapped |
| remote | remote object | Information of the data output |

| Parameter | Type | Description |
|---|---|---|
| obs_url | String | OBS URL to which data is exported |

| Parameter | Type | Description |
|---|---|---|
| engine_id | String | Engine ID selected for a training job, which can be engine_id, engine_name and engine_version, or image_url |
| engine_name | String | Name of the engine selected for a training job. Leave this parameter blank if engine_id is specified. |
| engine_version | String | Version of the engine selected for a training job. Leave this parameter blank if engine_id is specified. |
| image_url | String | Custom image URL selected for a training job |

| Parameter | Type | Description |
|---|---|---|
| role | String | Role of a heterogeneous training job task. Options: |
| algorithm | algorithm object | Algorithm configuration |
| task_resource | FlavorResponse object | Flavors for a training job or an algorithm |

| Parameter | Type | Description |
|---|---|---|
| code_dir | String | Absolute path of the directory where the algorithm boot file is stored |
| boot_file | String | Absolute path of the algorithm boot file |
| inputs | inputs object | Algorithm input channel |
| outputs | outputs object | Algorithm output channel |
| engine | engine object | Engine on which a heterogeneous job depends |

| Parameter | Type | Description |
|---|---|---|
| name | String | Name of the data input channel |
| local_dir | String | Local path of the container to which the data input and output channels are mapped |
| remote | remote object | Actual data input, which can only be OBS for heterogeneous jobs |

| Parameter | Type | Description |
|---|---|---|
| obs | obs object | OBS in which data input and output are stored |

| Parameter | Type | Description |
|---|---|---|
| obs_url | String | OBS URL of the dataset for a training job, for example, /usr/data/ |

| Parameter | Type | Description |
|---|---|---|
| name | String | Name of the data output channel |
| local_dir | String | Local directory of the container to which the data output channel is mapped |
| remote | remote object | Information of the data output |
| mode | String | Data transmission mode, which defaults to upload_periodically |
| period | String | Data transmission period, which defaults to 30s |

| Parameter | Type | Description |
|---|---|---|
| obs | obs object | OBS to which data is exported |

| Parameter | Type | Description |
|---|---|---|
| obs_url | String | OBS URL to which data is exported |

| Parameter | Type | Description |
|---|---|---|
| engine_id | String | Engine ID of a heterogeneous job, for example, caffe-1.0.0-python2.7 |
| engine_name | String | Engine name of a heterogeneous job, for example, Caffe |
| engine_version | String | Engine version of a heterogeneous job |
| v1_compatible | Boolean | Whether v1 is compatible |
| run_user | String | User UID for which the engine is started by default |

| Parameter | Type | Description |
|---|---|---|
| flavor_id | String | ID of the resource flavor |
| flavor_name | String | Name of the resource flavor |
| max_num | Integer | Maximum number of nodes with the resource flavor |
| flavor_type | String | Resource flavor type. Options: |
| billing | billing object | Billing information of a resource flavor |
| flavor_info | flavor_info object | Resource flavor details |
| attributes | Map<String,String> | Other flavor attributes |

| Parameter | Type | Description |
|---|---|---|
| code | String | Billing code |
| unit_num | Integer | Number of billing units |

| Parameter | Type | Description |
|---|---|---|
| max_num | Integer | Maximum number of nodes that can be selected. Value 1 indicates that the distributed mode is not supported. |
| cpu | cpu object | CPU specifications |
| gpu | gpu object | GPU specifications |
| npu | npu object | Ascend specifications |
| memory | memory object | Memory information |

| Parameter | Type | Description |
|---|---|---|
| arch | String | CPU architecture |
| core_num | Integer | Number of cores |

| Parameter | Type | Description |
|---|---|---|
| unit_num | Integer | Number of GPUs |
| product_name | String | Product name |
| memory | String | Memory |

| Parameter | Type | Description |
|---|---|---|
| unit_num | String | Number of NPUs |
| product_name | String | Product name |
| memory | String | Memory |

| Parameter | Type | Description |
|---|---|---|
| size | Integer | Memory size |
| unit | String | Number of memory units |

| Parameter | Type | Description |
|---|---|---|
| resource | Resource object | Resource flavors of a training job, which can either be flavor_id or pool_id and flavor_id |
| volumes | Array of objects | Volumes attached for a training job |
| log_export_path | log_export_path object | Export path of training job logs |

| Parameter | Type | Description |
|---|---|---|
| policy | String | Resource flavor mode of a training job. Options: regular, economic, and turbo |
| flavor_id | String | Resource flavor ID of a training job |
| flavor_name | String | Read-only flavor name returned by ModelArts when flavor_id is specified |
| node_count | Integer | Number of resource replicas selected for a training job. Minimum value: 1 |
| pool_id | String | Resource pool ID selected for a training job |
| flavor_detail | flavor_detail object | Flavors for a training job or an algorithm |

| Parameter | Type | Description |
|---|---|---|
| flavor_type | String | Resource flavor type. Options: |
| billing | billing object | Billing information of a resource flavor |
| flavor_info | flavor_info object | Resource flavor details |

| Parameter | Type | Description |
|---|---|---|
| code | String | Billing code |
| unit_num | Integer | Number of billing units |

| Parameter | Type | Description |
|---|---|---|
| max_num | Integer | Maximum number of nodes that can be selected. Value 1 indicates that the distributed mode is not supported. |
| cpu | cpu object | CPU specifications |
| gpu | gpu object | GPU specifications |
| npu | npu object | Ascend specifications |
| memory | memory object | Memory information |
| disk | disk object | Disk information |

| Parameter | Type | Description |
|---|---|---|
| arch | String | CPU architecture |
| core_num | Integer | Number of cores |

| Parameter | Type | Description |
|---|---|---|
| unit_num | Integer | Number of GPUs |
| product_name | String | Product name |
| memory | String | Memory |

| Parameter | Type | Description |
|---|---|---|
| unit_num | String | Number of NPUs |
| product_name | String | Product name |
| memory | String | Memory |

| Parameter | Type | Description |
|---|---|---|
| size | Integer | Memory size |
| unit | String | Number of memory units |

| Parameter | Type | Description |
|---|---|---|
| size | String | Disk size |
| unit | String | Unit of the disk size, which is GB generally |

| Parameter | Type | Description |
|---|---|---|
| nfs | nfs object | Disks attached in NFS mode |

| Parameter | Type | Description |
|---|---|---|
| nfs_server_path | String | NFS server path |
| local_path | String | Path for attaching disks to the training container |
| read_only | Boolean | Whether the disks attached to the container in NFS mode are read-only |

| Parameter | Type | Description |
|---|---|---|
| obs_url | String | OBS URL for storing training job logs |
| host_path | String | Path of the host where training job logs are stored |

| Parameter | Type | Description |
|---|---|---|
| error_msg | String | Error message returned when an API call fails. This parameter is not returned if the call succeeds. |
| error_code | String | Error code returned when an API call fails. For details, see "Error Codes" in ModelArts API Reference. This parameter is not returned if the call succeeds. |
| error_solution | String | Suggested solution for a failed API call. This parameter is not returned if the call succeeds. |
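
Because the error fields appear only when a call fails, their presence can serve as a failure check. The sketch below assumes that a failed call yields a dictionary containing these fields. The SDK may instead raise an exception, which this page does not specify. The function name and the sample values, including the error code, are placeholders.

```python
from typing import Optional


def describe_error(response: dict) -> Optional[str]:
    """Return a one-line error summary, or None if no error fields are present."""
    if "error_code" not in response and "error_msg" not in response:
        return None
    code = response.get("error_code", "unknown code")
    message = response.get("error_msg", "no message")
    solution = response.get("error_solution")
    summary = f"{code}: {message}"
    if solution:
        summary += f" (suggested fix: {solution})"
    return summary


# Placeholder failure response; the code and texts are not real ModelArts values.
sample_failure = {
    "error_code": "EXAMPLE.0001",
    "error_msg": "example message",
    "error_solution": "example solution",
}

print(describe_error(sample_failure))
print(describe_error({"kind": "job"}))
```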