Updated on 2024-10-29 GMT+08:00

Viewing Training Job Details

  1. Log in to the ModelArts console.
  2. In the navigation pane, choose Model Training > Training Jobs.
  3. In the training job list, click the target job name to switch to the training job details page.
  4. On the left side of the training job details page, view basic job settings and algorithm parameters.
    • Basic job settings
      Table 1 Basic job settings

      Parameter

      Description

      Job ID

      Unique ID of the training job.

      Status

      Status of the training job.

      Created

      Time when the training job is created.

      Duration

      Running duration of the training job.

      Retries

      Number of times that the training job automatically restarts upon a fault during training. This parameter is available only when Auto Restart is enabled during training job creation.

      Description

      Description of the training job.

      You can click the edit icon to update the description of a training job.

      Job Priority

      Priority of the training job.

    • Algorithm parameters
      Table 2 Algorithm parameters

      Parameter

      Description

      Algorithm Name

      Algorithm used in the training job. You can click the algorithm name to go to the algorithm details page.

      Preset images

      Preset image used by the training job This parameter is available only for training jobs created using a preset image.

      Custom image

      Custom image used by the training job. This parameter is available only for training jobs created using a custom image.

      Code Directory

      OBS path to the code directory of the training job.

      You can click Edit Code on the right to edit the training script code in OBS Online Editor. OBS Online Editor is not available for a training job in the Pending, Creating, or Running status.

      NOTE:

      This parameter is not supported when you use a subscribed algorithm to create a training job.

      Boot File

      Location where the training boot file is stored.

      NOTE:

      This parameter is not supported when you use a subscribed algorithm to create a training job.

      User ID

      ID of the user who runs the container.

      Local Code Directory

      Path to the training code in the training container.

      Work Directory

      Path to the training boot file in the training container.

      Compute Nodes

      Number of compute nodes set for the training job.

      Dedicated resource pool

      Dedicated resource pool information. This parameter is available only when a training job uses a dedicated resource pool.

      Specifications

      Training specifications used by the training job.

      Input > Input Path

      OBS path where the input data is stored.

      Input > Parameter Name

      Input path parameter specified in the algorithm code.

      Input > Obtained from

      Method of obtaining the training job input.

      Input > Local Path (Training Parameter Value)

      Path for storing the input data in the ModelArts backend container. After the training is started, ModelArts downloads the data stored in OBS to the backend container.

      Output > Output Path

      OBS path where the output data is stored.

      Output > Parameter Name

      Output path parameter specified in the algorithm code.

      Output > Obtained from

      Method of obtaining the training job output.

      Output > Local Path (Training Parameter Value)

      Path for storing the output data in the ModelArts backend container.

      Hyperparameter

      Hyperparameters used in the training job.

      Environment Variable

      Environment variables for the training job.