Updated on 2024-04-30 GMT+08:00

ModelArts Metrics

Description

The cloud service platform provides Cloud Eye to help you better understand the status of your ModelArts real-time services and models. You can use Cloud Eye to automatically monitor your ModelArts real-time services and model loads in real time and manage alarms and notifications so that you can obtain the performance metrics of ModelArts and models.

Namespace

SYS.ModelArts

Monitoring Metrics

Table 1 ModelArts metrics

Metric ID

Metric Name

Description

Value Range

Monitored Entity

Monitoring Interval

cpu_usage

CPU Usage

CPU usage of ModelArts

Unit: %

≥ 0%

ModelArts model loads

1 minute

mem_usage

Memory Usage

Memory usage of ModelArts

Unit: %

≥ 0%

ModelArts model loads

1 minute

gpu_util

GPU Usage

GPU usage of ModelArts

Unit: %

≥ 0%

ModelArts model loads

1 minute

gpu_mem_usage

GPU Memory Usage

GPU memory usage of ModelArts

Unit: %

≥ 0%

ModelArts model loads

1 minute

npu_util

NPU Usage

NPU usage of ModelArts

Unit: %

≥ 0%

ModelArts model loads

1 minute

npu_mem_usage

NPU Memory Usage

NPU memory usage of ModelArts

Unit: %

≥ 0%

ModelArts model loads

1 minute

successfully_called_times

Number of Successful Calls

Times that ModelArts has been successfully called

Unit: times/minute

≥ counts/minute

ModelArts models

ModelArts real-time services

1 minute

failed_called_times

Number of Failed Calls

Times that ModelArts failed to be called

Unit: times/minute

≥ counts/minute

ModelArts models

ModelArts real-time services

1 minute

total_called_times

Total Calls

Times that ModelArts is called

Unit: times/minute

≥ counts/minute

ModelArts model loads

ModelArts real-time services

1 minute

disk_read_rate

Disk Read Rate

Disk read rate of ModelArts

Unit: bit/minute

≥ bit/minute

ModelArts model loads

1 minute

disk_write_rate

Disk Write Rate

Disk write rate of ModelArts

Unit: bit/minute

≥ bit/minute

ModelArts model loads

1 minute

send_bytes_rate

Uplink rate

Outbound traffic rate of ModelArts

Unit: bit/minute

≥ bit/minute

ModelArts model loads

1 minute

recv_bytes_rate

Downlink rate

Inbound traffic rate of ModelArts

≥ bit/minute

ModelArts model loads

1 minute

req_count_2xx

2xx Responses

Number of times that the API returns a 2xx response

≥ counts/minute

ModelArts real-time services

1 minute

req_count_4xx

4xx Errors

Number of times that the API returns a 4xx error

≥ counts/minute

ModelArts real-time services

1 minute

req_count_5xx

5xx Errors

Number of times that the API returns a 5xx error

≥ counts/minute

ModelArts real-time services

1 minute

avg_latency

Average Latency

Average latency of the API

≥ ms

ModelArts real-time services

1 minute

If a measurement object has multiple measurement dimensions, all the measurement dimensions are mandatory when you use an API to query monitoring metrics.

  • The following provides an example of using the multi-dimensional dim to query a single monitoring metric: dim.0=service_id,530cd6b0-86d7-4818-837f-935f6a27414d&dim.1="model_id,3773b058-5b4f-4366-9035-9bbd9964714a
  • The following provides an example of using the multi-dimensional dim to query monitoring metrics in batches:

    "dimensions": [

    {

    "name": "service_id",

    "value": "530cd6b0-86d7-4818-837f-935f6a27414d"

    }

    {

    "name": "model_id",

    "value": "3773b058-5b4f-4366-9035-9bbd9964714a"

    }

    ]

Dimensions

Table 2 Dimension description

Key

Value

service_id

Real-time service ID

model_id

Model ID