Updated on 2024-08-05 GMT+08:00

Container Metrics and Dimensions

Table 1 Container metrics

Metric

Description

Value Range

Unit

Total CPU cores (aom_container_cpu_limit_core)

Total number of CPU cores restricted for a measured object

≥ 1

Cores

Used CPU cores (aom_container_cpu_used_core)

Number of CPU cores used by a measured object

≥ 0

Cores

CPU usage (aom_container_cpu_usage)

CPU usage of a measured object. That is, the percentage of the used CPU cores to the total CPU cores restricted for a measured object.

0–100

%

Disk read rate (aom_container_disk_read_kilobytes)

Volume of data read from a disk per second

≥ 0

KB/s

Disk write rate (aom_container_disk_write_kilobytes)

Volume of data written into a disk per second

≥ 0

KB/s

Available file system capacity (aom_container_filesystem_available_capacity_megabytes)

Available file system capacity of a measured object. This metric is available only for containers using the Device Mapper storage drive in the Kubernetes cluster of version 1.11 or later.

≥ 0

MB

Total file system capability (aom_container_filesystem_capacity_megabytes)

Total file system capacity of a measured object. This metric is available only for containers using the Device Mapper storage drive in the Kubernetes cluster of version 1.11 or later.

≥ 0

MB

File system usage (aom_container_filesystem_usage)

File system usage of a measured object. That is, the percentage of the used file system to the total file system. This metric is available only for containers using the Device Mapper storage drive in the Kubernetes cluster of version 1.11 or later.

0–100

%

Total GPU memory (aom_container_gpu_memory_free_megabytes)

Total GPU memory of a measured object

> 0

MB

GPU memory usage (aom_container_gpu_memory_usage)

Percentage of the used GPU memory to the total GPU memory

0–100

%

Used GPU memory (aom_container_gpu_memory_used_megabytes)

GPU memory used by a measured object

≥ 0

MB

GPU usage (aom_container_gpu_usage)

GPU usage of a measured object

0–100

%

Total NPU memory (aom_container_npu_memory_free_megabytes)

Total NPU memory of a measured object

> 0

MB

NPU memory usage (aom_container_npu_memory_usage)

Percentage of the used NPU memory to the total NPU memory

0–100

%

Used NPU memory (aom_container_npu_memory_used_megabytes)

NPU memory used by a measured object

≥ 0

MB

NPU usage (aom_container_npu_usage)

NPU usage of a measured object

0–100

%

Total physical memory (aom_container_memory_request_megabytes)

Total physical memory restricted for a measured object

≥ 0

MB

Physical memory usage (aom_container_memory_usage)

Percentage of the used physical memory to the total physical memory restricted for a measured object

0–100

%

Used physical memory (aom_container_memory_used_megabytes)

Used physical memory of a measured object

≥ 0

MB

Downlink rate (BPS) (aom_container_network_receive_bytes)

Inbound traffic rate of a measured object

≥ 0

Byte/s

Downlink rate (PPS) (aom_container_network_receive_packets)

Number of data packets received by a NIC per second

≥ 0

Packet/s

Downlink error rate (aom_container_network_receive_error_packets)

Number of error packets received by a NIC per second

≥ 0

Count/s

Error packets (aom_container_network_rx_error_packets)

Number of error packets received by a measured object

≥ 0

Count

Uplink rate (BPS) (aom_container_network_transmit_bytes)

Outbound traffic rate of a measured object

≥ 0

Byte/s

Uplink error rate (aom_container_network_transmit_error_packets)

Number of error packets sent by a NIC per second

≥ 0

Count/s

Uplink rate (PPS) (aom_container_network_transmit_packets)

Number of data packets sent by a NIC per second

≥ 0

Packet/s

Status (aom_process_status)

Docker container status

0 or 1

  • 0: Normal
  • 1: Abnormal

N/A

Working set memory usage (aom_container_memory_workingset_usage)

Usage of the working set memory

0–100

%

Used working set memory (aom_container_memory_workingset_used_megabytes)

Sum of resident set size (RSS) memory and cache

≥ 0

MB

Table 2 Dimensions of container metrics

Dimension

Description

appID

Service ID

appName

Service name

clusterId

Cluster ID

clusterName

Cluster name

containerID

Container ID

containerName

Container name

deploymentName

Kubernetes deployment name

kind

Application type

nameSpace

Cluster namespace

podID

Instance ID

podName

Instance name

serviceID

Inventory ID

gpuID

GPU ID

npuName

NPU name

npuID

NPU ID