Updated on 2025-03-31 GMT+08:00

iMetal Metrics

This section describes out-of-band monitoring metrics of iMetal servers.

iMetal Server Hardware Monitoring Metrics

Table 1 iMetal server hardware monitoring metrics

Metric Name

Metric

Description

Input Power

power_input_watts

Input power of the power supply

Output Power

power_output_watts

Output power of the power supply

Component Temperature

device_temperature

Temperature of the component

Server Health

host_health

The health of the server

CPU Health

cpu_health

The health of the CPU

Memory Health

memory_health

The health of the memory

Disk Health

disk_health

The health of the disk

Power Supply Health

power_health

The health of the power supply

Network Interface Health

nic_health

The health of the network interface

Fan Health

fan_health

The health of the fan

iMetal Server Alarm Trend Metrics

Table 2 iMetal server alarm trend metrics

Metric

Description

host

Collects the number of alarms generated for the entire system at a specified time. The value is the same as the number of alarms whose dimension is host_health.

type_cpu

Collects the number of alarms generated for a processor at a specified time. The value is the same as the number of alarms whose dimension is cpu_health.

type_memory

Collects the number of alarms generated for the memory at a specified time. The value is the same as the number of alarms whose dimension is memory_health.

type_disk

Collects the number of alarms generated for the disks at a specified time. The value is the same as the number of alarms whose dimension is disk_health.

type_power

Collects the number of alarms generated for the power supply at a specified time. The value is the same as the number of alarms whose dimension is power_health.

type_fan

Collects the number of alarms generated for the fans at a specified time. The value is the same as the number of alarms whose dimension is fan_health.

type_nic

Collects the number of alarms generated for the network interfaces at a specified time. The value is the same as the number of alarms whose dimension is nic_health.

level_critical

Collects the number of critical alarms generated at a specified time. The value is the same as the number of critical alarms in the alarms.

level_major

Collects the number of major alarms generated at a specified time. The value is the same as the number of major alarms in the alarms.

iRack Monitoring Metrics

Table 3 iRack monitoring metrics

Metric

Description

rack_power

Indicates the power of a rack.

rack_temp

Indicates the temperature of a rack.