Help Center> Cloud Container Engine> User Guide> New Console> Monitoring, Logs, and Alarms> Monitoring Overview

Monitoring Overview

CCE works with AOM to comprehensively monitor clusters. When a node is created, the ICAgent (the DaemonSet named icagent in the kube-system namespace of the cluster) of AOM is installed by default. The ICAgent collects monitoring data of underlying resources and workloads running on the cluster. It also collects monitoring data of custom metrics of the workload.

Resource metrics
Basic resource monitoring includes CPU, memory, and disk monitoring. For details, see Resource Metrics. You can view these metrics of clusters, nodes, and workloads on the CCE or AOM console.
Custom metrics
The ICAgent collects custom metrics of applications and uploads them to AOM. For details, see Custom Monitoring.

In addition, you can install the Prometheus add-on in a cluster and use Prometheus to collect and display monitoring data. For details, see Monitoring by Using the prometheus Add-on.

Resource Metrics

**Table 1** Resource metrics
Metric	Description
CPU Allocation Rate	Indicates the percentage of CPUs allocated to workloads.
Memory Allocation Rate	Indicates the percentage of memory allocated to workloads.
CPU Usage	Indicates the CPU usage.
Memory Usage	Indicates the memory usage.
Disk Usage	Indicates the disk usage.
Down	Indicates the speed at which data is downloaded to a node. The unit is KB/s.
Up	Indicates the speed at which data is uploaded from a node. The unit is KB/s.
Disk Read Rate	Indicates the data volume read from a disk per second. The unit is KB/s.
Disk Write Rate	Indicates the data volume written to a disk per second. The unit is KB/s.

Viewing Cluster Monitoring Data

Access the cluster details page. In the navigation pane, choose Cluster Information. In the right pane, you can view the CPU and memory usage of all nodes (excluding master nodes) in the cluster in the last hour.

The cluster monitoring page displays the monitoring status of cluster resources, CPU, memory, and disk usage of all nodes in a cluster, and CPU and memory allocation rates.

Explanation of monitoring metrics:

CPU allocation rate = Sum of CPU quotas requested by pods in the cluster/Sum of CPU quotas that can be allocated of all nodes (excluding master nodes) in the cluster
Memory allocation rate = Sum of memory quotas requested by pods in the cluster/Sum of memory quotas that can be allocated of all nodes (excluding master nodes) in the cluster
CPU usage: Average CPU usage of all nodes (excluding master nodes) in a cluster
Memory usage: Average memory usage of all nodes (excluding master nodes) in a cluster

Allocatable node resources (CPU or memory) = Total amount – Reserved amount – Eviction thresholds. For details, see Formula for Calculating the Reserved Resources of a Node.

CCE provides the status, availability zone (AZ), CPU usage, and memory usage of master nodes.

Viewing Monitoring Data of Worker Nodes

In addition to viewing monitoring data of all nodes, you can also view monitoring data of a single node. Access the cluster details page. Choose Nodes in the navigation pane and click Monitor in the Operation column of the target node.

Monitoring data comes from AOM. You can view the monitoring data of a node, including the CPU, memory, disk, network, and GPU.

Viewing Workload Monitoring Data

You can view monitoring data of a workload on the Monitoring tab page of the workload details page. Access the cluster details page. Choose Workloads in the navigation pane and click Monitor in the Operation column of the target workload.

Monitoring data comes from AOM. You can view the monitoring data of a workload, including the CPU, memory, network, and GPU, on the AOM console.

You can also click View More to go to the AOM console and view monitoring data of the workload.

Viewing Pod Monitoring Data

You can view monitoring data of a pod on the Pods tab page of the workload details page.

Parent topic: Monitoring, Logs, and Alarms

Last Article: Monitoring, Logs, and Alarms

Next Article: Custom Monitoring

Did this article solve your problem?

Thank you for your score！Your feedback would help us improve the website.

Products

Compute

Application

Dedicated Cloud

Storage

Management & Deployment

Migration

Network

Enterprise Intelligence

Video

Database

Edge Cloud Services

DevCloud

Security

Cloud Communications

Internet of Things

Solutions

Industry-Specific Solutions

General-Purpose Solutions

Security

DevOps

Enterprise Intelligence

Essential Platform

Big Data

Visual Cognition

Speech and Semantics

Support

Help Center

Customer Services

Developers

Console

语言 - Language

中国站 - 简体中文

中国站 - English

International - 简体中文

International - English