Updated on 2024-06-26 GMT+08:00

Workload Monitoring

To monitor the resource usage of workloads, choose Monitoring Center > Workloads. This page provides information about all workloads in a cluster and monitoring data of a single workload, such as the CPU/memory usage and network inbound/outbound rate.

Navigation Path

  1. Log in to the CCE console and click the cluster name to access the cluster console.
  2. In the navigation pane on the left, choose Monitoring Center. Then, click Workloads.

    This page displays information about all workloads. To view the monitoring information of a workload, click the workload name to access its Overview tab and switch to the Pods or Monitoring tab.

Workloads

This page lists the name, status, number of pods (normal/all), namespace, image name, CPU usage, and memory usage of each workload.

Figure 1 Workloads

You can search for the desired workload by workload type (in the upper right corner), namespace, and workload name or status.

You can click Export to export data of all workloads or selected workloads. The exported file is in .xlsx format, and the file name contains the timestamp.

Overview

You can click the workload name to view the resource overview, including the workload status, number of pods (abnormal/total), and abnormal events. You can also view the monitoring overview of the last hour, including the CPU usage, memory usage, and network inbound/outbound rate.

Figure 2 Resource overview and monitoring overview

The Overview tab also shows the pod usage trend. You can switch the metrics in the upper right corner of the chart to view the CPU usage, used CPUs, memory usage, and used memory of each pod of the workload. You can also click Top 5 (Descending) or Top 5 (Ascending) in the upper left corner to view the top 5 data in descending or ascending order.

Figure 3 Pod usage trend

For more metrics, go to the Monitoring tab.

Pods

This tab lists the name, status, namespace, IP address, node, number of restarts, CPU request/limit, memory request/limit, CPU usage, and memory usage of each pod.

Figure 4 Pods

You can search for the desired pod by name, status, namespace, IP address, or node. You can click Export to export data of all pods or selected pods. The exported file is in .xlsx format, and the file name contains the timestamp.

You can click the name of a pod to view its monitoring data. For more information, see Pod Monitoring.

Monitoring

This tab shows the resource usage of the workload in each dimension in the last 1 hour, last 8 hours, last 24 hours, or a custom period. To view more monitoring information, click View Dashboard to access the Dashboard page. For details, see Using Dashboard.

Figure 5 Workload monitoring
  • CPU Metrics
    • CPU usage: the percentage of the CPU used by containers in all pods of the workload in different time periods to the total CPU limit for all containers.
    • CPU throttled: the average percentage of time containers have been throttled in all pods of the workload in different time periods.
  • Memory Metrics
    • Memory usage: the percentage of memory used by containers in all pods of the workload in different time periods to the total memory limit for all containers.
  • Networking Metrics
    • Total outbound rate: the total number of bytes transmitted by containers in all pods of the workload per second in different time periods.
    • Total inbound rate: the total number of bytes received by containers in all pods of the workload per second in different time periods.
    • Packet loss rate (transmit): the percentage of packets not received by the recipient to packets sent from containers in all pods of the workload in different time periods.
    • Packet loss rate (receive): the percentage of packets not received by containers in all pods of the workload to packets sent to the containers in different time periods.
  • Pod Metrics
    • Pod CPU usage: the percentage of CPU used by each pod of the workload in different time periods to the CPU limit for each pod.
    • Pod memory usage: the percentage of memory used by each pod of the workload in different time periods to the memory limit for each pod.
    • Pod status and quantity: the total number of pods in the Unavailable, Unready, Running, Completed, or Other state of the workload in different time periods.
    • Pod quantity trend: the number of pods (replicas) of the workload in different time periods.