Enabling Monitoring for Huawei Cloud Clusters

This section describes how to enable monitoring for Huawei Cloud clusters.

Constraints

Before enabling monitoring for Huawei Cloud clusters, kube-prometheus-stack may have been installed. If the add-on is in the Installing, Upgrading, Deleting, or Rolling back state, monitoring cannot be enabled. For details about the add-on status, see Add-on Status Description.

Prerequisites

A Huawei Cloud cluster has been registered with UCS. For details, see Huawei Cloud Clusters.

Procedure

Log in to the UCS console. In the navigation pane, choose Container Intelligent Analysis.
Select a fleet or a cluster not in the fleet, and click Enable Monitoring.

Figure 1 Selecting a fleet or a cluster not in the fleet
Select a Huawei Cloud cluster.
Click Next: Configure Connection to complete the metric collection settings.

Specifications
- Deployment Mode: The Agent and Server modes are supported. The add-on deployed in Agent mode occupies fewer cluster resources and provides Prometheus metric collection for clusters. However, it does not support the HPA and health diagnosis functions based on custom Prometheus statements. The add-on deployed in Server mode provides Prometheus metric collection for clusters and supports the HPA and health diagnosis functions based on custom Prometheus statements. This mode depends on the PVC and consumes a large amount of memory.
- Add-on Specifications: If Deployment Mode is set to Agent, the default add-on specifications are used. If Deployment Mode is set to Server, the add-on specifications include Demo (≤ 100 containers), Small (≤ 2,000 containers), Medium (≤ 5,000 containers), and Large (> 5,000 containers). Different specifications have different requirements on cluster resources, such as CPUs and memory. For details about the resource quotas of different add-on specifications, see Resource Quota Requirements of Different Specifications.
Parameters
- Interconnection Mode: Currently, only AOM can be interconnected.
- AOM Instance: CIA reports metrics to AOM in a unified manner. You need to select an AOM instance of the Prometheus for CCE type. The default metrics are collected for free but custom metrics are billed by AOM.
- Collection Period: period for Prometheus to collect and report metrics. The value ranges from 10 to 120 seconds. The default value is 15 seconds.
  Storage: (Required when Deployment Mode is set to Server) Used for temporary storage (PVC) of Prometheus data. By default, Huawei Cloud clusters use PVCs of the csi-disk-topology storage type. If an available PVC (pvc-prometheus-server) exists in the namespace monitoring, it can be used as the storage source.
  - EVS Disk Type: You can select High I/O, Ultra-high I/O, or Common I/O.
  - Capacity: capacity specified when a PVC is created or the maximum storage limit when the pod storage is selected.
  Using EVS disks for add-on storage will incur extra expenditures. For details, see Product Pricing Details.
For details about the add-on, see kube-prometheus-stack.
Click Confirm. The Clusters tab (Container Insights > Clusters) is displayed. The access status of the cluster is Installing.

After monitoring is enabled for the cluster, metrics such as the CPU usage and CPU allocation rate of the cluster are displayed in the list, indicating that the cluster is monitored by CIA.

If monitoring fails to be enabled for the cluster, rectify the fault by referring to FAQs.

Parent topic: Enabling Cluster Monitoring

Previous topic: Overview

Next topic: Enabling Monitoring for On-Premises Clusters

Feedback

Was this page helpful?

Helpful Not helpful

Provide feedback

Thank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.

The system is busy. Please try again later.

Which of the following issues have you encountered?

Content is inconsistent with the product UI

Unclear descriptions

Lack of examples or code

Incorrect steps

Can't find what I need

Lack of best practices

Feedback (optional)

0/500

Select at least one type of issue, and enter your comments or suggestions.

Enter a maximum of 500 characters.

Submit Cancel

For any further questions, feel free to contact us through the chatbot.

Chatbot