CCE Advanced HPA

The CCE Advanced HPA add-on (formerly cce-hpa-controller) is developed by CCE. It flexibly scales Deployments in or out based on metrics such as CPU usage and memory usage.

After installing this add-on, you can create CronHPA and CustomedHPA policies. For details, see Creating a Scheduled CronHPA Policy and Creating a CustomedHPA Policy.

Main Functions

Scaling can be performed based on the percentage of the current number of pods.
The minimum scaling step can be set.
Different scaling operations can be performed based on the actual metric values.
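
As a rough illustration of how a percentage-based step and a minimum scaling step could combine, the following Python sketch computes a target pod count. It is not the add-on's implementation: the function names, the round-up behavior, and the `lower`/`upper` bounds are all assumptions made for this example.

```python
import math


def scale_step(current_pods: int, percent: float, min_step: int) -> int:
    """Return how many pods to add or remove for a percentage-based rule.

    Assumption: the percentage result is rounded up, and the minimum
    scaling step acts as a floor on the result.
    """
    if current_pods < 0 or percent < 0 or min_step < 0:
        raise ValueError("inputs must be non-negative")
    by_percent = math.ceil(current_pods * percent / 100)
    return max(by_percent, min_step)


def target_pods(current_pods: int, percent: float, min_step: int,
                scale_out: bool = True, lower: int = 1, upper: int = 100) -> int:
    """Apply the step and clamp to hypothetical lower/upper pod limits."""
    step = scale_step(current_pods, percent, min_step)
    desired = current_pods + step if scale_out else current_pods - step
    return min(max(desired, lower), upper)
```

For example, under these assumptions, scaling out 10 pods by 30% with a minimum step of 1 adds 3 pods, while scaling out 4 pods by 10% with a minimum step of 2 adds 2 pods because the minimum step takes precedence.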

Notes and Constraints

If the CCE Advanced HPA version is earlier than 1.2.11, the Prometheus add-on must be installed. If the CCE Advanced HPA version is 1.2.11 or later, an add-on that provides the Metrics API must be installed. Select one of the following add-ons based on your cluster version and requirements:
- Kubernetes Metrics Server: provides basic resource usage metrics, such as container CPU and memory usage. The default collection period is 60s. It is supported by all cluster versions.
- Cloud Native Cluster Monitoring: To use HPA policies, enable local data storage in this add-on. This add-on is available in clusters v1.17 or later. The default collection period is 15s.
  - Auto scaling based on basic resource metrics: Prometheus needs to be registered as a metrics API. For details, see Providing Basic Resource Metrics Through the Metrics API. If Kubernetes Metrics Server has been installed in the cluster, the Metrics API is provided by default. No manual registration is required.
  - Auto scaling based on custom metrics: In addition to registering Prometheus as a Metrics API service, you need to aggregate custom metrics to the Kubernetes API server. For details, see Creating an HPA Policy Using Custom Metrics.
- Prometheus (EOM): If Kubernetes Metrics Server is not installed in the cluster, you need to manually create the Metrics API for Prometheus. For details, see Providing Resource Metrics Through the Metrics API. This add-on supports only clusters v1.21 or earlier and is no longer maintained in clusters v1.21 or later. It is not recommended for CCE clusters. Use Cloud Native Cluster Monitoring or Kubernetes Metrics Server instead.
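
The 1.2.11 version threshold above is easy to get wrong when versions are compared as plain strings (for example, "1.2.4" sorts after "1.2.11" as text). The following Python sketch compares the version numerically instead. The function names and return strings are hypothetical and exist only for this example.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Convert a dotted version such as '1.2.11' into a tuple of integers."""
    return tuple(int(part) for part in version.split("."))


def metrics_requirement(addon_version: str) -> str:
    """Return which metrics dependency the add-on version needs.

    Versions earlier than 1.2.11 need the Prometheus add-on; 1.2.11 and
    later need any add-on that provides the Metrics API.
    """
    if parse_version(addon_version) < (1, 2, 11):
        return "Prometheus add-on"
    return "Metrics API provider"
```

Tuples of integers compare element by element, so `(1, 2, 4) < (1, 2, 11)` holds even though the string comparison would say otherwise.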

Installing the Add-on

Log in to the CCE console and click the cluster name to access the cluster console.
In the navigation pane, choose Add-ons. Locate CCE Advanced HPA on the right and click Install.
On the Install Add-on page, configure the specifications as needed.
- If you select Preset, the add-on specifications are automatically configured based on the values recommended by CCE. These values are suitable for most scenarios and can be viewed on the console.
- If you select Custom, you can modify the number of replicas, vCPUs, and memory of each add-on component as required.
  
  Replicas: A single replica cannot provide high availability (HA), so use one replica only for verification. In commercial scenarios, configure multiple replicas based on the cluster specifications.
  
  CPU Quota and Memory Quota: The resource quotas of a component depend on the number of containers and scaling policies in the cluster. In typical scenarios, configure 500m CPU and 1,000 MiB of memory for every 5,000 containers in the cluster, and 100m CPU and 500 MiB of memory for every 1,000 scaling policies.
Configure the add-on parameters.
- AHPA Policy: After this function is enabled, historical monitoring metrics are used to predict the required number of replicas, and workloads are scaled accordingly. For details, see Creating an AHPA Policy.
  To enable AHPA, install the Cloud Native Cluster Monitoring add-on and enable the function that reports monitoring data to AOM. For details, see Cloud Native Cluster Monitoring.

Configure deployment policies for the add-on pods.

Scheduling policies do not take effect on the DaemonSet pods of the add-on.
When configuring multi-AZ deployment or node affinity, ensure that there are nodes meeting the scheduling policy and that resources are sufficient in the cluster. Otherwise, the add-on pods cannot run.

**Table 1** Configurations for add-on scheduling
Parameter	Description
Multi-AZ Deployment	Preferred: Deployment pods of the add-on will be preferentially scheduled to nodes in different AZs. If all the nodes in the cluster are deployed in the same AZ, the pods will be scheduled to different nodes in that AZ. Equivalent mode: Deployment pods of the add-on are evenly scheduled to the nodes in the cluster in each AZ. If a new AZ is added, you are advised to increase add-on pods for cross-AZ HA deployment. With the Equivalent multi-AZ deployment, the difference between the number of add-on pods in different AZs will be less than or equal to 1. If resources in one of the AZs are insufficient, pods cannot be scheduled to that AZ. Forcible: Deployment pods of the add-on are forcibly scheduled to nodes in different AZs. There can be at most one pod in each AZ. If nodes in a cluster are not in different AZs, some add-on pods cannot run properly. If a node is faulty, the add-on pods on it may fail to be migrated.
Node Affinity	Not configured: Node affinity is disabled for the add-on pods. Specify node: Specify the nodes where the add-on pods are deployed. If you do not specify the nodes, the add-on pods will be randomly scheduled based on the default cluster scheduling policy. Specify node pool: Specify the node pool where the add-on pods are deployed. If you do not specify the node pools, the add-on pods will be randomly scheduled based on the default cluster scheduling policy. Customize affinity: Enter the labels of the nodes where the add-on pods are to be deployed for more flexible scheduling policies. If you do not specify node labels, the add-on pods will be randomly scheduled based on the default cluster scheduling policy. If multiple custom affinity policies are configured, ensure that there are nodes that meet all the affinity policies in the cluster. Otherwise, the add-on pods cannot run.
Toleration	Using both taints and tolerations enables (but does not require) the add-on's Deployment pods to be scheduled on nodes with matching taints, and allows control over pod eviction policies when host nodes are tainted. The add-on applies default toleration policies for the node.kubernetes.io/not-ready and node.kubernetes.io/unreachable taints on pods. The tolerance time window is 60s. For details, see Configuring Tolerance Policies.
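
To make the Equivalent multi-AZ constraint in Table 1 concrete (the pod counts of any two AZs differ by at most 1), the following Python sketch distributes a replica count across AZs. It only illustrates the arithmetic and is not how the scheduler works. The function name and AZ names are hypothetical.

```python
from typing import Dict, List


def spread_evenly(replicas: int, zones: List[str]) -> Dict[str, int]:
    """Distribute replicas so that per-zone counts differ by at most 1."""
    if not zones:
        raise ValueError("at least one zone is required")
    if replicas < 0:
        raise ValueError("replicas must be non-negative")
    base, extra = divmod(replicas, len(zones))
    # The first `extra` zones each receive one additional pod.
    return {zone: base + (1 if i < extra else 0) for i, zone in enumerate(zones)}
```

With 5 replicas and three AZs, this yields 2, 2, and 1 pods. If a fourth AZ is added without increasing the replica count, one AZ drops to 1 pod, which is why increasing the number of add-on pods is advised when a new AZ is added.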

Click Install.
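
The CPU Quota and Memory Quota guidance for Custom specifications can be turned into a quick estimate. The following Python sketch is one possible reading of that guidance, not an official sizing tool: it assumes that the container-based and policy-based amounts are added together and that partial blocks are rounded up. The function name is hypothetical.

```python
import math
from typing import Tuple


def recommended_quota(containers: int, policies: int) -> Tuple[int, int]:
    """Estimate (CPU in millicores, memory in MiB) for the add-on component.

    Guidance used: 500m CPU and 1,000 MiB per 5,000 containers, plus
    100m CPU and 500 MiB per 1,000 scaling policies.
    Assumptions: both parts are summed, and partial blocks round up.
    """
    if containers < 0 or policies < 0:
        raise ValueError("counts must be non-negative")
    container_blocks = math.ceil(containers / 5000)
    policy_blocks = math.ceil(policies / 1000)
    cpu_millicores = 500 * container_blocks + 100 * policy_blocks
    memory_mib = 1000 * container_blocks + 500 * policy_blocks
    return cpu_millicores, memory_mib
```

Under these assumptions, a cluster with 10,000 containers and 2,000 scaling policies would be sized at 1200m CPU and 3,000 MiB of memory. Treat the result as a starting point and adjust it based on observed usage.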

Components

**Table 2** Add-on components
Component	Description	Resource Type
customedhpa-controller	CCE auto scaling component, which scales Deployments in or out based on metrics such as CPU usage and memory usage	Deployment

Helpful Links

After the add-on is installed, you can create multiple auto scaling policies for workloads.

For details about the differences and principles of these auto scaling policies, see Workload Scaling Rules.

Release History

**Table 3** CCE Advanced HPA updates
Add-on Version	Supported Cluster Version	New Feature
1.5.85	v1.29 v1.30 v1.31 v1.32 v1.33 v1.34 v1.35 v1.36	Supported CCE clusters v1.36.
1.5.84	v1.29 v1.30 v1.31 v1.32 v1.33 v1.34 v1.35	Fixed some issues.
1.5.74	v1.29 v1.30 v1.31 v1.32 v1.33 v1.34 v1.35	Supported CCE clusters v1.35.
1.5.57	v1.28 v1.29 v1.30 v1.31 v1.32 v1.33 v1.34	Supported the PodLevelResources feature gate.
1.5.51	v1.28 v1.29 v1.30 v1.31 v1.32 v1.33 v1.34	Fixed some issues.
1.5.41	v1.28 v1.29 v1.30 v1.31 v1.32 v1.33 v1.34	Supported CCE clusters v1.34.
1.5.38	v1.27 v1.28 v1.29 v1.30 v1.31 v1.32 v1.33	Fixed some issues.
1.5.37	v1.27 v1.28 v1.29 v1.30 v1.31 v1.32 v1.33	Supported CCE clusters v1.33.
1.5.32	v1.25 v1.27 v1.28 v1.29 v1.30 v1.31 v1.32	Supported CCE clusters v1.32.
1.5.24	v1.25 v1.27 v1.28 v1.29 v1.30 v1.31	Fixed some issues.
1.5.21	v1.25 v1.27 v1.28 v1.29 v1.30 v1.31	Supported CCE clusters v1.31. Supported intelligent scalability based on application trend prediction.
1.5.3	v1.21 v1.23 v1.25 v1.27 v1.28 v1.29 v1.30	AHPA is available.
1.4.30	v1.21 v1.23 v1.25 v1.27 v1.28 v1.29 v1.30	Supported CCE clusters v1.30.
1.4.3	v1.21 v1.23 v1.25 v1.27 v1.28 v1.29	Fixed some issues.
1.4.2	v1.21 v1.23 v1.25 v1.27 v1.28 v1.29	Supported CCE clusters v1.29.
1.3.43	v1.21 v1.23 v1.25 v1.27 v1.28	Fixed some issues.
1.3.42	v1.21 v1.23 v1.25 v1.27 v1.28	Supported CCE clusters v1.28.
1.3.14	v1.19 v1.21 v1.23 v1.25 v1.27	Supported CCE clusters v1.27.
1.3.10	v1.19 v1.21 v1.23 v1.25	Periodic scaling is not affected by the cooldown period.
1.3.7	v1.19 v1.21 v1.23 v1.25	Supported anti-affinity scheduling of add-on pods on nodes in different AZs.
1.3.3	v1.19 v1.21 v1.23 v1.25	Supported CCE clusters v1.25. Allowed CronHPA to adjust the number of Deployment pods in the skip scenarios.
1.3.1	v1.19 v1.21 v1.23	Supported CCE clusters v1.23.
1.2.12	v1.15 v1.17 v1.19 v1.21	Optimized the add-on performance to reduce resource consumption.
1.2.11	v1.15 v1.17 v1.19 v1.21	Resource metrics can be obtained using the Kubernetes Metrics API. Unready pods are taken into account during the calculation of compute resource usage.
1.2.10	v1.15 v1.17 v1.19 v1.21	Supported CCE clusters v1.21.
1.2.4	v1.15 v1.17 v1.19	Regularly upgraded add-on dependencies. Allowed custom add-on resource specifications.
1.2.3	v1.15 v1.17 v1.19	Supported ARM64 nodes.
1.2.2	v1.15 v1.17 v1.19	Enhanced the health check function.
1.2.1	v1.15 v1.17 v1.19	Supported CCE clusters v1.19. Updated the add-on to a stable version.
1.1.3	v1.15 v1.17	Supported periodic scaling rules.

