VPA

The VPA add-on supports vertical pod autoscaling. It automates the adjustment of CPU and memory resource requests for pods based on their historical resource usage.

For details about the open-source Vertical Pod Autoscaler, see autoscaler.

Overview

VPA collects and analyzes resource metrics for each container, adjusts the requested resources based on actual usage, and maintains the ratio of resource limit to request before and after the adjustment. VPA can increase or decrease CPU and memory resources as needed.

The rules are as follows:

VPA generates the CPU and memory resource recommendations using the data collected by the Metrics API.
In theory, VPA recommends at least 250 MiB of memory for each pod, which works out to 250 MiB divided by the number of containers in the pod for each container. Likewise, it recommends at least 25m of CPU (25 millicores) for each pod, or 25m divided by the number of containers in the pod for each container.
When creating a VPA policy, you can set the minimum and maximum resources that VPA may recommend for each container by configuring the containerPolicies field.
If a container has both a resource request and a resource limit configured, VPA adjusts the container's request to match its recommendation and derives a new limit that preserves the request-to-limit ratio set when the container was created.
For example, assume that a container requests 100m of CPU with a limit of 200m (a ratio of 1:2). If VPA recommends a request of 80m, the container's CPU limit becomes 160m.
VPA does not guarantee that its recommendations stay within other resource constraints. If a recommendation conflicts with such a constraint, the recommendation is not adjusted to fit it, so the resource configuration suggested by VPA may exceed the constraint.
For example, assume that the total memory requested in a namespace cannot exceed 2 GiB. If VPA recommends a high memory configuration for a pod in that namespace, the total memory requested in the namespace may exceed 2 GiB after the pod's resource configuration is updated. In that case, the pod will not be scheduled.
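
The arithmetic behind the rules above can be sketched in a few lines of Python. This is only an illustration of the ratio and per-container minimum calculations described in this section, not the code that VPA runs. The function names are hypothetical, and rounding to a whole unit is an assumption.

```python
from fractions import Fraction


def scaled_limit(orig_request: int, orig_limit: int, recommended_request: int) -> int:
    """Return a limit that keeps the original limit-to-request ratio.

    All values use the same unit (for example, CPU millicores).
    Rounding to a whole unit is an assumption made for this sketch.
    """
    ratio = Fraction(orig_limit, orig_request)
    return round(recommended_request * ratio)


def per_container_minimum(pod_minimum: float, container_count: int) -> float:
    """Split a pod-level minimum recommendation evenly across containers."""
    return pod_minimum / container_count


# Example from this section: request 100m, limit 200m, recommended request 80m.
print(scaled_limit(100, 200, 80))     # 160

# A pod with two containers: 250 MiB and 25m split evenly.
print(per_container_minimum(250, 2))  # 125.0
print(per_container_minimum(25, 2))   # 12.5
```

Exact fractions are used for the ratio so that ratios such as 1:3 do not pick up floating-point error before rounding.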

Prerequisites

The cluster version must be v1.25 or later.
An add-on that provides the Metrics API must be installed in the cluster. Select one of the following add-ons based on your service requirements:
- Kubernetes Metrics Server: provides basic resource usage metrics, such as container CPU and memory usage.
- Cloud Native Cluster Monitoring: provides basic resource usage metrics using Prometheus. You need to register Prometheus as a service of Metrics API. For details, see Providing Basic Resource Metrics Through the Metrics API.
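
One way to confirm that the Metrics API is being served is to inspect its APIService object. The following Python sketch assumes that you have saved the output of `kubectl get apiservice v1beta1.metrics.k8s.io -o json` and want to check whether it reports an Available condition. The helper name and the sample document are hypothetical, and the sample is trimmed to the fields the check reads.

```python
import json


def metrics_api_available(apiservice_json: str) -> bool:
    """Return True if the APIService JSON reports the Available condition as True."""
    apiservice = json.loads(apiservice_json)
    conditions = apiservice.get("status", {}).get("conditions", [])
    return any(
        c.get("type") == "Available" and c.get("status") == "True"
        for c in conditions
    )


# Hypothetical sample, trimmed to the fields used above.
sample = json.dumps({
    "kind": "APIService",
    "metadata": {"name": "v1beta1.metrics.k8s.io"},
    "status": {"conditions": [{"type": "Available", "status": "True"}]},
})

print(metrics_api_available(sample))  # True
```

If the function returns False, or the APIService does not exist, VPA has no Metrics API data to base its recommendations on.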

Installing the Add-on

Log in to the CCE console and click the cluster name to access the cluster console.
In the navigation pane, choose Add-ons. Locate VPA on the right and click Install.
On the Install Add-on page, configure the specifications as needed.
- If you select Preset, you can choose Small, Medium, or Large based on the number of pods in the cluster. The system automatically sets the number of add-on pods and resource quotas according to the preset specifications. You can view the configurations on the console.
- If you select Custom, you can adjust the number of pods and resource quotas as needed. High availability is not possible with a single pod. If an error occurs on the node where the add-on instance runs, the add-on will fail.

Configure deployment policies for the add-on pods.

Scheduling policies do not take effect on add-on pods of the DaemonSet type.
When configuring multi-AZ deployment or node affinity, ensure that there are nodes meeting the scheduling policy and that resources are sufficient in the cluster. Otherwise, the add-on cannot run.

**Table 1** Configurations for add-on scheduling
Parameter	Description
Multi-AZ Deployment	Preferred: Deployment pods of the add-on will be preferentially scheduled to nodes in different AZs. If all the nodes in the cluster are deployed in the same AZ, the pods will be scheduled to different nodes in that AZ. Forcible: Deployment pods of the add-on are forcibly scheduled to nodes in different AZs. There can be at most one pod in each AZ. If nodes in a cluster are not in different AZs, some add-on pods cannot run properly. If a node is faulty, add-on pods on it may fail to be migrated.
Node Affinity	Not configured: Node affinity is disabled for the add-on. Specify node: Specify the nodes where the add-on is deployed. If you do not specify the nodes, the add-on will be randomly scheduled based on the default cluster scheduling policy. Specify node pool: Specify the node pool where the add-on is deployed. If you do not specify the node pools, the add-on will be randomly scheduled based on the default cluster scheduling policy. Customize affinity: Enter the labels of the nodes where the add-on is to be deployed for more flexible scheduling policies. If you do not specify node labels, the add-on will be randomly scheduled based on the default cluster scheduling policy. If multiple custom affinity policies are configured, ensure that there are nodes that meet all the affinity policies in the cluster. Otherwise, the add-on cannot run.
Toleration	Taints and tolerations work together to allow (but not force) the add-on Deployment to be scheduled to nodes with matching taints, and to control how the Deployment is evicted after the node where it runs is tainted. The add-on adds default tolerations for the node.kubernetes.io/not-ready and node.kubernetes.io/unreachable taints, each with a tolerance time window of 60s. For details, see Configuring Tolerance Policies.

Click Install.

Components

**Table 2** Add-on components
Component	Description	Resource Type
vpa-admission-controller	Changes the resource requests of a pod to the recommendations generated by VPA when the pod is created.	Deployment
vpa-recommender	Collects the actual CPU and memory metrics of a pod and generates recommendations for the requested resources based on the actual resource usage.	Deployment
vpa-updater	Evicts a pod whose actual resource requests differ from the VPA recommendations and triggers pod recreation so that the resource recommendations can be applied to the new pod.	Deployment

Helpful Links

After the add-on is installed, you can create VPA policies to automatically adjust the CPU and memory requested by pods. For details, see Creating a VPA Policy.
For details about how VPA works, see How VPA Works.
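
As a rough illustration of what a VPA policy contains, the following Python sketch builds a VerticalPodAutoscaler manifest in the format defined by the open-source autoscaler (API group autoscaling.k8s.io/v1) and prints it as JSON, which kubectl accepts in place of YAML. The policy name, the target Deployment, and the bounds are hypothetical, and the procedure in Creating a VPA Policy may expose different options.

```python
import json


def build_vpa_manifest(name, deployment, min_allowed, max_allowed):
    """Build a VerticalPodAutoscaler manifest as a plain dict.

    min_allowed and max_allowed bound the recommendations for every
    container ("*") through the containerPolicies field.
    """
    return {
        "apiVersion": "autoscaling.k8s.io/v1",
        "kind": "VerticalPodAutoscaler",
        "metadata": {"name": name},
        "spec": {
            "targetRef": {
                "apiVersion": "apps/v1",
                "kind": "Deployment",
                "name": deployment,
            },
            "updatePolicy": {"updateMode": "Auto"},
            "resourcePolicy": {
                "containerPolicies": [
                    {
                        "containerName": "*",
                        "minAllowed": min_allowed,
                        "maxAllowed": max_allowed,
                    }
                ]
            },
        },
    }


# Hypothetical names and bounds, chosen only for illustration.
manifest = build_vpa_manifest(
    name="example-vpa",
    deployment="example-app",
    min_allowed={"cpu": "25m", "memory": "250Mi"},
    max_allowed={"cpu": "1", "memory": "2Gi"},
)

print(json.dumps(manifest, indent=2))
```

The printed JSON can be saved to a file and applied with `kubectl apply -f`, assuming the VPA add-on is installed in the cluster.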

Release History

**Table 3** Vertical Pod Autoscaler add-on
Add-on Version	Supported Cluster Version	New Feature	Community Version
1.0.25	v1.25, v1.27, v1.28, v1.29, v1.30, v1.31, v1.32	CCE clusters v1.32 are supported.	1.3.0
1.0.11	v1.25, v1.27, v1.28, v1.29, v1.30, v1.31	Updated the add-on to its community version 1.3.0.	1.3.0
1.0.6	v1.25, v1.27, v1.28, v1.29, v1.30, v1.31	CCE clusters v1.31 are supported.	1.1.2
1.0.4	v1.25, v1.27, v1.28, v1.29, v1.30	Vertical Pod Autoscaler (VPA) is now available.	1.1.2