Updated on 2024-09-19 GMT+08:00

How CronFederatedHPA Works

CronFederatedHPA is needed because FederatedHPA can only scale in or out pods for workloads based on metrics data. However, metric-based scaling brings in latency. CronFederatedHPA can automatically scale in or out pods for workloads at regular intervals.

You can configure a CronFederatedHPA for workloads whose resource usage changes periodically, so that pods can be added before predicated peak hours and reclaimed at off-peak hours.

How CronFederatedHPA Works

Figure 1 shows the working principle of CronFederatedHPA. When creating a CronFederatedHPA, you can specify a time to adjust the maximum and minimum numbers of pods in a FederatedHPA or directly specify the number of pods desired.

Figure 1 Working principle of CronFederatedHPA

Using CronFederatedHPA Separately

If CronFederatedHPA is separately used, it periodically adjusts the number of pods for workloads. After you set the effective time and desired number of pods in a CronFederatedHPA, pods will be periodically scaled after the CronFederatedHPA is in effect.

Figure 2 Using CronFederatedHPA separately

The detailed procedure is as follows:

  1. Create a CronFederatedHPA and set the effective time and desired number of pods.
    • Effective time: the time when the CronFederatedHPA takes effect.
    • Desired number of pods: the desired number of pods when the CronFederatedHPA takes effect.
  2. When the CronFederatedHPA takes effect, the number of existing pods in the workload will be compared with the desired number of pods set in 1. If the desired number is greater, pods are scaled out for the workload. If the desired number is smaller, pods are scaled in.

    Number of existing pods: the number of pods in the workload before the CronFederatedHPA takes effect.

Using Both CronFederatedHPA and FederatedHPA

If both FederatedHPA and CronFederatedHPA are used, CronFederatedHPA runs based on FederatedHPA and periodically adjusts the maximum and minimum numbers of pods in the FederatedHPA for scheduled scaling.

Figure 3 Using both FederatedHPA and CronFederatedHPA

The detailed procedure is as follows:

  1. Create a CronFederatedHPA and set the effective time and desired number of pods.
    • Effective time: the time when the CronFederatedHPA takes effect.
    • Desired number of pods: the number of pods set in the CronFederatedHPA. When CronFederatedHPA takes effect, this number will be used as a reference for adjusting the maximum and minimum numbers of pods in the FederatedHPA. The maximum and minimum numbers can be used as starting points for adjusting the number of pods for a workload.
  2. When the CronFederatedHPA takes effect, the number of existing pods of the workload, maximum number of pods and minimum number of pods in the FederatedHPA, and desired number of pods set in 1 will be compared to determine how much the maximum and minimum numbers of pods in the FederatedHPA will be adjusted. Then, the FederatedHPA scales in or out pods for the workload based on the adjusted maximum and minimum numbers of pods.
    • Number of existing pods: the number of pods in the workload before the CronFederatedHPA takes effect.
    • Maximum number of pods in the FederatedHPA: the maximum number of pods for a workload.
    • Minimum number of pods in the FederatedHPA: the minimum number of pods for a workload.

Figure 4 and Table 1 show the possible scaling scenarios when both FederatedHPA and CronFederatedHPA are used. You can learn about how CronFederatedHPA takes effect on the FederatedHPA and workload based on the number of existing pods, maximum number of pods, minimum number of pods, and desired number of pods.

Figure 4 Scaling scenarios when both policies are used
Table 1 Scaling scenarios when both policies are used

Scenario No.

Description

Desired Number of Pods

(in a CronFederatedHPA)

Number of Existing Pods

(in a Workload)

Minimum/Maximum Number of Pods

(in a FederatedHPA)

Result

1

Desired number of pods < Minimum number of pods ≤ Number of existing pods ≤ Maximum number of pods

3

5

4/10

  • The minimum number of pods in the FederatedHPA is changed to 3.
  • The number of existing pods of the workload is not changed.

2

Desired number of pods = Minimum number of pods ≤ Number of existing pods ≤ Maximum number of pods

4

5

4/10

  • The minimum number of pods in the FederatedHPA is not changed.
  • The number of existing pods of the workload is not changed.

3

Minimum number of pods < Desired number of pods < Number of existing pods ≤ Maximum number of pods

5

6

4/10

  • The minimum number of pods in the FederatedHPA is changed to 5.
  • The number of existing pods of the workload is not changed.

4

Minimum number of pods < Desired number of pods = Number of existing pods ≤ Maximum number of pods

5

5

4/10

  • The minimum number of pods in the FederatedHPA is changed to 5.
  • The number of existing pods of the workload is not changed.

5

Minimum number of pods ≤ Number of existing pods < Desired number of pods < Maximum number of pods

6

5

4/10

  • The minimum number of pods in the FederatedHPA is changed to 6.
  • The number of existing pods of the workload is changed to 6.

6

Minimum number of pods ≤ Number of existing pods < Desired number of pods = Maximum number of pods

10

4

4/10

  • The minimum number of pods in the FederatedHPA is changed to 10.
  • The number of existing pods of the workload is changed to 10.

7

Minimum number of pods ≤ Number of existing pods ≤ Maximum number of pods < Desired number of pods

11

4

4/10

  • The minimum and maximum numbers of pods in the FederatedHPA are both changed to 11.
  • The number of existing pods of the workload is changed to 11.