Updated on 2024-07-02 GMT+08:00

Guaranteed Egress Network Bandwidth

Egress network bandwidth can be guaranteed by configuring network priorities. This applies to the following scenarios:

  • Online and offline services share the egress network bandwidth, and online services must always have sufficient bandwidth. When the bandwidth usage of online services reaches the threshold, the bandwidth available to offline services is reduced.
  • When online services use little network bandwidth, offline services can use more. When online services use a large amount of network bandwidth, the bandwidth available to offline services is reduced so that online services take priority.

Notes and Constraints

Before using guaranteed egress network bandwidth, ensure that the following requirements are met:

  • Only nodes running Huawei Cloud EulerOS 2.0 are supported.
  • Only CCE Turbo clusters of v1.23 or later are supported.
  • Volcano of v1.9.0 or later must be installed in the cluster, and hybrid deployment must be enabled by setting colocation_enable in the advanced settings to true.
  • Before enabling, modifying, or disabling guaranteed egress network bandwidth, ensure that Volcano is working properly.
  • Pods that were already running on a node before Volcano was installed must be restarted manually after guaranteed egress network bandwidth is enabled. Otherwise, the feature does not take effect for those pods.
  • Uninstalling Volcano or disabling hybrid deployment does not affect the guaranteed egress network bandwidth settings that already exist on a node. To disable guaranteed egress network bandwidth, disable network isolation.
  • If bandwidth limiting is enabled, packets may accumulate in the protocol stack buffers. For protocols without a backpressure mechanism, such as UDP, this can cause packet loss and ENOBUFS errors.
  • Bandwidth limiting increases the risk that offline services cannot obtain enough bandwidth. Insufficient bandwidth may cause these services to malfunction or pod health checks to fail.
  • Egress network bandwidth guarantee is overridden or does not take effect in the following scenarios:
    • If a network bandwidth limit is configured for online or offline pods in hybrid deployment, that limit takes priority over egress network bandwidth guarantee.
    • If a pod uses the node network (hostNetwork), egress network bandwidth guarantee does not take effect for that pod.
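
The version and hostNetwork constraints above can be expressed as a small pre-flight check. The following is a minimal sketch only: NodeInfo, unmet_requirements, and guarantee_applies are hypothetical names introduced for illustration, not part of any CCE or Volcano API, and the input values are assumed to be collected by other means (for example, from the console).

```python
import re
from dataclasses import dataclass


def parse_version(text: str) -> tuple:
    """Turn 'v1.23' or '1.9.0' into a comparable 3-tuple such as (1, 23, 0)."""
    numbers = [int(n) for n in re.findall(r"\d+", text)[:3]]
    return tuple(numbers + [0] * (3 - len(numbers)))


@dataclass
class NodeInfo:
    """Hypothetical container for the facts the constraints refer to."""
    os_name: str
    cluster_type: str
    cluster_version: str
    volcano_version: str
    colocation_enabled: bool  # value of colocation_enable in the Volcano settings


def unmet_requirements(node: NodeInfo) -> list:
    """Return a list of constraints from this section that the node violates."""
    problems = []
    if node.os_name != "Huawei Cloud EulerOS 2.0":
        problems.append("node OS must be Huawei Cloud EulerOS 2.0")
    if node.cluster_type != "CCE Turbo":
        problems.append("cluster must be a CCE Turbo cluster")
    if parse_version(node.cluster_version) < (1, 23, 0):
        problems.append("cluster version must be v1.23 or later")
    if parse_version(node.volcano_version) < (1, 9, 0):
        problems.append("Volcano must be v1.9.0 or later")
    if not node.colocation_enabled:
        problems.append("colocation_enable must be set to true")
    return problems


def guarantee_applies(pod_spec: dict) -> bool:
    """Pods on the node network (hostNetwork: true) are not covered."""
    return not pod_spec.get("hostNetwork", False)
```

An empty list from unmet_requirements means that none of the checked constraints is violated. The check does not cover the runtime conditions in the list, such as whether Volcano is working properly.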

Procedure

The following describes how to enable or disable egress network bandwidth guarantee.

  1. Log in to the CCE console and click the cluster name to access the cluster console.
  2. In the navigation pane, choose Nodes. On the Node Pools tab page, locate the target node pool and choose More > Mixed configuration.

    Ensure that node pool hybrid deployment and network isolation are enabled.

  3. (Optional) Modify parameters for egress network bandwidth guarantee.

    The modified parameters take effect for all nodes running Huawei Cloud EulerOS 2.0 in the cluster.

    Table 1 Network isolation parameters

    | Parameter | Parameter Key | Description | Default Value | Configuration Range |
    |---|---|---|---|---|
    | Online Job Bandwidth Threshold (%) | onlineBandwidthWatermarkPercent | Ratio of the total bandwidth threshold of online services to the assured bandwidth of the node type. Total bandwidth threshold of online services = Assured bandwidth of the node type × onlineBandwidthWatermarkPercent/100 | 80 | 1 to 1000. NOTE: The actual network bandwidth may be larger than the assured bandwidth but less than the maximum bandwidth. Therefore, the value can be greater than 100. |
    | Offline Job Bandwidth Percentage Upper Limit (%) When Online Job Bandwidth Usage Does Not Exceed the Threshold | offlineHighBandwidthPercent | Ratio of the maximum total bandwidth usage of offline services to the assured bandwidth of the node type when the bandwidth usage of online services does not exceed the threshold. If the total bandwidth usage of online services on the same node does not exceed Assured bandwidth of the node type × onlineBandwidthWatermarkPercent/100, the total bandwidth usage of offline services on the same node cannot exceed Assured bandwidth of the node type × offlineHighBandwidthPercent/100. | 40 | |
    | Offline Job Bandwidth Percentage Upper Limit (%) When Online Job Bandwidth Usage Exceeds the Threshold | offlineLowBandwidthPercent | Ratio of the maximum total bandwidth usage of offline services to the assured bandwidth of the node type when the bandwidth usage of online services exceeds the threshold. If the total bandwidth usage of online services on the same node exceeds Assured bandwidth of the node type × onlineBandwidthWatermarkPercent/100, the total bandwidth usage of offline services on the same node cannot exceed Assured bandwidth of the node type × offlineLowBandwidthPercent/100. | 10 | |

    Figure 1 Example of egress network bandwidth guarantee

    In the preceding figure, when the bandwidth used by the online job is below the bandwidth baseline, the offline job is subject to the higher upper limit (offlineHighBandwidthPercent) and can use more bandwidth. When the bandwidth used by the online job exceeds the bandwidth baseline, the upper limit for the offline job drops to the lower value (offlineLowBandwidthPercent). This reduces the bandwidth used by the offline job and reserves more bandwidth for the online job.
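
The formulas in Table 1 can be checked with a short calculation. The following sketch only mirrors the arithmetic described above and is not how Volcano implements the feature. The function name offline_bandwidth_cap is hypothetical, the defaults are the default values from Table 1, and only the documented range of onlineBandwidthWatermarkPercent (1 to 1000) is validated.

```python
def offline_bandwidth_cap(
    assured_bandwidth: float,
    online_usage: float,
    online_watermark_percent: int = 80,  # onlineBandwidthWatermarkPercent
    offline_high_percent: int = 40,      # offlineHighBandwidthPercent
    offline_low_percent: int = 10,       # offlineLowBandwidthPercent
) -> float:
    """Return the upper limit on total offline bandwidth for one node.

    assured_bandwidth and online_usage must use the same unit; the
    result is expressed in that unit too.
    """
    if not 1 <= online_watermark_percent <= 1000:
        raise ValueError("onlineBandwidthWatermarkPercent must be 1 to 1000")

    # Total bandwidth threshold of online services.
    threshold = assured_bandwidth * online_watermark_percent / 100

    # The lower limit applies only when online usage exceeds the threshold.
    if online_usage > threshold:
        percent = offline_low_percent
    else:
        percent = offline_high_percent
    return assured_bandwidth * percent / 100


# Hypothetical node with an assured bandwidth of 1000 (any unit).
quiet = offline_bandwidth_cap(1000, online_usage=500)
busy = offline_bandwidth_cap(1000, online_usage=900)
```

With the default values and an assured bandwidth of 1000, the online threshold is 800. Offline services are therefore capped at 400 while online usage is 500, and at 100 once online usage rises to 900.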