Updated on 2024-06-17 GMT+08:00

Preparing GPU Virtualization Resources

This section describes how you can plan and prepare basic software and hardware before using GPU virtualization.

Basic planning

Resource

Version

Cluster

v1.25.15-r7 or later

OS

Huawei Cloud EulerOS 2.0

GPU

T4 and V100

GPU driver

470.57.02, 470.103.01, 470.141.03, 510.39.01, and 510.47.03

Container runtime

containerd

Add-ons

The following add-ons must be installed in a cluster:

Procedure 1: Adding GPU Nodes to a Cluster and Labeling the Nodes

If there are GPU nodes that comply with the basic planning in your cluster, skip this procedure.

  1. Add nodes that support GPU virtualization to your cluster. For details, see Adding Nodes to On-Premises Clusters.
  2. Label the nodes with accelerator: nvidia-{GPU model}. For details, see Adding Labels/Taints to Nodes.

    Figure 1 Labeling nodes that support GPU virtualization

Procedure 2: Installing the Add-ons

If the add-ons that comply with the basic planning have been installed in your cluster, you can skip this procedure.

If the driver version is changed, restart the node to apply the change.

  1. Log in to the UCS console and click the cluster name to access the cluster console. In the navigation pane on the left, choose Add-ons. In the Add-ons Installed area, check whether volcano and gpu-device-plugin have been installed.
  2. If volcano is not installed, install it by referring to volcano.

    If gpu-device-plugin is not installed, install it by referring to gpu-device-plugin.