Help Center/
Cloud Container Engine/
User Guide/
Clusters/
Upgrading a Cluster/
Troubleshooting for Pre-upgrade Check Exceptions/
Key GPU Add-on Parameters
Updated on 2024-09-02 GMT+08:00
Key GPU Add-on Parameters
Check Items
Check whether the configuration of the CCE AI Suite add-on in a cluster has been intrusively modified. If so, upgrading the cluster may fail.
Solution
- Use kubectl to access the cluster.
- Run the following command to obtain the add-on instance details:
kubectl get ds nvidia-driver-installer -nkube-system -oyaml
- Check whether the UpdateStrategy value is changed to OnDelete. If so, change it back to RollingUpdate.
- Check whether the NVIDIA_DRIVER_DOWNLOAD_URL value is the same as the GPU driver version on the add-on page. If not, correct the version on the add-on page.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
The system is busy. Please try again later.
For any further questions, feel free to contact us through the chatbot.
Chatbot