Help Center/ ModelArts/ Troubleshooting/ DevEnviron/ Code Running Failures/ What Do I Do If cudaCheckError Occurs During Training?

Updated on 2024-06-11 GMT+08:00

View PDF

What Do I Do If cudaCheckError Occurs During Training?

Symptom

The following error occurs when the training code is executed in a notebook:

cudaCheckError() failed : no kernel image is available for execution on the device

Possible Cause

Parameters arch and code in setup.py have not been set to match the GPU compute power.

Solution

For Tesla V100 GPUs, the GPU compute power is -gencode arch=compute_70,code=[sm_70,compute_70]. Set the compilation parameters in setup.py accordingly.

Parent topic: Code Running Failures

Previous topic: Why Does the Instance Break Down When dead kernel Is Displayed During Training Code Running?

Next topic: What Do I Do If Insufficient Space Is Displayed in DevEnviron?

Feedback

Was this page helpful?

Helpful Not helpful

Provide feedback

Thank you very much for your feedback. We will continue working to improve the documentation.

The system is busy. Please try again later.

Which of the following issues have you encountered?

Content is inconsistent with the product UI

Unclear descriptions

Lack of examples or code

Incorrect steps

Can't find what I need

Lack of best practices

Feedback (optional)

0/500

Select at least one type of issue, and enter your comments or suggestions.

Enter a maximum of 500 characters.

Submit Cancel