Error Message "RuntimeError: std::exception" Displayed for a PyTorch 1.0 Engine
Updated on 2022-12-08 GMT+08:00
Symptom
When a PyTorch 1.0 image is used, the following error message is displayed:
"RuntimeError: std::exception"
Possible Causes
The libmkldnn soft link in the PyTorch 1.0 image conflicts with the one in native Torch. For details, see conv1d fails in PyTorch 1.0.
Solution
- This issue is caused by a library conflict in the environment. To resolve it, add the following code at the very beginning of the boot script:
import os
os.system("rm /home/work/anaconda3/lib/libmkldnn.so")
os.system("rm /home/work/anaconda3/lib/libmkldnn.so.0")
- Use PyCharm on your local machine to remotely access a notebook instance for debugging.
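The removal step can also be written without shelling out to rm. The following is a minimal sketch, not the documented fix itself: it deletes the same two paths named in the workaround, skips any that are already absent, and reports what it removed. The function name remove_conflicting_libs and the constants LIB_DIR and LIB_NAMES are hypothetical names introduced for illustration.

```python
import os

# Paths taken from the workaround above; adjust them if your image differs.
LIB_DIR = "/home/work/anaconda3/lib"
LIB_NAMES = ("libmkldnn.so", "libmkldnn.so.0")


def remove_conflicting_libs(lib_dir=LIB_DIR, names=LIB_NAMES):
    """Delete the listed files or soft links if present; return the removed paths."""
    removed = []
    for name in names:
        path = os.path.join(lib_dir, name)
        # lexists() is also True for a soft link whose target is missing.
        if os.path.lexists(path):
            os.remove(path)
            removed.append(path)
    return removed


if __name__ == "__main__":
    # Run this at the very beginning of the boot script.
    print("Removed:", remove_conflicting_libs())
```

Because missing paths are skipped, running the script a second time in the same environment does nothing, and the printed list shows whether the soft links were actually present in the image.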
Summary and Suggestions
Before creating a training job, debug the training code in the ModelArts development environment to eliminate as many code migration errors as possible.
- Use the online notebook environment for debugging. For details, see Using JupyterLab to Develop a Model.
- Use the local IDE (PyCharm or VS Code) to access the cloud environment for debugging. For details, see Using the Local IDE to Develop a Model.