Help Center/ ModelArts/ Troubleshooting/ DevEnviron/ Code Running Failures/ Why Does the Instance Break Down When dead kernel Is Displayed During Training Code Running?
Updated on 2024-06-11 GMT+08:00

Why Does the Instance Break Down When dead kernel Is Displayed During Training Code Running?

The notebook instance breaks down during training code running due to insufficient memory caused by large data volume or excessive training layers.

After this error occurs, the system automatically restarts the notebook instance to fix the instance breakdown. In this case, only the breakdown is fixed. If you run the training code again, the failure will still occur. To solve the problem of insufficient memory, you are advised to create a new notebook instance and use a resource pool of higher specifications, such as a GPU or dedicated resource pool, to run the training code. An existing notebook instance that has been successfully created cannot be scaled up using resources with higher specifications.