Why Is the Job Still Queued When Resources Are Sufficient?
- If a public resource pool is used, the resources may be occupied by other users' jobs. Wait for them to be released, or see "Why Is a Training Job Always Queuing?" for solutions.
- If a dedicated resource pool is used, perform the following operations:
  - Check whether other jobs (including inference jobs, training jobs, and development environment jobs) are running in the dedicated resource pool.
    On the Dashboard page, open the details page of each running job or instance to check whether it uses the dedicated resource pool. Stop any that you no longer need to release resources.
    Figure 1 Dashboard
  - Go to the details page of the dedicated resource pool to check whether other jobs are already queuing.
    If there are, the new job must also wait in the queue.
    Figure 2 Queuing jobs
  - Check whether resources are fragmented.
    For example, the cluster has two nodes with four idle cards each, but your job requires eight cards on a single node. Although eight cards are idle in total, no single node can satisfy the request, so the idle resources cannot be allocated to your job.
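The fragmentation case can be illustrated with a minimal sketch. It is not a ModelArts API: the function name `fits_on_single_node` and the variables are hypothetical, and the idle-card counts are assumed to have been read manually from the resource pool details page. The sketch only shows why a job can stay queued even though the total number of idle cards looks sufficient.

```python
def fits_on_single_node(idle_cards_per_node, cards_required):
    """Return True if at least one node alone has enough idle cards.

    Hypothetical helper for illustration only; it does not query ModelArts.
    """
    return any(idle >= cards_required for idle in idle_cards_per_node)


# Values from the example: two nodes with four idle cards each,
# and a job that requires eight cards on one node.
idle_cards_per_node = [4, 4]
cards_required = 8

total_idle = sum(idle_cards_per_node)
schedulable = fits_on_single_node(idle_cards_per_node, cards_required)

print("Total idle cards are enough:", total_idle >= cards_required)
print("A single node can host the job:", schedulable)
```

Here the total (eight idle cards) matches the request, but neither node has eight idle cards on its own, so the second check fails and the job remains queued until enough cards are released on one node.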