Help Center/ Cloud Container Engine/ User Guide/ O&M/ Health Center/ Diagnosis Items and Rectification Solutions
Updated on 2025-01-07 GMT+08:00

Diagnosis Items and Rectification Solutions

Cluster Diagnosis Items and Rectification Solutions

Pod Diagnosis Items and Rectification Solutions

Table 1 Pod diagnosis items and rectification solutions

Diagnosis Item

Rectification Solution

FailedScheduling

Insufficient memory

The available memory of the node is insufficient. Expand the memory capacity.

Insufficient cpu

The available CPU of the node is insufficient. Expand the CPU capacity.

skip schedule deleting pod

The pod is being deleted.

Other information

If the pod fails to be scheduled, view the pod information.

kubectl describe <pod-name>

FailedAttachVolume

Check the status of the Everest add-on and node network connection, and ensure that the node has required permissions.

FailedMount

Check the status of the Everest add-on and node network connection, and ensure that the node has required permissions.

InvalidDiskCapacity

Check the disk capacity of the node and the actual available space. Ensure that the disk capacity is correctly set and meets the storage requirements of applications or services. Delete unnecessary files to release disk space. If a dynamic volume is used, ensure that the storage backend configuration is correct and available. Expand the disk capacity or adjust the storage requirements of applications or services as needed.

BackOffPullImage

Ensure that the image tag is correct.

FailedPullImage

Ensure that the image tag is correct.

ErrImageNeverPull

Check the local image. You are advised to set the image pull policy to IfNotPresent or Always.

InspectFailed

Check the integrity of the image.

FailedPostStartHook

Check the configuration and script of the post-start hook to ensure that they are correct. View the hook execution log to obtain the error information and rectify the fault in the hook script based on the error information. If possible, manually execute the post-start hook script to check whether the environment or permissions are correct.

FailedPreStopHook

Check the configuration and script of the pre-stop hook to ensure that they are correct. View the hook execution log to obtain the error information and rectify the fault in the hook script based on the error information. If possible, manually execute the pre-stop hook script to check whether the environment or permissions are correct.

ProbeWarning

Check the probe configuration to ensure that the probe is correctly configured and can correctly evaluate the container health status. View the alarm information to find the possible faults, and adjust the probe configuration or rectify the faults in the container as needed.

Unhealthy

Check the pod or container logs to find error information. Ensure that applications or services are correctly started and running in the container. Check the container resource usage to determine whether resources are insufficient. Take measures based on logs and monitoring information, such as restarting pods or containers to rectify application or service faults.

FailedCreatePodContainer

Check the pod and container configurations to ensure that the YAML file is correct, including the container image, resource request, and limit.

Preempting

You are advised to set proper resource requests and limits for the load to prevent preemption caused by insufficient resources.

Killing

Check the resource usage and ensure that the resource requests and limits of pods and nodes are properly set to prevent containers from being terminated due to insufficient resources.