Help Center > > User Guide> MRS Manager Operation Guide> Health Check Management> HDFS Health Check

HDFS Health Check

Updated at: Sep 12, 2019 GMT+08:00

Average Packet Sending Time

Indicator name: Average transfer time of sending packets Statistics

Indicator description: This indicator specifies the average time for DataNode in HDFS to send packets. If the average packet sending time exceeds 2,000,000 nanoseconds, the indicator is unhealthy.

Recovery guidance: If this indicator is abnormal, check whether the cluster network speed is normal and whether the memory or CPU usage is too high. You also need to check whether the HDFS load in the cluster is too high.

Service Health Status

Indicator name: Service Status

Indicator description: This indicator is used to check whether the service status of HDFS is normal. If a node is faulty, the service is unhealthy.

Recovery guidance: If this indicator is abnormal, you are advised to check whether the status of the KrbServer, LdapServer, and ZooKeeper services is Bad. If the service status is Bad, rectify the fault. Then check whether a file write failure occurs because HFDS SafeMode is ON, use the HDFS client to check whether data cannot be written into HDFS, and find the causes of the HDFS data write failure. At last, rectify the fault according to the alarm help.

Alarm Check

Indicator name: Alarm information

Indicator description: This indicator is used to check whether an uncleared alarm exists in the service. If an uncleared alarm exists, the indicator is unhealthy.

Recovery guidance: If this indicator is abnormal, rectify the fault according to the alarm help.

Did you find this page helpful?

Submit successfully!

Thank you for your feedback. Your feedback helps make our documentation better.

Failed to submit the feedback. Please try again later.

Which of the following issues have you encountered?







Please complete at least one feedback item.

Content most length 200 character

Content is empty.

OK Cancel