Help Center > > User Guide> MRS Manager Operation Guide> Health Check Management> ZooKeeper Health Check

ZooKeeper Health Check

Updated at: Sep 12, 2019 GMT+08:00

Average Latency of Request Processing by ZooKeeper

Indicator name: Average Latency

Indicator description: This indicator is used to check the average latency for the ZooKeeper service to process a request. If the average latency is greater than 300 ms, the indicator is unhealthy.

Recovery guidance: If this indicator is abnormal, check whether the cluster network speed is normal and whether the memory or CPU usage is too high.

Usage of ZooKeeper Connections

Indicator name: ZooKeeper Connections Usage

Indicator description: This indicator is used to check whether the memory usage of ZooKeeper exceeds 80%. If the memory usage exceeds the threshold, the indicator is unhealthy.

Recovery guidance: If this indicator is abnormal, you are advised to increase memory for the ZooKeeper service. You can increase memory by increasing the value of -Xmx in the GC_OPTS parameter of the ZooKeeper service. After the modification, restart the ZooKeeper service.

Service Health Status

Indicator name: Service Status

Indicator description: This indicator is used to check whether the service status of ZooKeeper is in normal state. If the service status is abnormal, the indicator is unhealthy.

Recovery guidance: If this indicator is abnormal, you are advised to check whether the status of the KrbServer and LdapServer services is Bad. If the service status is Bad, rectify the fault. Then log in to the ZooKeeper client to check whether data cannot be written into ZooKeeper and find the causes of the ZooKeeper data write failure according to the error message. At last, rectify the fault according to ALM-13000.

Alarm Check

Indicator name: Alarm information

Indicator description: This indicator is used to check whether an uncleared alarm exists in the service. If an uncleared alarm exists, the indicator is unhealthy.

Recovery guidance: If this indicator is abnormal, you are advised to rectify the fault according to the alarm help.

Did you find this page helpful?

Submit successfully!

Thank you for your feedback. Your feedback helps make our documentation better.

Failed to submit the feedback. Please try again later.

Which of the following issues have you encountered?







Please complete at least one feedback item.

Content most length 200 character

Content is empty.

OK Cancel