ALM-45000 HetuEngine Service Unavailable
Alarm Description
The system checks the HetuEngine service status every 300 seconds. This alarm is generated when the HetuEngine service is unavailable.
This alarm is cleared when the HetuEngine service recovers.
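The raise-and-clear behavior described above can be pictured as a small state machine. The following is only an illustrative sketch, not FusionInsight Manager's implementation: the `AlarmTracker` class and its `update` method are hypothetical names, and only the 300-second interval and the alarm ID 45000 come from this page.

```python
CHECK_INTERVAL_S = 300  # check period stated in the alarm description
ALARM_ID = 45000        # alarm ID from the Alarm Attributes table


class AlarmTracker:
    """Hypothetical helper that tracks the alarm state across periodic checks."""

    def __init__(self):
        self.active = False

    def update(self, service_available):
        """Return "raise", "clear", or None for the result of one status check."""
        if not service_available and not self.active:
            self.active = True
            return "raise"   # service became unavailable: generate the alarm
        if service_available and self.active:
            self.active = False
            return "clear"   # service recovered: clear the alarm
        return None          # no state change on this check
```

Calling `update` once per check interval yields "raise" only on the first failed check and "clear" only on the first successful check after a failure.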
Alarm Attributes
Alarm ID | Alarm Severity | Alarm Type | Service Type | Auto Cleared |
---|---|---|---|---|
45000 | Critical | Error handling | HetuEngine | Yes |
Alarm Parameters
Type | Parameter | Description |
---|---|---|
Location Information | Source | Specifies the cluster for which the alarm is generated. |
Location Information | ServiceName | Specifies the service for which the alarm is generated. |
Location Information | RoleName | Specifies the role for which the alarm is generated. |
Location Information | HostName | Specifies the host for which the alarm is generated. |
Impact on the System
FusionInsight Manager cannot be used to perform operations on the HetuEngine cluster, and HetuEngine functions are unavailable.
Possible Causes
- The KrbServer service is abnormal.
- The ZooKeeper service is abnormal.
- The HDFS service is abnormal.
- The YARN service is abnormal.
- The DBService service is abnormal.
- The Hive service is abnormal.
- There is no HetuEngine HSBroker instance that is running properly.
Handling Procedure
Check the KrbServer service status.
- On FusionInsight Manager, choose O&M > Alarm > Alarm.
- In the alarm list, check whether the "ALM-25500 KrbServer Service Unavailable" alarm is generated.
- Clear "ALM-25500 KrbServer Service Unavailable" according to the alarm help.
- In the alarm list, check whether the alarm "ALM-45000 HetuEngine Service Unavailable" is cleared.
- If yes, no further action is required.
- If no, go to the "Check the ZooKeeper service status" procedure.
Check the ZooKeeper service status.
- In the alarm list, check whether the alarm "ALM-12007 Process Fault" is generated.
- In the alarm list, expand the row that contains the "Process Fault" alarm and check whether the service name shown in Location Information is ZooKeeper.
- Clear "ALM-12007 Process Fault" according to the alarm help.
- In the alarm list, check whether the alarm "ALM-45000 HetuEngine Service Unavailable" is cleared.
- If yes, no further action is required.
- If no, go to the "Check the HDFS service status" procedure.
Check the HDFS service status.
- In the alarm list, check whether the "ALM-14000 HDFS Service Unavailable" alarm is generated.
- Clear "ALM-14000 HDFS Service Unavailable" according to the alarm help.
- In the alarm list, check whether the "ALM-45000 HetuEngine Service Unavailable" alarm is cleared.
- If yes, no further action is required.
- If no, go to the "Check the YARN service status" procedure.
Check the YARN service status.
- In the alarm list, check whether the "ALM-18000 YARN Service Unavailable" alarm is generated.
- Clear "ALM-18000 YARN Service Unavailable" according to the alarm help.
- In the alarm list, check whether the "ALM-45000 HetuEngine Service Unavailable" alarm is cleared.
- If yes, no further action is required.
- If no, go to the "Check the DBService service status" procedure.
Check the DBService service status.
- In the alarm list, check whether the "ALM-27001 DBService Service Unavailable" alarm is generated.
- Clear "ALM-27001 DBService Service Unavailable" according to the alarm help.
- In the alarm list, check whether the "ALM-45000 HetuEngine Service Unavailable" alarm is cleared.
- If yes, no further action is required.
- If no, go to the "Check the Hive service status" procedure.
Check the Hive service status.
- In the alarm list, check whether the "ALM-16004 Hive Service Unavailable" alarm is generated.
- Clear "ALM-16004 Hive Service Unavailable" according to the alarm help.
- In the alarm list, check whether the "ALM-45000 HetuEngine Service Unavailable" alarm is cleared.
- If yes, no further action is required.
- If no, go to the procedure for checking the HetuEngine HSBroker instances.
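The dependency checks above all follow the same pattern: look for a specific alarm, clear it, then recheck ALM-45000. As a rough sketch of that triage order, the snippet below maps the alarm IDs named in this procedure to the dependency they indicate. The input format (a list of `(alarm_id, service_name)` tuples) and the function name `dependencies_to_fix` are assumptions made for illustration; this page does not describe an API for reading the alarm list.

```python
# (alarm ID, required ServiceName or None, dependency), in the order this procedure checks them.
DEPENDENCY_ALARMS = [
    ("ALM-25500", None, "KrbServer"),
    ("ALM-12007", "ZooKeeper", "ZooKeeper"),  # Process Fault counts only if it is for ZooKeeper
    ("ALM-14000", None, "HDFS"),
    ("ALM-18000", None, "YARN"),
    ("ALM-27001", None, "DBService"),
    ("ALM-16004", None, "Hive"),
]


def dependencies_to_fix(active_alarms):
    """Return the dependencies whose alarms are active, in procedure order.

    active_alarms is a hypothetical list of (alarm_id, service_name) tuples
    copied by hand from the alarm list.
    """
    active = set(active_alarms)
    active_ids = {alarm_id for alarm_id, _ in active}
    to_fix = []
    for alarm_id, service, dependency in DEPENDENCY_ALARMS:
        if service is None:
            hit = alarm_id in active_ids
        else:
            hit = (alarm_id, service) in active
        if hit:
            to_fix.append(dependency)
    return to_fix
```

An empty result suggests that none of the dependency alarms is present, so the remaining checks in this procedure apply.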
Check whether a HetuEngine HSBroker instance is running properly.
- On FusionInsight Manager, choose Cluster > Name of the desired cluster > Services > HetuEngine. On the page that is displayed, click the Instance tab.
- Check whether at least one HSBroker instance is running properly.
- In the alarm list, check whether the "ALM-45000 HetuEngine Service Unavailable" alarm is cleared.
- If yes, no further action is required.
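- If no, go to the "Check the network connection between HetuEngine and ZooKeeper, HDFS, YARN, DBService, and Hive" procedure.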
Check the network connection between HetuEngine and ZooKeeper, HDFS, YARN, DBService, and Hive.
- On FusionInsight Manager, choose Cluster > Name of the desired cluster > Services > HetuEngine. On the page that is displayed, click the Instance tab.
- Click the host name in the HSBroker row and record the management IP address in the Basic Information area.
- Log in to the host where HSBroker resides as user omm using the management IP address recorded in the previous step.
- Run the ping command to check whether the network connection between the host where HSBroker resides and the hosts where ZooKeeper, HDFS, YARN, DBService, and Hive reside is normal.
- If the connection is abnormal, contact the network administrator to restore the network.
- In the alarm list, check whether the "ALM-45000 HetuEngine Service Unavailable" alarm is cleared.
- If yes, no further action is required.
- If no, go to the "Collect fault information" procedure.
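When several dependency hosts need to be pinged, the connectivity check above can be scripted. The following is a minimal sketch, assuming a Linux host whose `ping` accepts the `-c` (count) and `-W` (timeout in seconds) options; the function names and the label-to-address mapping are hypothetical, and the addresses must be filled in for the actual cluster.

```python
import subprocess


def ping_host(address, count=3, timeout_s=2):
    """Return True if the host answers ping (Linux ping options assumed)."""
    cmd = ["ping", "-c", str(count), "-W", str(timeout_s), address]
    completed = subprocess.run(
        cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    )
    return completed.returncode == 0


def unreachable_hosts(hosts, ping=ping_host):
    """hosts maps a label to an address; return the labels that did not respond."""
    return [label for label, address in hosts.items() if not ping(address)]
```

Run on the host where HSBroker resides, `unreachable_hosts` takes a dictionary such as one keyed by ZooKeeper, HDFS, YARN, DBService, and Hive, and returns the labels to report to the network administrator. The `ping` parameter exists only so that the check can be replaced in tests.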
Collect fault information.
- On FusionInsight Manager, choose O&M > Log > Download.
- Expand the Service drop-down list. In the Services dialog box that is displayed, select HetuEngine under the target cluster name, and click OK.
- Expand the Hosts drop-down list. In the Select Host dialog box that is displayed, select the hosts to which the role belongs, and click OK.
- Click the edit icon in the upper right corner, and set Start Date and End Date for log collection to 30 minutes before and 30 minutes after the alarm generation time, respectively. Then, click Download.
- Contact O&M engineers and provide the collected logs.
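The log collection window in the steps above is simple arithmetic on the alarm generation time. A small sketch, assuming the alarm time is available as a Python `datetime` (the function name `log_collection_window` is hypothetical):

```python
from datetime import datetime, timedelta


def log_collection_window(alarm_time, margin_minutes=30):
    """Return (start, end): 30 minutes before and after the alarm generation time."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin
```

For an alarm generated at 12:00, this gives a Start Date of 11:30 and an End Date of 12:30 on the same day.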
Alarm Clearance
After the fault is rectified, the system automatically clears this alarm.
Related Information
None.