ALM-23001 Loader Service Unavailable (For MRS 2.x or Earlier)
Description
The system checks the Loader service availability every 60 seconds. This alarm is generated if the Loader service is unavailable and is cleared after the Loader service recovers.
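The raise-and-clear behavior described above can be pictured as a simple polling loop. The following is a minimal sketch under stated assumptions, not the MRS implementation: `probe_loader` is a hypothetical placeholder for whatever health probe the system actually runs, and `next_alarm_state`, `monitor`, and the `LOADER_UP` variable are illustrative names that do not appear in MRS.

```shell
#!/bin/sh
# Illustrative sketch only: models the raise/clear semantics of ALM-23001.
# The real availability check is internal to MRS and is not shown here.

CHECK_INTERVAL=60   # seconds between checks, as stated in the description

# Hypothetical probe: exits 0 if Loader is considered available.
# It reads an environment variable so the sketch can run anywhere;
# replace the body with a real check in practice.
probe_loader() {
    [ "${LOADER_UP:-1}" = "1" ]
}

# Print the alarm state that follows from one probe result.
# $1 = probe exit status (0 means the service is available)
next_alarm_state() {
    if [ "$1" -eq 0 ]; then
        echo "cleared"
    else
        echo "raised"
    fi
}

# Run $1 check cycles, sleeping $2 seconds (default CHECK_INTERVAL) between them.
monitor() {
    cycles=$1
    interval=${2:-$CHECK_INTERVAL}
    i=0
    while [ "$i" -lt "$cycles" ]; do
        if probe_loader; then rc=0; else rc=1; fi
        echo "ALM-23001 $(next_alarm_state "$rc")"
        i=$((i + 1))
        if [ "$i" -lt "$cycles" ]; then
            sleep "$interval"
        fi
    done
}
```

Calling `monitor 3` would run three checks at the default 60-second interval. The point of the sketch is that the alarm state depends only on the most recent check, which is why the alarm clears automatically once Loader recovers.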
Attribute
Alarm ID | Alarm Severity | Auto Clear |
---|---|---|
23001 | Critical | Yes |
Parameters
Parameter | Description |
---|---|
ServiceName | Specifies the service for which the alarm is generated. |
RoleName | Specifies the role for which the alarm is generated. |
HostName | Specifies the host for which the alarm is generated. |
Impact on the System
Data loading, import, and conversion are unavailable.
Possible Causes
- The services that Loader depends on are abnormal.
  - ZooKeeper is abnormal.
  - HDFS is abnormal.
  - DBService is abnormal.
  - Yarn is abnormal.
  - MapReduce is abnormal.
- The network is faulty. Loader cannot communicate with its dependent services.
- Loader is running improperly.
Procedure
1. Check the ZooKeeper status.
   a. Go to the MRS cluster details page and click Components.
   b. Choose ZooKeeper and check whether the health status of ZooKeeper is normal.
   c. Choose More > Restart Service to restart ZooKeeper. After ZooKeeper starts, check whether the "ALM-23001 Loader Service Unavailable" alarm is cleared.
      - If yes, no further action is required.
      - If no, go to 1.d.
   d. On MRS Manager, check whether the "ALM-12007 Process Fault" alarm is reported.
   e. In Alarm Details of the "ALM-12007 Process Fault" alarm, check whether ServiceName is ZooKeeper.
   f. Clear the alarm according to the handling suggestions of "ALM-12007 Process Fault".
   g. Check whether the "ALM-23001 Loader Service Unavailable" alarm is cleared.
      - If yes, no further action is required.
      - If no, go to 2.a.
2. Check the HDFS status.
   a. Go to the MRS cluster details page and choose Alarms.
   b. On MRS Manager, check whether the "ALM-14000 HDFS Service Unavailable" alarm is reported.
   c. Clear the alarm according to the handling suggestions of "ALM-14000 HDFS Service Unavailable".
   d. Check whether the "ALM-23001 Loader Service Unavailable" alarm is cleared.
      - If yes, no further action is required.
      - If no, go to 3.a.
3. Check the DBService status.
   a. Go to the MRS cluster details page and click Components.
   b. Choose DBService and check whether the health status of DBService is normal.
   c. Choose More > Restart Service to restart DBService. After DBService starts, check whether the "ALM-23001 Loader Service Unavailable" alarm is cleared.
      - If yes, no further action is required.
      - If no, go to 4.a.
4. Check the MapReduce status.
   a. Go to the MRS cluster details page and click Components.
   b. Choose MapReduce and check whether the health status of MapReduce is normal.
   c. Choose More > Restart Service to restart MapReduce. After MapReduce starts, check whether the "ALM-23001 Loader Service Unavailable" alarm is cleared.
      - If yes, no further action is required.
      - If no, go to 5.a.
5. Check the Yarn status.
   a. Go to the MRS cluster details page and click Components.
   b. Choose Yarn and check whether the health status of Yarn is normal.
   c. Choose More > Restart Service to restart Yarn. After Yarn starts, check whether the "ALM-23001 Loader Service Unavailable" alarm is cleared.
      - If yes, no further action is required.
      - If no, go to 5.d.
   d. On MRS Manager, check whether the "ALM-18000 Yarn Service Unavailable" alarm is reported.
   e. Clear the alarm according to the handling suggestions of "ALM-18000 Yarn Service Unavailable".
   f. Check whether the "ALM-23001 Loader Service Unavailable" alarm is cleared.
      - If yes, no further action is required.
      - If no, go to 6.a.
6. Check the network connections between Loader and its dependent components.
   a. Go to the MRS cluster details page and click Components.
   b. Click Loader.
   c. Click Instance. The Sqoop instance list is displayed.
   d. Record the management IP addresses of all Sqoop instances.
   e. Log in to the hosts using the IP addresses obtained in 6.d. Run the following commands to switch the user:
      sudo su - root
      su - omm
   f. Run the ping command to check whether the network connections between the hosts where the Sqoop instances reside and the dependent components are normal. The dependent components are ZooKeeper, DBService, HDFS, MapReduce, and Yarn. Obtain their IP addresses in the same way as the IP addresses of the Sqoop instances.
   g. If a connection is abnormal, contact the network administrator to repair the network.
   h. Check whether the "ALM-23001 Loader Service Unavailable" alarm is cleared.
      - If yes, no further action is required.
      - If no, go to 7.
7. Collect fault information.
   a. On MRS Manager, choose .
   b. Contact the O&M engineers and send the collected logs.
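The ping check in the network step of the procedure can be scripted so that every dependent-component host is tested in one pass. The following is a minimal sketch, assuming a Linux host with the iputils `ping` command. The function names `probe_host` and `check_hosts` and the `PING_CMD` override are illustrative assumptions, and the IP addresses must be supplied by the operator because the source does not list any.

```shell
#!/bin/sh
# Sketch of the connectivity check: ping each dependent-component host from a
# Sqoop instance host and report which ones are unreachable.

# Probe a single host ($1). PING_CMD may be overridden (an assumption added
# for testing); by default it sends 3 echo requests and waits up to
# 2 seconds for each reply (Linux iputils ping options).
probe_host() {
    ${PING_CMD:-ping -c 3 -W 2} "$1" >/dev/null 2>&1
}

# Check every host passed as an argument.
# Prints "OK <host>" or "FAIL <host>" for each one and
# returns non-zero if at least one host is unreachable.
check_hosts() {
    failed=0
    for host in "$@"; do
        if probe_host "$host"; then
            echo "OK $host"
        else
            echo "FAIL $host"
            failed=1
        fi
    done
    return "$failed"
}
```

As a usage example, after switching to user omm on a Sqoop instance host, run `check_hosts 192.0.2.10 192.0.2.11`, replacing the placeholder addresses with the recorded management IP addresses of the ZooKeeper, DBService, HDFS, MapReduce, and Yarn instances. Any `FAIL` line identifies a host to report to the network administrator.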