ALM-24004 Flume Fails to Read Data (For MRS 2.x or Earlier)
Description
The alarm module monitors the status of the Flume source. This alarm is generated when a Flume source has failed to read data for longer than the threshold.
Users can modify the threshold as required.
This alarm is cleared if the source reads data successfully.
Attribute
Alarm ID | Alarm Severity | Auto Clear |
---|---|---|
24004 | Major | Yes |
Parameters
Parameter | Description |
---|---|
ServiceName | Specifies the service for which the alarm is generated. |
HostName | Specifies the host for which the alarm is generated. |
ComponentType | Specifies the component type for which the alarm is generated. |
ComponentName | Specifies the component name for which the alarm is generated. |
Impact on the System
Data collection is stopped.
Possible Causes
- The Flume source is faulty.
- The network is faulty.
Procedure
1. Check whether the Flume source is normal.
   a. Check whether the Flume source is the spoolDir type.
   b. Query the spoolDir directory and check whether all files have been sent.
      - If yes, no further action is required.
      - If no, go to 1.e.
   c. Check whether the Flume source is the Kafka type.
   d. Log in to the Kafka client and run the following commands to check whether all topic data configured for the Kafka source has been consumed.
      cd /opt/client/Kafka/kafka/bin
      ./kafka-consumer-groups.sh --bootstrap-server <Kafka cluster IP address>:21007 --new-consumer --describe --group example-group1 --command-config ../config/consumer.properties
      - If yes, no further action is required.
      - If no, go to 1.e.
   e. Go to the cluster details page and click Components.
   f. Choose .
   g. Click the Flume instance of the faulty node and check whether the value of the Source Speed Metrics is 0.
      - If yes, go to 2.a.
      - If no, no further action is required.
2. Check the status of the network between the Flume source and the faulty node.
   a. Check whether the Flume source is the avro type.
   b. Log in to the host where the faulty node resides. Run the following command to switch to user root:
      sudo su - root
   c. Run the ping command with the IP address of the Flume source to check whether the Flume source can be pinged.
   d. Contact the network administrator to repair the network.
   e. Wait for a while and check whether the alarm is cleared.
      - If yes, no further action is required.
      - If no, go to 3.
3. Collect fault information.
   a. On MRS Manager, choose .
   b. Contact the O&M engineers and send the collected logs.
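The two manual checks in the procedure (whether all files in the spoolDir directory have been sent, and whether all topic data of the Kafka source has been consumed) can be scripted. The following is a minimal sketch and not part of MRS. The function names are hypothetical. It assumes that the spooling directory source marks processed files with Flume's default suffix .COMPLETED, so adjust the suffix if the source is configured differently, and note that the count is not meaningful if the source deletes files after processing. It also assumes that the kafka-consumer-groups.sh output contains a header row with a LAG column.

```shell
#!/bin/sh
# Hypothetical helpers for the manual checks in the procedure above.

# Print the number of regular, non-hidden files directly under a spooling
# directory that do not yet carry the completion suffix.
# Assumption: ".COMPLETED" is the suffix in use (Flume's default fileSuffix);
# pass another suffix as the second argument if the source overrides it.
count_pending_spool_files() {
  spool_dir=$1
  done_suffix=${2:-.COMPLETED}
  [ -d "$spool_dir" ] || { echo "not a directory: $spool_dir" >&2; return 1; }
  find "$spool_dir" -maxdepth 1 -type f ! -name "*${done_suffix}" ! -name '.*' \
    | wc -l | tr -d ' '
}

# Read the output of "kafka-consumer-groups.sh --describe" from standard input
# and print the sum of the LAG column. Rows whose LAG value is not a number
# (for example "-") are skipped. Returns non-zero if no LAG header is found.
sum_consumer_lag() {
  awk '
    lag_col == 0 {
      # Still looking for the header row; locate the LAG column by name.
      for (i = 1; i <= NF; i++) if ($i == "LAG") lag_col = i
      next
    }
    $lag_col ~ /^[0-9]+$/ { total += $lag_col }
    END {
      if (lag_col == 0) exit 1
      print total + 0
    }
  '
}

# Example usage (the directory path is a placeholder):
#   count_pending_spool_files /path/to/spoolDir
#   ./kafka-consumer-groups.sh ... --describe ... | sum_consumer_lag
```

A result of 0 from either function suggests that the source has nothing left to read. A non-zero result means that data is still pending and the procedure should continue with the Source Speed Metrics check.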
Related Information
N/A