Help Center/ Data Lake Insight/ FAQs/ Flink Jobs/ O&M Guide/ Why Is the Flink Job Abnormal Due to Heartbeat Timeout Between JobManager and TaskManager?

Updated on 2023-05-19 GMT+08:00

View PDF

Why Is the Flink Job Abnormal Due to Heartbeat Timeout Between JobManager and TaskManager?

Symptom

JobManager and TaskManager heartbeats timed out. As a result, the Flink job is abnormal.

Figure 1 Error information
Click to enlarge

Possible Causes

Check whether the network is intermittently disconnected and whether the cluster load is high.
If Full GC occurs frequently, check the code to determine whether memory leakage occurs.
Figure 2 Full GC

Handling Procedure

If Full GC occurs frequently, check the code to determine whether memory leakage occurs.
Allocate more resources for a single TaskManager.

Contact technical support to modify the cluster heartbeat configuration.

Parent topic: O&M Guide

Feedback

Was this page helpful?

Helpful Not helpful

Provide feedback

Thank you very much for your feedback. We will continue working to improve the documentation.

The system is busy. Please try again later.

Which of the following issues have you encountered?

Content is inconsistent with the product UI

Unclear descriptions

Lack of examples or code

Incorrect steps

Can't find what I need

Lack of best practices

Feedback (optional)

0/500

Select at least one type of issue, and enter your comments or suggestions.

Enter a maximum of 500 characters.

Submit Cancel