Help Center/ Data Lake Insight/ Service Overview/ Security/ Recovery from Failures

Updated on 2025-07-11 GMT+08:00

View PDF

Recovery from Failures

System-Level
DLI uses a decoupled storage and compute architecture. In the event of a system fault, a compute cluster can be automatically recovered due to Kubernetes' resource scheduling and failover mechanism.

Job-Level
You can enable automatic restart and recovery for Flink and Spark jobs. After this function is enabled, jobs will be automatically restarted and recovered if exceptions occur.

Parent Topic: Security

Previous topic: Security Risk Monitoring

Next topic: Update Management

Feedback

Was this page helpful?

Helpful Not helpful

Provide feedback

Thank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.

The system is busy. Please try again later.

Which of the following issues have you encountered?

Content is inconsistent with the product UI

Unclear descriptions

Lack of examples or code

Incorrect steps

Can't find what I need

Lack of best practices

Feedback (optional)

0/500

Select at least one type of issue, and enter your comments or suggestions.

Enter a maximum of 500 characters.

Submit Cancel

For any further questions, feel free to contact us through the chatbot.