Updated on 2025-09-11 GMT+08:00

Overview

The resilience center module provides the contingency plans to ensure stable system running. You can customize a contingency plan for each type of fault that may occur in the system. After a fault occurs, you can quickly rectify the fault based on your contingency plan to minimize the impact of the fault on services.

You can create a custom contingency plan on the Resilience Center > Contingency Plans page of COC. You need to fill in the basic information, select a handling method from the Scripts or Jobs page, and associate the corresponding scripts or jobs to form a complete contingency plan.

You can also view, modify, or delete the created contingency plans at any time. This ensures that the contingency plans always adapt to the actual system conditions and service requirements, and provides continuous and effective support for system resilience.

Benefits

In terms of fault rectification efficiency, contingency plans can be made in advance so that O&M personnel do not need to think about countermeasures from scratch when a fault occurs. This greatly shortens the fault rectification time and reduces service loss caused by system interruption. In terms of risk control, the preset contingency plan makes the fault rectification process more standard and orderly. This avoids unnecessary problems caused by misoperations, and ensures service data security and system stability. In addition, you can flexibly adjust the contingency plans based on service changes and system upgrade. This ensures that the countermeasures are always effective and improves the risk resistance capability of the overall O&M system.

Typical Scenarios

  • Server breakdown: If a core server breaks down unexpectedly, O&M personnel can immediately invoke the contingency plan created in advance. The contingency plan is associated with the server restart script and data restoration job. The server can be quickly restored by performing operations according to the contingency plan, reducing the service interruption duration.
  • Database data exception: If data in the database is incorrect or lost, the related contingency plan can instruct O&M personnel to execute the data rollback script and restore the data based on the data backup job to ensure the accuracy and integrity of service data.
  • Network attack protection: For possible network attacks, you can set the script for enabling the firewall to enhance the configuration and traffic cleansing in the contingency plan. When an attack occurs, the script can be quickly executed to defend against the attack and maintain the smooth running of the system