Updated on 2025-05-22 GMT+08:00

OPS07-04 Supporting Fault Recovery Process

  • Risk level

    High

  • Key strategies

    If a fault occurs on the live network, recover from it quickly to minimize its impact. Control processes must cover the entire fault lifecycle: fault prevention, detection, locating, recovery, review, and continuous improvement (including fault drills). Build recovery capabilities based on the failure mode library so that the mean time to repair (MTTR) converges over the long term and faults can be recovered quickly.

  • Design suggestions
    Figure 1 Quick fault recovery process
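
The mean time to repair mentioned under the key strategies can be tracked per entry in the failure mode library to check whether recovery is actually getting faster. The following is a minimal sketch, not a prescribed implementation. It assumes that repair time is measured from fault detection to recovery. The record layout, the failure mode names, and the function names are hypothetical.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

TIME_FORMAT = "%Y-%m-%d %H:%M"

# Hypothetical incident records: (failure_mode, detected_at, recovered_at).
# In practice these would come from an incident management system.
INCIDENTS = [
    ("db_primary_down", "2025-01-03 10:00", "2025-01-03 10:40"),
    ("db_primary_down", "2025-03-12 02:15", "2025-03-12 02:35"),
    ("disk_full", "2025-02-20 08:00", "2025-02-20 08:45"),
]


def repair_minutes(detected_at, recovered_at):
    """Return the minutes elapsed between detection and recovery."""
    start = datetime.strptime(detected_at, TIME_FORMAT)
    end = datetime.strptime(recovered_at, TIME_FORMAT)
    return (end - start).total_seconds() / 60


def mttr_by_failure_mode(incidents):
    """Group incidents by failure mode and average their repair times."""
    durations = defaultdict(list)
    for mode, detected_at, recovered_at in incidents:
        durations[mode].append(repair_minutes(detected_at, recovered_at))
    return {mode: mean(values) for mode, values in durations.items()}


mttr = mttr_by_failure_mode(INCIDENTS)
for mode, minutes in sorted(mttr.items()):
    print(f"{mode}: MTTR {minutes:.1f} min")
```

Comparing these per-mode values across review periods is one way to see which failure modes still lack an effective recovery plan.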