Updated on 2025-05-22 GMT+08:00

Overview

Reliability tests are performed to ensure that a system can meet the expected reliability objectives within the specified lifecycle. Unlike function tests, reliability tests involve service loads and faults, which poses higher requirements on the test environment and capabilities.

By proactively introducing faults, reliability tests and drills detect system risks in advance, and enhance test quality, risk contingency plans, monitoring and alarms, and fault emergency efficiency. In this way, faults can be effectively prevented, handled in a timely manner, and regression verification can be performed after faults are rectified. Distributed system resilience is built based on faults to continuously improve software quality, enhance the confidence of the team in software production and operation, and reduce similar problems during service running.

To ensure test effectiveness, the test environment must be the same as the production environment.

Huawei Cloud provides fault injection service (MAS-CAST), performance test service (CodeArts PerfTest), and multi-site high availability service (MAS), which can be used for fault injection tests, pressure load tests, long-term stability tests, and disaster drills.