Updated on 2025-12-09 GMT+08:00

Service Resilience

VIAS uses a three-tier reliability architecture to keep the platform stable and available when failures occur: cross-availability zone (AZ) disaster recovery (DR), intra-AZ instance redundancy, and instance health checks with automatic recovery. Together, these measures protect business continuity and data security for users.

  • Cross-AZ DR

    The platform can be deployed across multiple AZs, with critical services and data distributed among them for geographic fault tolerance and isolation. If one AZ fails, for example because of a network outage or power disruption, the platform automatically switches to another AZ so that services are not interrupted.

  • Intra-AZ instance redundancy

    Each key service component runs as multiple instances through multi-node deployment and load balancing. If an instance fails, the system redirects traffic to healthy instances, maintaining high service availability.

  • Instance health check and automatic recovery

    The platform's health check mechanism continuously monitors instance metrics such as CPU, memory, and disk usage and network status. When it detects an anomaly, it triggers automatic recovery, which attempts to resolve the issue or restarts the affected instance.
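
The following Python sketch illustrates how the three tiers could interact. It is not the VIAS implementation: the Instance class, the route and recover functions, and the threshold values are all hypothetical and chosen only for illustration. Traffic goes to a healthy instance in the preferred AZ, falls back to another AZ if none is available, and unhealthy instances are restarted.

```python
from dataclasses import dataclass

# Hypothetical thresholds; the actual values used by the platform are not documented here.
CPU_LIMIT = 0.90
MEMORY_LIMIT = 0.90
DISK_LIMIT = 0.95


@dataclass
class Instance:
    """Hypothetical model of a service instance and its monitored metrics."""
    name: str
    az: str
    cpu: float = 0.0
    memory: float = 0.0
    disk: float = 0.0
    network_ok: bool = True
    restarts: int = 0

    def is_healthy(self) -> bool:
        # Health check: every monitored metric must be within its limit.
        return (
            self.network_ok
            and self.cpu < CPU_LIMIT
            and self.memory < MEMORY_LIMIT
            and self.disk < DISK_LIMIT
        )

    def restart(self) -> None:
        # Stand-in for a real recovery action: reset metrics to a clean state.
        self.cpu = self.memory = self.disk = 0.0
        self.network_ok = True
        self.restarts += 1


def route(instances, preferred_az):
    """Return a healthy instance, preferring preferred_az, else any other AZ."""
    healthy = [i for i in instances if i.is_healthy()]
    same_az = [i for i in healthy if i.az == preferred_az]
    candidates = same_az or healthy  # cross-AZ fallback when the AZ has none
    return candidates[0] if candidates else None


def recover(instances):
    """Restart every unhealthy instance and return the names restarted."""
    restarted = []
    for inst in instances:
        if not inst.is_healthy():
            inst.restart()
            restarted.append(inst.name)
    return restarted
```

For example, with two instances in az1 and one in az2, route returns the second az1 instance when the first is overloaded, returns the az2 instance when both az1 instances are unhealthy, and returns to az1 after recover has restarted them.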