Updated on 2023-10-27 GMT+08:00

Service Resilience

HSS uses a four-level reliability architecture. It provides inspection, resistance, and recovery capabilities to help you manually or automatically recover services, enhancing data durability and reliability.

Table 1 Reliability architecture

Category

Capability

Description

Type

Inspection

Situation Awareness (SA)

HSS interconnects with SA and evaluates asset risks based on alarms, vulnerabilities, and baseline check results.

System

Cloud Eye

With Cloud Eye, you can understand the resource usage and status of HSS, receive alarm notifications in a timely manner, and react to changes to keep your services run smoothly.

System

Resistance

Attack prevention

The agent provides self-protection, anti-removal, and anti-tamper capabilities.

Security

Data backup

All key data can be backed up. Even if the database is completely damaged, services can be restored using the backup data.

System

Service self-protection

HSS consists of microservices, which are independently deployed, started, and stopped.

The agent strictly controls its resource usage. If its resource usage exceeds the threshold, the agent is isolated or a bypassing operation is performed to avoid affecting user workloads. If system resources are insufficient, the agent performance will be degraded.

System

Restoration

System restoration

A VM or service can be manually or automatically rebuilt if it is faulty.

System

Process protection

If a process exits, the process will be automatically started to facilitate service recovery.

System