Updated on 2025-07-24 GMT+08:00

Monitored DCS Events

Introduction

Event monitoring provides event collection, query, and alarm reporting. Major events and DCS operation events can be collected by Cloud Eye, which can then generate alarms for them.

Namespace

SYS.DCS

Monitored events

Table 1 DCS events

Event Name

Event ID

Event Severity

Description

Solution

Impact

Full synchronization retry during online migration

migrationFullResync

Minor

If online migration fails, full synchronization will be triggered because incremental synchronization cannot be performed.

Check whether full sync retries are triggered repeatedly. Check whether the source instance is connected and whether it is overloaded. If full sync retries are triggered repeatedly, contact O&M personnel.

The migration task is disconnected from the source instance, triggering another full sync. As a result, the CPU usage of the source instance may increase sharply.

Automatic failover

masterStandbyFailover

Minor

The master node became abnormal, and a replica was promoted to master.

Check whether services can recover by themselves. If applications cannot recover, restart them.

Persistent connections to the instance will be interrupted.

Memcached master/standby switchover

memcachedMasterStandbyFailover

Minor

The master node became abnormal, and the standby node was promoted to master.

Check whether services can recover by themselves. If applications cannot recover, restart them.

Persistent connections to the instance will be interrupted.

Redis server abnormal

redisNodeStatusAbnormal

Major

The Redis server status was abnormal.

Check whether services are affected. If yes, contact O&M personnel.

If the master node is abnormal, an automatic failover is performed. If a standby node is abnormal and the client directly connects to the standby node for read/write splitting, no data can be read.

Redis server recovered

redisNodeStatusNormal

Major

The Redis server status recovered.

Check whether services can recover. If applications cannot reconnect, restart them.

The instance recovers from an exception.

Sync failure in data migration

migrateSyncDataFail

Major

Online migration failed.

Reconfigure the migration task and migrate data again. If the fault persists, contact O&M personnel.

Data migration fails.

Memcached instance abnormal

memcachedInstanceStatusAbnormal

Major

The Memcached node status was abnormal.

Check whether services are affected. If yes, contact O&M personnel.

The Memcached instance is abnormal and may be inaccessible.

Memcached instance recovered

memcachedInstanceStatusNormal

Major

The Memcached node status recovered.

Check whether services can recover. If applications cannot reconnect, restart them.

The instance recovers from an exception.

Instance backup failure

instanceBackupFailure

Major

The DCS instance failed to be backed up due to an OBS access failure.

Retry the backup manually.

Automated backup fails.

Instance node abnormal restart

instanceNodeAbnormalRestart

Major

DCS nodes restarted unexpectedly when they became faulty.

Check whether services can recover by themselves. If applications cannot recover, restart them.

Persistent connections to the instance will be interrupted.

Long-running Lua scripts stopped

scriptsStopped

Informational

Lua scripts that had timed out automatically stopped running.

Optimize Lua scripts to prevent execution timeout.

Long-running Lua scripts block the entire instance, so scripts that exceed the execution timeout are forcibly interrupted.

Node restarted

nodeRestarted

Informational

After write operations had been performed, the node automatically restarted to stop Lua scripts that had timed out.

Check whether services can recover by themselves. If applications cannot recover, restart them.

Persistent connections to the instance will be interrupted.

Bandwidth scaling

bandwidthAutoScalingTriggered

Informational

The instance bandwidth was used up, triggering bandwidth scaling.

Check the services on this instance.

The instance bandwidth is used up. The increased bandwidth will be billed.

Specifications scaled

specAutoScalingTriggeredSuccess

Informational

The instance specifications were scaled successfully.

Check the services on this instance.

The instance is scaled up. Check the instance information.

Specifications scaling failed

specAutoScalingTriggeredFail

Critical

The instance specifications failed to be scaled.

Auto scaling failed. Contact technical support.

Instance scaling failed. Log in to the console to check whether services are affected.
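The event IDs and severities in Table 1 can be used to triage incoming events. The following is a minimal sketch, not an official Cloud Eye or DCS API. It assumes event IDs arrive as plain strings. The names DCS_EVENT_SEVERITY, SEVERITY_RANK, and most_severe_first are hypothetical helpers introduced for illustration, and the ranking Critical > Major > Minor > Informational is an assumption rather than something Table 1 states. Only the event IDs and their severity levels are taken from the table.

```python
# Event IDs and severities copied from Table 1 (namespace SYS.DCS).
DCS_EVENT_SEVERITY = {
    "migrationFullResync": "Minor",
    "masterStandbyFailover": "Minor",
    "memcachedMasterStandbyFailover": "Minor",
    "redisNodeStatusAbnormal": "Major",
    "redisNodeStatusNormal": "Major",
    "migrateSyncDataFail": "Major",
    "memcachedInstanceStatusAbnormal": "Major",
    "memcachedInstanceStatusNormal": "Major",
    "instanceBackupFailure": "Major",
    "instanceNodeAbnormalRestart": "Major",
    "scriptsStopped": "Informational",
    "nodeRestarted": "Informational",
    "bandwidthAutoScalingTriggered": "Informational",
    "specAutoScalingTriggeredSuccess": "Informational",
    "specAutoScalingTriggeredFail": "Critical",
}

# Assumed urgency ordering (lower number = more urgent); not defined by Table 1.
SEVERITY_RANK = {"Critical": 0, "Major": 1, "Minor": 2, "Informational": 3}


def most_severe_first(event_ids):
    """Sort event IDs from most to least severe.

    Events with the same severity keep their input order (sorted() is stable).
    An ID that is not listed in Table 1 raises KeyError.
    """
    return sorted(event_ids, key=lambda e: SEVERITY_RANK[DCS_EVENT_SEVERITY[e]])


# Hypothetical batch of event IDs, for illustration only.
incoming = [
    "scriptsStopped",
    "specAutoScalingTriggeredFail",
    "masterStandbyFailover",
    "instanceBackupFailure",
]
print(most_severe_first(incoming))
```

Sorting by severity first puts the single Critical event (specAutoScalingTriggeredFail) ahead of informational notices such as scriptsStopped, which helps when several events arrive together.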