Monitored DCS Events
Introduction
Event monitoring provides event collection, query, and alarm reporting. These major events or DCS operation events can be collected to Cloud Eye to alarm.
Namespace
SYS.DCS
Monitored events
Event Name |
Event ID |
Event Severity |
Description |
Solution |
Impact |
---|---|---|---|---|---|
Full synchronization during online migration retry |
migrationFullResync |
Minor |
If online migration fails, full synchronization will be triggered because incremental synchronization cannot be performed. |
Check whether full sync retries are triggered repeatedly. Check whether the source instance is connected and whether it is overloaded. If full sync retries are triggered repeatedly, contact O&M personnel. |
The migration task is disconnected from the source instance, triggering another full sync. As a result, the CPU usage of the source instance may increase sharply. |
Automatic failover |
masterStandbyFailover |
Minor |
The master node was abnormal, promoting a replica to master. |
Check whether services can recover by themselves. If applications cannot recover, restart them. |
Persistent connections to the instance will be interrupted. |
Memcached master/standby switchover |
memcachedMasterStandbyFailover |
Minor |
The master node was abnormal, promoting the standby node to master. |
Check whether services can recover by themselves. If applications cannot recover, restart them. |
Persistent connections to the instance will be interrupted. |
Redis server abnormal |
redisNodeStatusAbnormal |
Major |
The Redis server status was abnormal. |
Check whether services are affected. If yes, contact O&M personnel. |
If the master node is abnormal, an automatic failover is performed. If a standby node is abnormal and the client directly connects to the standby node for read/write splitting, no data can be read. |
Redis server recovered |
redisNodeStatusNormal |
Major |
The Redis server status recovered. |
Check whether services can recover. If applications cannot reconnect, restart them. |
Recover from an exception. |
Sync failure in data migration |
migrateSyncDataFail |
Major |
Online migration failed. |
Reconfigure the migration task and migrate data again. If the fault persists, contact O&M personnel. |
Data migration fails. |
Memcached instance abnormal |
memcachedInstanceStatusAbnormal |
Major |
The Memcached node status was abnormal. |
Check whether services are affected. If yes, contact O&M personnel. |
The Memcached instance is abnormal and may not be accessed. |
Memcached instance recovered |
memcachedInstanceStatusNormal |
Major |
The Memcached node status recovered. |
Check whether services can recover. If applications cannot reconnect, restart them. |
Recover from an exception. |
Instance backup failure |
instanceBackupFailure |
Major |
The DCS instance fails to be backed up due to an OBS access failure. |
Retry backup manually. |
Automated backup fails. |
Instance node abnormal restart |
instanceNodeAbnormalRestart |
Major |
DCS nodes restarted unexpectedly when they became faulty. |
Check whether services can recover by themselves. If applications cannot recover, restart them. |
Persistent connections to the instance will be interrupted. |
Long-running Lua scripts stopped |
scriptsStopped |
Informational |
Lua scripts that had timed out automatically stopped running. |
Optimize Lua scrips to prevent execution timeout. |
The execution of the lua scripts takes a long time and is forcibly interrupted. If the execution of the lua scripts takes a long time, the entire instance will be blocked. |
Node restarted |
nodeRestarted |
Informational |
After write operations had been performed, the node automatically restarted to stop Lua scripts that had timed out. |
Check whether services can recover by themselves. If applications cannot recover, restart them. |
Persistent connections to the instance will be interrupted. |
Bandwidth scaling |
bandwidthAutoScalingTriggered |
Informational |
The instance bandwidth was used up, triggering bandwidth scaling. |
Check the services on this instance. |
Instance bandwidth used up. The increased bandwidth will be billed. |
Specifications scaled |
specAutoScalingTriggeredSuccess |
Informational |
The instance specifications were scaled successfully. |
Check the services on this instance. |
Instance scaled up. Check its information. |
Scale specifications failed |
specAutoScalingTriggeredFail |
Critical |
The instance specifications fail to be scaled. |
Auto scaling failed. Contact technical support. |
Instance scaling failed. Log in to the console to check whether services are affected. |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot