Updated on 2024-11-13 GMT+08:00
MRS Cluster Alarm Handling Reference
- ALM-12001 Audit Log Dumping Failure
- ALM-12004 OLdap Resource Abnormal
- ALM-12005 OKerberos Resource Abnormal
- ALM-12006 Node Fault
- ALM-12007 Process Fault
- ALM-12010 Manager Heartbeat Interruption Between the Active and Standby Nodes
- ALM-12011 Manager Data Synchronization Exception Between the Active and Standby Nodes
- ALM-12012 NTP Service Is Abnormal
- ALM-12014 Partition Lost
- ALM-12015 Partition Filesystem Readonly
- ALM-12016 CPU Usage Exceeds the Threshold
- ALM-12017 Insufficient Disk Capacity
- ALM-12018 Memory Usage Exceeds the Threshold
- ALM-12027 Host PID Usage Exceeds the Threshold
- ALM-12028 Number of Processes in the D State and Z State on a Host Exceeds the Threshold
- ALM-12033 Slow Disk Fault
- ALM-12034 Periodical Backup Failure
- ALM-12035 Unknown Data Status After Recovery Task Failure
- ALM-12037 NTP Server Abnormal
- ALM-12038 Monitoring Indicator Dumping Failure
- ALM-12039 Active/Standby OMS Databases Not Synchronized
- ALM-12040 Insufficient System Entropy
- ALM-12041 Incorrect Permission on Key Files
- ALM-12042 Incorrect Configuration of Key Files
- ALM-12045 Read Packet Dropped Rate Exceeds the Threshold
- ALM-12046 Write Packet Dropped Rate Exceeds the Threshold
- ALM-12047 Read Packet Error Rate Exceeds the Threshold
- ALM-12048 Write Packet Error Rate Exceeds the Threshold
- ALM-12049 Network Read Throughput Rate Exceeds the Threshold
- ALM-12050 Network Write Throughput Rate Exceeds the Threshold
- ALM-12051 Disk Inode Usage Exceeds the Threshold
- ALM-12052 TCP Temporary Port Usage Exceeds the Threshold
- ALM-12053 Host File Handle Usage Exceeds the Threshold
- ALM-12054 Invalid Certificate File
- ALM-12055 Certificate File Is About to Expire
- ALM-12057 Metadata Not Configured with the Task to Periodically Back Up Data to a Third-Party Server
- ALM-12061 Process Usage Exceeds the Threshold
- ALM-12062 OMS Parameter Configurations Mismatch with the Cluster Scale
- ALM-12063 Unavailable Disk
- ALM-12064 Host Random Port Range Conflicts with Cluster Used Port
- ALM-12066 Trust Relationships Between Nodes Become Invalid
- ALM-12067 Tomcat Resource Is Abnormal
- ALM-12068 ACS Resource Exception
- ALM-12069 AOS Resource Exception
- ALM-12070 Controller Resource Is Abnormal
- ALM-12071 Httpd Resource Is Abnormal
- ALM-12072 FloatIP Resource Is Abnormal
- ALM-12073 CEP Resource Is Abnormal
- ALM-12074 FMS Resource Is Abnormal
- ALM-12075 PMS Resource Is Abnormal
- ALM-12076 GaussDB Resource Is Abnormal
- ALM-12077 User omm Expired
- ALM-12078 Password of User omm Expired
- ALM-12079 User omm Is About to Expire
- ALM-12080 Password of User omm Is About to Expire
- ALM-12081 User ommdba Expired
- ALM-12082 User ommdba Is About to Expire
- ALM-12083 Password of User ommdba Is About to Expire
- ALM-12084 Password of User ommdba Expired
- ALM-12085 Service Audit Log Dump Failure
- ALM-12087 System Is in the Upgrade Observation Period
- ALM-12089 Inter-Node Network Is Abnormal
- ALM-12091 Abnormal disaster Resources
- ALM-12099 core dump Occurred
- ALM-12100 AD Service Connection Failed
- ALM-12101 AZ Unhealthy
- ALM-12102 AZ HA Component Is Not Deployed Based on DR Requirements
- ALM-12103 Executor Resource Exception
- ALM-12104 Abnormal Knox Resources
- ALM-12110 Failed to get ECS temporary AK/SK
- ALM-12172 Failed to Report Metrics to Cloud Eye
- ALM-12180 Suspended Disk I/O
- ALM-12186 CGroup Task Usage Exceeds the Threshold
- ALM-12187 Failed to Expand Disk Partition Capacity
- ALM-12188 diskmgt Disk Monitoring Unavailable
- ALM-12190 Number of Knox Connections Exceeds the Threshold
- ALM-12191 Disk I/O Usage Exceeds the Threshold
- ALM-12192 Host Load Exceeds the Threshold
- ALM-12200 Password Is About to Expire
- ALM-12201 Process CPU Usage Exceeds the Threshold
- ALM-12202 Process Memory Usage Exceeds the Threshold
- ALM-12203 Process Full GC Duration Exceeds the Threshold
- ALM-12204 Wait Duration of a Disk Read Exceeds the Threshold
- ALM-12205 Wait Duration of a Disk Write Exceeds the Threshold
- ALM-12206 Password Has Expired
- ALM-12207 Slow Disk Processing Timeout
- ALM-13000 ZooKeeper Service Unavailable
- ALM-13001 Available ZooKeeper Connections Are Insufficient
- ALM-13002 ZooKeeper Direct Memory Usage Exceeds the Threshold
- ALM-13003 GC Duration of the ZooKeeper Process Exceeds the Threshold
- ALM-13004 ZooKeeper Heap Memory Usage Exceeds the Threshold
- ALM-13005 Failed to Set the Quota of Top Directories of ZooKeeper Components
- ALM-13006 Znode Number or Capacity Exceeds the Threshold
- ALM-13007 Available ZooKeeper Client Connections Are Insufficient
- ALM-13008 ZooKeeper Znode Usage Exceeds the Threshold
- ALM-13009 ZooKeeper Znode Capacity Usage Exceeds the Threshold
- ALM-13010 Znode Usage of a Directory with Quota Configured Exceeds the Threshold
- ALM-14000 HDFS Service Unavailable
- ALM-14001 HDFS Disk Usage Exceeds the Threshold
- ALM-14002 DataNode Disk Usage Exceeds the Threshold
- ALM-14003 Number of Lost HDFS Blocks Exceeds the Threshold
- ALM-14006 Number of HDFS Files Exceeds the Threshold
- ALM-14007 NameNode Heap Memory Usage Exceeds the Threshold
- ALM-14008 DataNode Heap Memory Usage Exceeds the Threshold
- ALM-14009 Number of Dead DataNodes Exceeds the Threshold
- ALM-14010 NameService Service Is Abnormal
- ALM-14011 DataNode Data Directory Is Not Configured Properly
- ALM-14012 JournalNode Is Out of Synchronization
- ALM-14013 Failed to Update the NameNode FsImage File
- ALM-14014 NameNode GC Time Exceeds the Threshold
- ALM-14015 DataNode GC Time Exceeds the Threshold
- ALM-14016 DataNode Direct Memory Usage Exceeds the Threshold
- ALM-14017 NameNode Direct Memory Usage Exceeds the Threshold
- ALM-14018 NameNode Non-heap Memory Usage Exceeds the Threshold
- ALM-14019 DataNode Non-heap Memory Usage Exceeds the Threshold
- ALM-14020 Number of Entries in the HDFS Directory Exceeds the Threshold
- ALM-14021 NameNode Average RPC Processing Time Exceeds the Threshold
- ALM-14022 NameNode Average RPC Queuing Time Exceeds the Threshold
- ALM-14023 Percentage of Total Reserved Disk Space for Replicas Exceeds the Threshold
- ALM-14024 Tenant Space Usage Exceeds the Threshold
- ALM-14025 Tenant File Object Usage Exceeds the Threshold
- ALM-14026 Blocks on DataNode Exceed the Threshold
- ALM-14027 DataNode Disk Fault
- ALM-14028 Number of Blocks to Be Supplemented Exceeds the Threshold
- ALM-14029 Number of Blocks in a Replica Exceeds the Threshold
- ALM-14030 HDFS Allows Write of Single-Replica Data
- ALM-14031 DataNode Process Is Abnormal
- ALM-14032 JournalNode Process Is Abnormal
- ALM-14033 ZKFC Process Is Abnormal
- ALM-14034 Router Process Is Abnormal
- ALM-14035 HttpFS Process Is Abnormal
- ALM-14036 NameNode Is In Safe Mode
- ALM-14037 DataNodes Outside the Cluster
- ALM-14038 Router Heap Memory Usage Exceeds the Threshold
- ALM-14039 Slow DataNodes Exist in the Cluster
- ALM-16000 Percentage of Sessions Connected to the HiveServer to Maximum Number Allowed Exceeds the Threshold
- ALM-16001 Hive Warehouse Space Usage Exceeds the Threshold
- ALM-16002 Hive SQL Execution Success Rate Is Lower Than the Threshold
- ALM-16003 Background Thread Usage Exceeds the Threshold
- ALM-16004 Hive Service Unavailable
- ALM-16005 The Heap Memory Usage of the Hive Process Exceeds the Threshold
- ALM-16006 The Direct Memory Usage of the Hive Process Exceeds the Threshold
- ALM-16007 Hive GC Time Exceeds the Threshold
- ALM-16008 Non-Heap Memory Usage of the Hive Process Exceeds the Threshold
- ALM-16009 Map Number Exceeds the Threshold
- ALM-16045 Hive Data Warehouse Is Deleted
- ALM-16046 Hive Data Warehouse Permission Is Modified
- ALM-16047 HiveServer Has Been Deregistered from ZooKeeper
- ALM-16048 Tez or Spark Library Path Does Not Exist
- ALM-16051 Percentage of Sessions Connected to MetaStore Exceeds the Threshold
- ALM-16052 Latency for MetaStore to Access the Meta Database During Table Creation Exceeds the Threshold
- ALM-16053 Average HQL Submission Time of Hive in the Last 5 Minutes Exceeds the Threshold
- ALM-17003 Oozie Service Unavailable
- ALM-17004 Oozie Heap Memory Usage Exceeds the Threshold
- ALM-17005 Oozie Non Heap Memory Usage Exceeds the Threshold
- ALM-17006 Oozie Direct Memory Usage Exceeds the Threshold
- ALM-17007 Garbage Collection (GC) Time of the Oozie Process Exceeds the Threshold
- ALM-17008 Abnormal Connection Between Oozie and ZooKeeper
- ALM-17009 Abnormal Connection Between Oozie and DBService
- ALM-17010 Abnormal Connection Between Oozie and HDFS
- ALM-17011 Abnormal Connection Between Oozie and Yarn
- ALM-18000 Yarn Service Unavailable
- ALM-18002 NodeManager Heartbeat Lost
- ALM-18003 NodeManager Unhealthy
- ALM-18008 Heap Memory Usage of ResourceManager Exceeds the Threshold
- ALM-18009 Heap Memory Usage of JobHistoryServer Exceeds the Threshold
- ALM-18010 ResourceManager GC Time Exceeds the Threshold
- ALM-18011 NodeManager GC Time Exceeds the Threshold
- ALM-18012 JobHistoryServer GC Time Exceeds the Threshold
- ALM-18013 ResourceManager Direct Memory Usage Exceeds the Threshold
- ALM-18014 NodeManager Direct Memory Usage Exceeds the Threshold
- ALM-18015 JobHistoryServer Direct Memory Usage Exceeds the Threshold
- ALM-18016 Non Heap Memory Usage of ResourceManager Exceeds the Threshold
- ALM-18017 Non Heap Memory Usage of NodeManager Exceeds the Threshold
- ALM-18018 NodeManager Heap Memory Usage Exceeds the Threshold
- ALM-18019 Non Heap Memory Usage of JobHistoryServer Exceeds the Threshold
- ALM-18020 Yarn Task Execution Timeout
- ALM-18021 Mapreduce Service Unavailable
- ALM-18022 Insufficient Yarn Queue Resources
- ALM-18023 Number of Pending Yarn Tasks Exceeds the Threshold
- ALM-18024 Pending Yarn Memory Usage Exceeds the Threshold
- ALM-18025 Number of Terminated Yarn Tasks Exceeds the Threshold
- ALM-18026 Number of Failed Yarn Tasks Exceeds the Threshold
- ALM-18027 JobHistoryServer Process Is Abnormal
- ALM-18028 TimeLineServer Process Is Abnormal
- ALM-19000 HBase Service Unavailable
- ALM-19006 HBase Replication Sync Failed
- ALM-19007 HBase GC Time Exceeds the Threshold
- ALM-19008 Heap Memory Usage of the HBase Process Exceeds the Threshold
- ALM-19009 Direct Memory Usage of the HBase Process Exceeds the Threshold
- ALM-19011 RegionServer Region Number Exceeds the Threshold
- ALM-19012 HBase System Table Directory or File Lost
- ALM-19013 Duration of Regions in transaction State Exceeds the Threshold
- ALM-19014 Capacity Quota Usage on ZooKeeper Exceeds the Threshold Severely
- ALM-19015 Quantity Quota Usage on ZooKeeper Exceeds the Threshold
- ALM-19016 Quantity Quota Usage on ZooKeeper Exceeds the Threshold Severely
- ALM-19017 Capacity Quota Usage on ZooKeeper Exceeds the Threshold
- ALM-19018 HBase Compaction Queue Size Exceeds the Threshold
- ALM-19019 Number of HBase HFiles to Be Synchronized Exceeds the Threshold
- ALM-19020 Number of HBase WAL Files to Be Synchronized Exceeds the Threshold
- ALM-19021 Handler Usage of RegionServer Exceeds the Threshold
- ALM-19022 HBase Hotspot Detection Is Unavailable
- ALM-19023 Region Traffic Restriction for HBase
- ALM-19024 RPC Requests P99 Latency on RegionServer Exceeds the Threshold
- ALM-19025 Damaged StoreFile in HBase
- ALM-19026 Damaged WAL Files in HBase
- ALM-19030 P99 Latency of RegionServer RPC Request Exceeds the Threshold
- ALM-19031 Number of RegionServer RPC Connections Exceeds the Threshold
- ALM-19032 Number of Tasks in the RegionServer RPC Write Queue Exceeds the Threshold
- ALM-19033 Number of Tasks in the RegionServer RPC Read Queue Exceeds the Threshold
- ALM-19034 Number of RegionServer WAL Write Timeouts Exceeds the Threshold
- ALM-19035 Size of the RegionServer Call Queue Exceeds the Threshold
- ALM-19036 Bad Blocks Exist in HBase Key Directory Data
- ALM-20002 Hue Service Unavailable
- ALM-23001 Loader Service Unavailable
- ALM-23003 Loader Task Execution Failure
- ALM-23004 Loader Heap Memory Usage Exceeds the Threshold
- ALM-23005 Loader Non-Heap Memory Usage Exceeds the Threshold
- ALM-23006 Loader Direct Memory Usage Exceeds the Threshold
- ALM-23007 Garbage Collection (GC) Time of the Loader Process Exceeds the Threshold
- ALM-24000 Flume Service Unavailable
- ALM-24001 Flume Agent Exception
- ALM-24003 Flume Client Connection Interrupted
- ALM-24004 Exception Occurs When Flume Reads Data
- ALM-24005 Exception Occurs When Flume Transmits Data
- ALM-24006 Heap Memory Usage of Flume Server Exceeds the Threshold
- ALM-24007 Flume Server Direct Memory Usage Exceeds the Threshold
- ALM-24008 Flume Server Non Heap Memory Usage Exceeds the Threshold
- ALM-24009 Flume Server Garbage Collection (GC) Time Exceeds the Threshold
- ALM-24010 Flume Certificate File Is Invalid or Damaged
- ALM-24011 Flume Certificate File Is About to Expire
- ALM-24012 Flume Certificate File Has Expired
- ALM-24013 Flume MonitorServer Certificate File Is Invalid or Damaged
- ALM-24014 Flume MonitorServer Certificate Is About to Expire
- ALM-24015 Flume MonitorServer Certificate File Has Expired
- ALM-25000 LdapServer Service Unavailable
- ALM-25004 Abnormal LdapServer Data Synchronization
- ALM-25005 nscd Service Exception
- ALM-25006 Sssd Service Exception
- ALM-25007 Number of SlapdServer Connections Exceeds the Threshold
- ALM-25008 SlapdServer CPU Usage Exceeds the Threshold
- ALM-25500 KrbServer Service Unavailable
- ALM-25501 Too Many KerberosServer Requests
- ALM-26051 Storm Service Unavailable
- ALM-26052 Number of Available Supervisors of the Storm Service Is Less Than the Threshold
- ALM-26053 Storm Slot Usage Exceeds the Threshold
- ALM-26054 Nimbus Heap Memory Usage Exceeds the Threshold
- ALM-27001 DBService Service Unavailable
- ALM-27003 DBService Heartbeat Interruption Between the Active and Standby Nodes
- ALM-27004 Data Inconsistency Between Active and Standby DBServices
- ALM-27005 Database Connections Usage Exceeds the Threshold
- ALM-27006 Disk Space Usage of the Data Directory Exceeds the Threshold
- ALM-27007 Database Enters the Read-Only Mode
- ALM-29000 Impala Service Unavailable
- ALM-29004 Impalad Process Memory Usage Exceeds the Threshold
- ALM-29005 Number of JDBC Connections to Impalad Exceeds the Threshold
- ALM-29006 Number of ODBC Connections to Impalad Exceeds the Threshold
- ALM-29010 Number of Queries Being Submitted by Impalad Exceeds the Threshold
- ALM-29011 Number of Queries Being Executed by Impalad Exceeds the Threshold
- ALM-29012 Number of Queries Being Waited by Impalad Exceeds the Threshold
- ALM-29013 Impalad FGC Time Exceeds the Threshold
- ALM-29014 Catalog FGC Time Exceeds the Threshold
- ALM-29015 Catalog Process Memory Usage Exceeds the Threshold
- ALM-29016 Impalad Instance in the Sub-healthy State
- ALM-29100 Kudu Service Unavailable
- ALM-29104 Tserver Process Memory Usage Exceeds the Threshold
- ALM-29106 Tserver Process CPU Usage Exceeds the Threshold
- ALM-29107 Tserver Process Memory Usage Exceeds the Threshold
- ALM-38000 Kafka Service Unavailable
- ALM-38001 Insufficient Kafka Disk Capacity
- ALM-38002 Kafka Heap Memory Usage Exceeds the Threshold
- ALM-38004 Kafka Direct Memory Usage Exceeds the Threshold
- ALM-38005 GC Duration of the Broker Process Exceeds the Threshold
- ALM-38006 Percentage of Kafka Partitions That Are Not Completely Synchronized Exceeds the Threshold
- ALM-38007 Status of Kafka Default User Is Abnormal
- ALM-38008 Abnormal Kafka Data Directory Status
- ALM-38009 Busy Broker Disk I/Os (Applicable to Versions Later Than MRS 3.1.0)
- ALM-38009 Kafka Topic Overload (Applicable to MRS 3.1.0 and Earlier Versions)
- ALM-38010 Topics with Single Replica
- ALM-38011 User Connection Usage on Broker Exceeds the Threshold
- ALM-38012 Number of Broker Partitions Exceeds the Threshold
- ALM-38013 Produce Request Latency in the Request Queue Exceeds the Threshold
- ALM-38014 Total Produce Request Latency Exceeds the Threshold
- ALM-38015 Fetch Request Latency in the Request Queue Exceeds the Threshold
- ALM-38016 Total Fetch Request Latency Exceeds the Threshold
- ALM-38017 Partition Reassignment Duration Exceeds the Threshold
- ALM-38018 Kafka Consumer Lag
- ALM-43001 Spark2x Service Unavailable
- ALM-43006 Heap Memory Usage of the JobHistory2x Process Exceeds the Threshold
- ALM-43007 Non-Heap Memory Usage of the JobHistory2x Process Exceeds the Threshold
- ALM-43008 The Direct Memory Usage of the JobHistory2x Process Exceeds the Threshold
- ALM-43009 JobHistory2x Process GC Time Exceeds the Threshold
- ALM-43010 Heap Memory Usage of the JDBCServer2x Process Exceeds the Threshold
- ALM-43011 Non-Heap Memory Usage of the JDBCServer2x Process Exceeds the Threshold
- ALM-43012 Direct Heap Memory Usage of the JDBCServer2x Process Exceeds the Threshold
- ALM-43013 JDBCServer2x Process GC Time Exceeds the Threshold
- ALM-43017 JDBCServer2x Process Full GC Number Exceeds the Threshold
- ALM-43018 JobHistory2x Process Full GC Number Exceeds the Threshold
- ALM-43019 Heap Memory Usage of the IndexServer2x Process Exceeds the Threshold
- ALM-43020 Non-Heap Memory Usage of the IndexServer2x Process Exceeds the Threshold
- ALM-43021 Direct Memory Usage of the IndexServer2x Process Exceeds the Threshold
- ALM-43022 IndexServer2x Process GC Time Exceeds the Threshold
- ALM-43023 IndexServer2x Process Full GC Number Exceeds the Threshold
- ALM-43028 JDBCServer Session Overflow
- ALM-43029 JDBCServer Job Submission Timed Out
- ALM-44000 Presto Service Unavailable
- ALM-44004 Presto Coordinator Resource Group Queuing Tasks Exceed the Threshold
- ALM-44005 Presto Coordinator Process GC Time Exceeds the Threshold
- ALM-44006 Presto Worker Process GC Time Exceeds the Threshold
- ALM-45000 HetuEngine Service Unavailable
- ALM-45001 Faulty HetuEngine Compute Instances
- ALM-45003 HetuEngine QAS Disk Capacity Is Insufficient
- ALM-45004 Tasks Stacked on HetuEngine Compute Instance
- ALM-45005 CPU Usage of HetuEngine Compute Instance Exceeded the Threshold
- ALM-45006 Memory Usage of a HetuEngine Compute Instance Exceeded the Threshold
- ALM-45007 Number of Workers of a HetuEngine Compute Instance Is Less Than the Threshold
- ALM-45008 Query Latency of HetuEngine Compute Instances Exceeds the Threshold
- ALM-45009 Task Failure Rate of HetuEngine Compute Instances Exceeds the Threshold
- ALM-45175 Average Time for Calling OBS Metadata APIs Is Greater than the Threshold
- ALM-45176 Success Rate of Calling OBS Metadata APIs Is Lower than the Threshold
- ALM-45177 Success Rate of Calling OBS Data Read APIs Is Lower than the Threshold
- ALM-45178 Success Rate of Calling OBS Data Write APIs Is Lower Than the Threshold
- ALM-45179 Number of Failed OBS readFully API Calls Exceeds the Threshold
- ALM-45180 Number of Failed OBS read API Calls Exceeds the Threshold
- ALM-45181 Number of Failed OBS write API Calls Exceeds the Threshold
- ALM-45182 Number of Throttled OBS Operations Exceeds the Threshold
- ALM-45275 Ranger Service Unavailable
- ALM-45276 Abnormal RangerAdmin Status
- ALM-45277 RangerAdmin Heap Memory Usage Exceeds the Threshold
- ALM-45278 RangerAdmin Direct Memory Usage Exceeds the Threshold
- ALM-45279 RangerAdmin Non Heap Memory Usage Exceeds the Threshold
- ALM-45280 RangerAdmin GC Duration Exceeds the Threshold
- ALM-45281 UserSync Heap Memory Usage Exceeds the Threshold
- ALM-45282 UserSync Direct Memory Usage Exceeds the Threshold
- ALM-45283 UserSync Non Heap Memory Usage Exceeds the Threshold
- ALM-45284 UserSync Garbage Collection (GC) Time Exceeds the Threshold
- ALM-45285 TagSync Heap Memory Usage Exceeds the Threshold
- ALM-45286 TagSync Direct Memory Usage Exceeds the Threshold
- ALM-45287 TagSync Non Heap Memory Usage Exceeds the Threshold
- ALM-45288 TagSync Garbage Collection (GC) Time Exceeds the Threshold
- ALM-45289 PolicySync Heap Memory Usage Exceeds the Threshold
- ALM-45290 PolicySync Direct Memory Usage Exceeds the Threshold
- ALM-45291 PolicySync Non-Heap Memory Usage Exceeds the Threshold
- ALM-45292 PolicySync GC Duration Exceeds the Threshold
- ALM-45293 Ranger User Synchronization Exception
- ALM-45294 RangerKMS Process Is Abnormal
- ALM-45325 Presto Service Unavailable
- ALM-45326 Number of Presto Coordinator Threads Exceeds the Threshold
- ALM-45327 Presto Coordinator Process GC Time Exceeds the Threshold
- ALM-45328 Presto Worker Process GC Time Exceeds the Threshold
- ALM-45329 Presto Coordinator Resource Group Queuing Tasks Exceed the Threshold
- ALM-45330 Number of Presto Worker Threads Exceeds the Threshold
- ALM-45331 Number of Presto Worker1 Threads Exceeds the Threshold
- ALM-45332 Number of Presto Worker2 Threads Exceeds the Threshold
- ALM-45333 Number of Presto Worker3 Threads Exceeds the Threshold
- ALM-45334 Number of Presto Worker4 Threads Exceeds the Threshold
- ALM-45335 Presto Worker1 Process GC Time Exceeds the Threshold
- ALM-45336 Presto Worker2 Process GC Time Exceeds the Threshold
- ALM-45337 Presto Worker3 Process GC Time Exceeds the Threshold
- ALM-45338 Presto Worker4 Process GC Time Exceeds the Threshold
- ALM-45425 ClickHouse Service Unavailable
- ALM-45426 ClickHouse Service Quantity Quota Usage in ZooKeeper Exceeds the Threshold
- ALM-45427 ClickHouse Service Capacity Quota Usage in ZooKeeper Exceeds the Threshold
- ALM-45428 ClickHouse Disk I/O Exception
- ALM-45429 Table Metadata Synchronization Failed on the Added ClickHouse Node
- ALM-45430 Permission Metadata Synchronization Failed on the Added ClickHouse Node
- ALM-45431 Improper ClickHouse Instance Distribution for Topology Allocation
- ALM-45432 ClickHouse User Synchronization Process Fails
- ALM-45433 ClickHouse AZ Topology Exception
- ALM-45434 A Single Replica Exists in the ClickHouse Data Table
- ALM-45435 Inconsistent Metadata of ClickHouse Tables
- ALM-45436 Skew ClickHouse Table Data
- ALM-45437 Excessive Parts in the ClickHouse Table
- ALM-45438 ClickHouse Disk Usage Exceeds 80%
- ALM-45439 ClickHouse Node Enters the Read-Only Mode
- ALM-45440 Inconsistency Between ClickHouse Replicas
- ALM-45441 Zookeeper Disconnected
- ALM-45442 Too Many Concurrent SQL Statements
- ALM-45443 Slow SQL Queries in the Cluster
- ALM-45444 Abnormal ClickHouse Process
- ALM-45445 Failed to Send Data Files to Remote Shards When ClickHouse Writes Data to a Distributed Table
- ALM-45446 Mutation Task of ClickHouse Is Not Complete for a Long Time
- ALM-45447 ClickHouse Table Read-Only
- ALM-45448 Rapid Increase of Znodes Used by ClickHouse
- ALM-45449 The Counter Number of zxid Used by ClickHouse Exceeds the Threshold
- ALM-45450 ClickHouse Failed to Obtain a Temporary Agency Credential
- ALM-45451 ClickHouse Failed to Access OBS
- ALM-45452 ClickHouse's Local Disk Space Is Below the Cold-Hot Separation Threshold
- ALM-45585 IoTDB Service Unavailable
- ALM-45586 IoTDBServer Heap Memory Usage Exceeds the Threshold
- ALM-45587 IoTDBServer GC Duration Exceeds the Threshold
- ALM-45588 IoTDBServer Direct Memory Usage Exceeds the Threshold
- ALM-45589 ConfigNode Heap Memory Usage Exceeds the Threshold
- ALM-45590 ConfigNode GC Duration Exceeds the Threshold
- ALM-45591 ConfigNode Direct Memory Usage Exceeds the Threshold
- ALM-45592 IoTDBServer RPC Execution Duration Exceeds the Threshold
- ALM-45593 IoTDBServer Flush Execution Duration Exceeds the Threshold
- ALM-45594 IoTDBServer Intra-Space Merge Duration Exceeds the Threshold
- ALM-45595 IoTDBServer Cross-Space Merge Duration Exceeds the Threshold
- ALM-45596 Procedure Execution Failed
- ALM-45615 CDL Service Unavailable
- ALM-45616 CDL Job Execution Exception
- ALM-45617 Data Queued in the CDL Replication Slot Exceeds the Threshold
- ALM-45635 FlinkServer Job Execution Failure
- ALM-45636 Flink Job Checkpoints Keep Failing
- ALM-45636 Number of Consecutive Checkpoint Failures of a Flink Job Exceeds the Threshold
- ALM-45637 FlinkServer Task Is Continuously Under Back Pressure
- ALM-45638 Number of Restarts After FlinkServer Job Failures Exceeds the Threshold
- ALM-45638 Number of Restarts After Flink Job Failures Exceeds the Threshold
- ALM-45639 Checkpointing of a Flink Job Times Out
- ALM-45640 FlinkServer Heartbeat Interruption Between the Active and Standby Nodes
- ALM-45641 Data Synchronization Exception Between the Active and Standby FlinkServer Nodes
- ALM-45642 RocksDB Continuously Triggers Write Traffic Limiting
- ALM-45643 MemTable Size of RocksDB Continuously Exceeds the Threshold
- ALM-45644 Number of SST Files at Level 0 of RocksDB Continuously Exceeds the Threshold
- ALM-45645 Pending Flush Size of RocksDB Continuously Exceeds the Threshold
- ALM-45646 Pending Compaction Size of RocksDB Continuously Exceeds the Threshold
- ALM-45647 Estimated Pending Compaction Size of RocksDB Continuously Exceeds the Threshold
- ALM-45648 RocksDB Frequently Encounters Write-Stopped
- ALM-45649 P95 Latency of RocksDB Get Requests Continuously Exceeds the Threshold
- ALM-45650 P95 Latency of RocksDB Write Requests Continuously Exceeds the Threshold
- ALM-45652 Flink Service Unavailable
- ALM-45653 Invalid Flink HA Certificate File
- ALM-45654 Flink HA Certificate Is About to Expire
- ALM-45655 Flink HA Certificate File Has Expired
- ALM-45736 Guardian Service Unavailable
- ALM-45737 TokenServer Heap Memory Usage Exceeds the Threshold
- ALM-45738 TokenServer Direct Memory Usage Exceeds the Threshold
- ALM-45739 TokenServer Non-Heap Memory Usage Exceeds the Threshold
- ALM-45740 TokenServer GC Duration Exceeds the Threshold
- ALM-45741 Failed to Call the ECS securitykey API
- ALM-45742 Failed to Call the ECS Metadata API
- ALM-45743 Failed to Call the IAM API
- ALM-45744 Average RPC Processing Time of the Guardian TokenServer Exceeds the Threshold
- ALM-45745 Average RPC Queuing Time of the Guardian TokenServer Exceeds the Threshold
- ALM-47001 MemArtsCC Service Unavailable
- ALM-47002 MemArtsCC Disk Fault
- ALM-47003 Memory Usage of the MemArtsCC Worker Process Exceeds the Threshold
- ALM-47004 Average Latency of MemArtsCC Worker Read Requests Exceeds the Threshold
- ALM-50201 Doris Service Unavailable
- ALM-50202 FE CPU Usage Exceeds the Threshold
- ALM-50203 FE Memory Usage Exceeds the Threshold
- ALM-50205 BE CPU Usage Exceeds the Threshold
- ALM-50206 BE Memory Usage Exceeds the Threshold
- ALM-50207 Ratio of Connections to the FE MySQL Port to the Maximum Connections Allowed Exceeds the Threshold
- ALM-50208 Failures to Clear Historical Metadata Image Files Exceed the Threshold
- ALM-50209 Failures to Generate Metadata Image Files Exceed the Threshold
- ALM-50210 Maximum Compaction Score of All BE Nodes Exceeds the Threshold
- ALM-50211 FE Queue Length of BE Periodic Report Tasks Exceeds the Threshold
- ALM-50212 Accumulated Old-Generation GC Duration of the FE Process Exceeds the Threshold
- ALM-50213 Number of Tasks Queuing in the FE Thread Pool for Interacting with BE Exceeds the Threshold
- ALM-50214 Number of Tasks Queuing in the FE Thread Pool for Task Processing Exceeds the Threshold
- ALM-50215 Longest Duration of RPC Requests Received by Each FE Thrift Method Exceeds the Threshold
- ALM-50216 Memory Usage of the FE Node Exceeds the Threshold
- ALM-50217 Heap Memory Usage of the FE Node Exceeds the Threshold
- ALM-50219 Length of the Queue in the Thread Pool for Query Execution Exceeds the Threshold
- ALM-50220 Error Rate of TCP Packet Receiving Exceeds the Threshold
- ALM-50221 BE Data Disk Usage Exceeds the Threshold
- ALM-50222 Disk Status of a Specified Data Directory on BE Is Abnormal
- ALM-50223 Maximum Memory Required by BE Is Greater Than the Remaining Memory of the Machine
- ALM-50224 Failures a Certain Task Type on BE Are Increasing
- ALM-50225 FE Instance Fault
- ALM-50226 BE Instance Fault
- ALM-50227 Concurrent Doris Tenant Queries Exceeds the Threshold
- ALM-50228 Memory Usage of a Doris Tenant Exceeds the Threshold
- ALM-50229 Doris FE Failed to Connect to OBS
- ALM-50230 Doris BE Cannot Connect to OBS
- ALM-50231 Abnormal Tablets Exist in Doris
- ALM-50232 Large Tablets in Doris
- ALM-50401 Number of JobServer Jobs Waiting to Be Executed Exceeds the Threshold
- ALM-50402 JobGateway Service Unavailable
- ALM-12001 Audit Log Dump Failure (For MRS 2.x or Earlier)
- ALM-12002 HA Resource Abnormal (For MRS 2.x or Earlier)
- ALM-12004 OLdap Resource Abnormal (For MRS 2.x or Earlier)
- ALM-12005 OKerberos Resource Abnormal (For MRS 2.x or Earlier)
- ALM-12006 Node Fault (For MRS 2.x or Earlier)
- ALM-12007 Process Fault (For MRS 2.x or Earlier)
- ALM-12010 Manager Heartbeat Interruption Between the Active and Standby Nodes (For MRS 2.x or Earlier)
- ALM-12011 Data Synchronization Exception Between the Active and Standby Manager Nodes (For MRS 2.x or Earlier)
- ALM-12012 NTP Service Abnormal (For MRS 2.x or Earlier)
- ALM-12014 Device Partition Lost (For MRS 2.x or Earlier)
- ALM-12015 Device Partition File System Read-Only (For MRS 2.x or Earlier)
- ALM-12016 CPU Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12017 Insufficient Disk Capacity (For MRS 2.x or Earlier)
- ALM-12018 Memory Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12027 Host PID Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12028 Number of Processes in the D State on the Host Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12031 User omm or Password Is About to Expire (For MRS 2.x or Earlier)
- ALM-12032 User ommdba or Password Is About to Expire (For MRS 2.x or Earlier)
- ALM-12033 Slow Disk Fault (For MRS 2.x or Earlier)
- ALM-12034 Periodic Backup Failure (For MRS 2.x or Earlier)
- ALM-12035 Unknown Data Status After Recovery Task Failure (For MRS 2.x or Earlier)
- ALM-12037 NTP Server Abnormal (For MRS 2.x or Earlier)
- ALM-12038 Monitoring Indicator Dump Failure (For MRS 2.x or Earlier)
- ALM-12039 GaussDB Data Is Not Synchronized (For MRS 2.x or Earlier)
- ALM-12040 Insufficient System Entropy (For MRS 2.x or Earlier)
- ALM-12041 Permission of Key Files Is Abnormal (For MRS 2.x or Earlier)
- ALM-12042 Key File Configurations Are Abnormal (For MRS 2.x or Earlier)
- ALM-12043 DNS Parsing Duration Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12045 Read Packet Dropped Rate Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12046 Write Packet Dropped Rate Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12047 Read Packet Error Rate Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12048 Write Packet Error Rate Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12049 Read Throughput Rate Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12050 Write Throughput Rate Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12051 Disk Inode Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12052 Usage of Temporary TCP Ports Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12053 File Handle Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-12054 Invalid Certificate File (For MRS 2.x or Earlier)
- ALM-12055 Certificate File Is About to Expire (For MRS 2.x or Earlier)
- ALM-12180 Disk Card I/O (For MRS 2.x or Earlier)
- ALM-12357 Failed to Export Audit Logs to OBS (For MRS 2.x or Earlier)
- ALM-13000 ZooKeeper Service Unavailable (For MRS 2.x or Earlier)
- ALM-13001 Available ZooKeeper Connections Are Insufficient (For MRS 2.x or Earlier)
- ALM-13002 ZooKeeper Memory Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-14000 HDFS Service Unavailable (For MRS 2.x or Earlier)
- ALM-14001 HDFS Disk Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-14002 DataNode Disk Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-14003 Number of Lost HDFS Blocks Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-14004 Number of Damaged HDFS Blocks Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-14006 Number of HDFS Files Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-14007 HDFS NameNode Memory Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-14008 HDFS DataNode Memory Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-14009 Number of Faulty DataNodes Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-14010 NameService Is Abnormal (For MRS 2.x or Earlier)
- ALM-14011 HDFS DataNode Data Directory Is Not Configured Properly (For MRS 2.x or Earlier)
- ALM-14012 HDFS Journalnode Data Is Not Synchronized (For MRS 2.x or Earlier)
- ALM-16000 Percentage of Sessions Connected to the HiveServer to the Maximum Number Allowed Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-16001 Hive Warehouse Space Usage Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-16002 Hive SQL Execution Success Rate Is Lower Than the Threshold (For MRS 2.x or Earlier)
- ALM-16004 Hive Service Unavailable (For MRS 2.x or Earlier)
- ALM-16005 Number of Failed Hive SQL Executions in the Last Period Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-18000 Yarn Service Unavailable (For MRS 2.x or Earlier)
- ALM-18002 NodeManager Heartbeat Lost (For MRS 2.x or Earlier)
- ALM-18003 NodeManager Unhealthy (For MRS 2.x or Earlier)
- ALM-18004 NodeManager Disk Usability Ratio Is Lower Than the Threshold (For MRS 2.x or Earlier)
- ALM-18006 MapReduce Job Execution Timeout (For MRS 2.x or Earlier)
- ALM-18008 Heap Memory Usage of Yarn ResourceManager Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-18009 Heap Memory Usage of MapReduce JobHistoryServer Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-18010 Number of Pending Yarn Tasks Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-18011 Memory of Pending Yarn Tasks Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-18012 Number of Terminated Yarn Tasks in the Last Period Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-18013 Number of Failed Yarn Tasks in the Last Period Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-19000 HBase Service Unavailable (For MRS 2.x or Earlier)
- ALM-19006 HBase Replication Sync Failed (For MRS 2.x or Earlier)
- ALM-19007 HBase Merge Queue Exceeds the Threshold (for 2.x and Earlier Versions)
- ALM-20002 Hue Service Unavailable (For MRS 2.x or Earlier)
- ALM-23001 Loader Service Unavailable (For MRS 2.x or Earlier)
- ALM-24000 Flume Service Unavailable (For MRS 2.x or Earlier)
- ALM-24001 Flume Agent Is Abnormal (For MRS 2.x or Earlier)
- ALM-24003 Flume Client Connection Interrupted (For MRS 2.x or Earlier)
- ALM-24004 Flume Fails to Read Data (For MRS 2.x or Earlier)
- ALM-24005 Data Transmission by Flume Is Abnormal (For MRS 2.x or Earlier)
- ALM-25000 LdapServer Service Unavailable (For MRS 2.x or Earlier)
- ALM-25004 Abnormal LdapServer Data Synchronization (For MRS 2.x or Earlier)
- ALM-25500 KrbServer Service Unavailable (For MRS 2.x or Earlier)
- ALM-26051 Storm Service Unavailable (For MRS 2.x or Earlier)
- ALM-26052 Number of Available Supervisors in Storm Is Lower Than the Threshold (For MRS 2.x or Earlier)
- ALM-26053 Slot Usage of Storm Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-26054 Heap Memory Usage of Storm Nimbus Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-27001 DBService Unavailable (For MRS 2.x or Earlier)
- ALM-27003 DBService Heartbeat Interruption Between the Active and Standby Nodes (For MRS 2.x or Earlier)
- ALM-27004 Data Inconsistency Between Active and Standby DBServices (For MRS 2.x or Earlier)
- ALM-28001 Spark Service Unavailable (For MRS 2.x or Earlier)
- ALM-38000 Kafka Service Unavailable (For MRS 2.x or Earlier)
- ALM-38001 Insufficient Kafka Disk Capacity (For MRS 2.x or Earlier)
- ALM-38002 Heap Memory Usage of Kafka Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-43001 Spark Service Unavailable (For MRS 2.x or Earlier)
- ALM-43006 Heap Memory Usage of the JobHistory Process Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-43007 Non-Heap Memory Usage of the JobHistory Process Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-43008 Direct Memory Usage of the JobHistory Process Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-43009 JobHistory GC Time Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-43010 Heap Memory Usage of the JDBCServer Process Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-43011 Non-Heap Memory Usage of the JDBCServer Process Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-43012 Direct Memory Usage of the JDBCServer Process Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-43013 JDBCServer GC Time Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-44004 Presto Coordinator Resource Group Queuing Tasks Exceed the Threshold (For MRS 2.x or Earlier)
- ALM-44005 Presto Coordinator Process GC Time Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-44006 Presto Worker Process GC Time Exceeds the Threshold (For MRS 2.x or Earlier)
- ALM-45325 Presto Service Unavailable (For MRS 2.x or Earlier)
Parent topic: MRS Cluster O&M
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
The system is busy. Please try again later.
For any further questions, feel free to contact us through the chatbot.
Chatbot