Features
Table 1 describes the commonly used features of COC.
Feature |
Description |
Region Availability |
---|---|---|
Overview |
The following feature modules are available on the COC overview page: resource overview, resource monitoring, application monitoring, security overview, quick entries, and more. You can view and perform operations on work items with ease on the overview page, enjoying simplified and highly efficient O&M. |
Global |
Resource management |
COC provides a resource management view that is bolstered by management capabilities for various resources. By using this feature, you can create resource topologies, aggregate resources by resource type, query resources from the resource list by resource tag, and install the UniAgent components. |
Global |
Application management |
COC provides an application-centric resource management view that is bolstered by the capability of modeling the association between applications and resources. By using this feature, you can manage your resources by application, region, resource group, or resource model, query resources in a resource list by tag, and install the UniAgent components. |
Global |
Patch management |
Manage patches on ECSs, scan OS compliance, and repair OSs whose patches are non-compliant. |
Global |
Batch operations on ECSs |
Batch manage ECSs, including batch starting, stopping, and restarting ECSs, and switching and reinstalling OSs for ECSs. |
Global |
Batch operations on RDS DB instances |
Batch manage RDS DB instances, including starting, stopping, and restarting RDS DB instances in batches. |
Global |
Batch operations on FlexusL instances |
Manage FlexusL instances, including starting, stopping, and restarting instances, and reinstalling OSs in batches. |
Global |
Script management |
Create, modify, and delete scripts, and execute your own and public scripts on VMs (Script management is only allowed on ECSs currently.) |
Global |
Job management |
Create, modify, and delete jobs, and execute jobs on VMs (Job management is only allowed on ECSs currently.) |
Global |
Scheduled O&M |
You can either select job or script execution tasks from existing tasks or create such tasks. There are two task execution methods available: one-time execution and periodic execution. Periodic task execution includes execution using Cron expressions and simple periodic execution. |
Global |
Parameter center |
Manage parameters throughout the whole service lifecycle in regions to continuously monitor parameter correctness and consistency. You can quickly reference O&M scenarios such as job orchestration. |
Global |
Incident center |
You can check all incidents on the incident dashboard in the COC incident center. You can also manually handle incidents, associate incidents with jobs, escalate or de-escalate incidents, forward incidents to their owners, check handling records of incidents, and initiate war room requests with just a few clicks. |
Global |
Alarm center |
Clean raw alarms based on alarm conversion rules and then create alarms. Alarms can be allocated to O&M engineer shifts or individuals so that alarm owners are clear. You can manually clear alarms, convert alarms to incident tickets, or use the automated alarm handling feature. |
Global |
War rooms |
When there is a major or critical fault, a war room can be set up to quickly convene experts such as fault analysis members and application SRE engineers to rectify the fault. This improves the efficiency of collaborative communication, fault diagnosis and demarcation, and fault handling. War rooms also enable you to quickly detect and respond to incidents, shortening the MTTR. |
Global |
Improvement ticket management |
Improvement ticket management is the process of tracking and closing improvement tickets for product, O&M, or management issues found during incident or war room handling, or during drills. |
Global |
Issue management |
Issue management is the process of first discovering issues such as product function defects and poor performance issues during the use of software products, and then recording the fault root causes and resolving the issues during the application. Setting up war rooms is mainly used to reduce the number of product or service faults on the live network. This improves the overall service quality, promote the continuous improvement of product or application quality, and prevent issues from recurring. |
Global |
Alarm conversion rules |
Alarm conversion rules suppress, reduce noise, deduplicate, and distribute routes for all received raw alarms. Vertical suppression and horizontal convergence of multiple monitoring sources are supported for multi-dimensional noise reduction. When configuring an incident forwarding rule, you can specify default objects for assigning incidents and configure notification policy for precise accurate notification. |
Global |
Data source management |
Quickly integrate with existing or external monitoring systems with ease for centralized alarm management. Each monitoring system employs distinct integration access keys for seamless interconnectivity. |
Global |
Change management |
The change center provides a unified platform for engineers to manage change tasks. With the change center, engineers can submit tickets to manage change requests, review, and execution. |
Global |
Chaos drills |
Configure fault drill templates and attack templates and perform fault drills on physical machines, VMs, or Cloud Container Engine (CCE) containers using the templates. You can also manage failure modes. |
Global |
To-do center |
On the to-do task dashboard, you can view the handling status of to-do tasks, historical to-do task statistics, and overview of all to-do tasks. You can also manually create to-do tasks. |
Global |
Execution records |
On the execution record page, you can query service ticket records of operations on patches, scripts, jobs, and ECSs, and view service ticket details. |
Global |
Personnel management |
Centrally manage O&M engineers on COC using this feature. You can manage users of the current tenant on the O&M Engineer Management page. The basic user data on the O&M Engineer Management page is synchronized from Identity and Access Management (IAM) and is used by multiple basic functional modules in creating to-do tasks, performing scheduled O&M, managing notifications, managing incidents, and more. |
Global |
Shift schedule management |
Manage O&M personnel centrally, from multiple dimensions, in different forms, or based on your other custom requirements. You can also create shift scenarios and roles and add personnel managed on the Personnel Management page to the scenarios and roles as required. |
Global |
Notification management |
The notification management module allows you to create notification subscription instances that contain notification scenarios and matching rules. When a change ticket is generated, the notification module first matches the ticket with notification rules and scenarios, then parses the O&M engineers to be notified, the notification content, and notification method, and finally sends the notification messages. |
Global |
Mobile application management |
Bind or modify mobile apps. (Currently, only WeCom is supported.) |
Global |
SLA management |
Service Level Agreement (SLA) provides ticket timeliness management for you. When a ticket triggers a rule, the SLA notifies you to follow up and handle the ticket in a timely manner, and it records details about the ticket SLA triggering. In SLA management, you can use public SLA rules or user-defined rules, and can configure notifications for SLA violation and warning. |
Global |
Account management |
Manage and host ECS accounts and periodically change passwords of ECS accounts. |
Global |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot