Updated on 2024-05-27 GMT+08:00

Component Monitoring

Components refer to the services that you deploy, including containers and common processes.

The component list displays the name, running status, and application of each component. AOM supports drill-down from a component to an instance, and then to a process. By viewing the status of each layer, you can implement dimensional monitoring for components.

Procedure

  1. Log in to the AOM 2.0 console.
  2. In the navigation pane, choose Infrastructure Monitoring > Process Monitoring. Next, click the Component Monitoring tab. Then you can view the component list.

    • The component list displays information such as Component Name, Application, Deployment Mode, and Application Discovery Rules.
    • To view target components, you can set filter criteria (such as the running status, application, cluster name, deployment mode, and component name) above the component list.
    • Enable or disable Hide System Components as required. By default, system components are hidden.
    • Click in the upper right corner of the page and select or deselect the columns to display.

  3. Click in the upper right corner of the page and select a desired value from the drop-down list.

    1. Set a time range to view components. There are two methods to set a time range:

      Method 1: Use a predefined time label, such as Last 30 minutes or Last hour in the upper right corner of the page. You can select a time range as required.

      Method 2: Specify the start time and end time to customize a time range. You can specify 30 days at most.

    2. Set the interval for refreshing information. Click and select a value from the drop-down list, such as Refresh manually or 1 minute auto refresh.

  4. Perform the following operations as required:

    • Adding an alias

      If a component name is complex to identify, you can add an alias for the component.

      In the component list, click in the Operation column of the target component, enter an alias, and click OK. The added alias can be modified but cannot be deleted.

    • Adding a tag

      Tags are identifiers of components. You can distinguish system components from non-system components based on tags. By default, AOM adds the System Service tag to system components (including icagent, css-defender, nvidia-driver-installer, nvidia-gpu-device-plugin, kube-dns, org.tanukisoftware.wrapper.WrapperSimpleApp, evs-driver, obs-driver, sfs-driver, icwatchdog, and sh).

      In the component list, click in the Operation column of the target component. In the displayed dialog box, enter a tag key and value, click , select the Mark as system component check box, and click OK.

      • A maximum of five tags can be created for each component.
      • Tag key: max. 36 characters; tag value: max. 43 characters
      • A tag value can contain only letters, digits, hyphens (-), and underscores (_).

  5. Set filter criteria to search for the desired component.

    Components cannot be searched by alias.

  6. Click the component name. The component details page is displayed.

    • On the Instance List tab page, view the instance details.

      Click an instance name to view the monitoring view and alarm information.

    • On the Host List tab page, view the host details.
    • On the Monitoring Views tab page, select a desired Prometheus instance to view the resource usage of the component. Click in the upper right corner of the page to view resource information in full screen.
    • On the Alarms tab page, view the alarm details of the component. For details, see Viewing Alarms.
    • On the Events tab page, view the event details of the component. For details, see Viewing Events.