Updated on 2024-09-23 GMT+08:00

Dumping MRS Cluster Monitoring Data

Scenarios

The monitoring data reporting function writes the monitoring data collected in the system into a text file and uploads the file to a specified server in FTP or SFTP mode. You can configure interconnection parameters on Manager to save monitoring data to a specified FTP server. In this way, MRS clusters can interconnect with third-party systems.

The FTP protocol does not encrypt data, which poses potential security risks. Therefore, the SFTP protocol is recommended. The ECS corresponding to the dump server must be in the same VPC as the Master nodes of the MRS cluster, and the Master nodes can access the IP address and specified port of the dump server. The FTP service on the dump server is running properly.

Manager supports the collection of all the monitoring data in the managed clusters. The collection period is 30 seconds, 60 seconds, or 300 seconds. The monitoring data is stored in different monitoring files on the FTP server by collection period.

Table 1 Monitoring data files

Monitoring Data

Description

Collection period

  • 30s: real-time metrics that are collected every 30s by default
  • 60s: real-time metrics that are collected every 60s by default
  • 300s: all metrics that are not collected every 30s or 60s

File name format

  • MRS 3.x and later:

    metric_Monitoring data collection period_File creation time.log

    Examples: metric_60_20160908085915.log and metric_300_20160908085613.log

  • Versions earlier than MRS 3.x:

    Cluster name_metric_Monitoring data collection period_File creation time.log

Content format (MRS 3.x as an example)

  • Cluster ID|Cluster name|Display name|Service name| Metric ID|Collection time|Collection host@m@Sub-metric|Unit|Metric value
  • Example:
    The actual files do not contain the parameter names in the format description.
    1|xx1|Host|Host|10000413|2019/06/18 10:05:00|10-66-254-146|KB/s|309.910
    1|xx1|Host|Host|10000413|2019/06/18 10:05:00|10-66-254-152|KB/s|72.870
    2|xx2|Host|Host|10000413|2019/06/18 10:05:00|10-66-254-163|KB/s|100.650

With the Metric ID obtained from the reported file, the third-party can obtain metric details by querying the metric set file in the FusionInsight installation path/om-server/om/etc/om/all-shown-metric-zh_CN on the active/standby OMS node. The file contains the detailed information about all metrics. The content is as follows (taking MRS 3.x as an example):

Real-Time Metric ID,5-Minute Metric ID,Metric Name,Metric Collection Period (s),Collected by Default,Service Belonged To,Role Belonged To
00101,10000101,JobHistoryServer non-heap memory usage,30,false,Mapreduce,JobHistoryServer
00102,10000102,JobHistoryServer non-heap memory allocation volume,30,false,Mapreduce,JobHistoryServer
00103,10000103,JobHistoryServer heap memory usage,30,false,Mapreduce,JobHistoryServer
00104,10000104,JobHistoryServer heap memory allocation volume,30,false,Mapreduce,JobHistoryServer
00105,10000105,Number of blocked threads,30,false,Mapreduce,JobHistoryServer
00106,10000106,Number of running threads,30,false,Mapreduce,JobHistoryServer
00107,10000107,GC time,30,false,Mapreduce,JobHistoryServer
00110,10000110,JobHistoryServer CPU usage,30,false,Mapreduce,JobHistoryServer
...
  • Metric ID meaning:

    For metrics whose collection period is 30s/60s, you can find the corresponding metric description by referring to the first column, that is, Real-Time Metric ID.

    For metrics whose collection period is 300s, you can find the corresponding metric description by referring to the second column, that is, 5-Minute Metric ID.

  • Field description:

    Real-Time Metric ID: indicates the ID of the metric whose collection period is 30s or 60s.

    5-Minute Metric ID: indicates the ID of a 5-minute (300s) metric.

    Metric Collection Period (s): indicates the collection period of real-time metrics. The value can be 30 or 60.

    Service Belonged To: indicates the name of the service to which a metric belongs, for example, HDFS and HBase.

    Role Belonged To: indicates the name of the role to which a metric belongs, for example, JobServer and RegionServer.

Dumping MRS Cluster Monitoring Data (MRS 3.x and Later)

  1. Log in to FusionInsight Manager.
  2. Choose System > Interconnection > Upload Performance Data.
  3. Toggle on Upload Performance Data.

    The performance data upload service is disabled by default. indicates that the service is enabled.

  4. Set the upload parameters according to Table 2.

    Table 2 Upload parameters

    Parameter

    Description

    FTP IP Address Mode

    Specifies the server IP address mode. This parameter is mandatory. The value can be IPV4 or IPV6.

    FTP IP Address

    Specifies the IP address of the FTP server for storing monitoring files after the monitoring metric data is interconnected. This parameter is mandatory.

    FTP Port

    Specifies the port for connecting to the FTP server. This parameter is mandatory.

    FTP Username

    Specifies the username for logging in to the FTP server. This parameter is mandatory.

    FTP Password

    Specifies the password for logging in to the FTP server. This parameter is mandatory.

    Save Path

    Specifies the path for storing monitoring files on the FTP server. This parameter is mandatory.

    Dump Interval (second)

    Specifies the interval at which monitoring files are periodically stored on the FTP server, in seconds. This parameter is mandatory. The system will periodically upload files to the corresponding FTP server at the specified interval.

    Dump Mode

    Specifies the protocol used for sending monitoring files. This parameter is mandatory. The value can be SFTP or FTP. You are advised to use the SFTP mode based on SSH v2. Otherwise, security risks may be incurred.

    SFTP Service Public Key

    Specifies the public key of the FTP server. This parameter is optional. It is valid only when Dump Mode is set to SFTP.

  5. Click OK.

    If the dump mode is SFTP and the public key of the SFTP service is empty, the system displays a security risk warning. You need to evaluate the security risk and then save the configuration.

Dumping MRS Cluster Monitoring Data (MRS 2.x and Earlier)

  1. On MRS Manager, click System.
  2. In Configuration, click Configure Monitoring Metric Dump under Monitoring and Alarm.
  3. Table 3 describes the parameters for dumping.

    Table 3 Parameters

    Parameter

    Description

    Mandatory

    Dump Monitoring Metric

    Whether to enable monitoring metric data interconnection.

    Yes

    FTP IP Address

    FTP server for storing monitoring files after monitoring metric data is interconnected.

    Yes

    FTP Port

    Port for connecting to the FTP server.

    Yes

    FTP Username

    Username for logging in to the FTP server.

    Yes

    FTP Password

    Password for logging in to the FTP server.

    Yes

    Save Path

    Path for storing monitoring files on the FTP server.

    Yes

    Dump Interval (second)

    Interval at which monitoring files are periodically stored on the FTP server, in seconds. The system will periodically upload files to the corresponding FTP server at the specified interval.

    Yes

    Dump Mode

    Protocol used for sending monitoring files. The options are FTP and SFTP.

    Yes

    SFTP Service Public Key

    Public key of the FTP server. This parameter is available only when Dump Mode is set to SFTP. You are advised to configure a public key. Otherwise, security risks may arise.

    No

  4. Click OK.