Updated on 2024-11-29 GMT+08:00

Configuring Automatic Metadata Extraction

Scenario

This section describes how to configure the time and dump information for automatic metadata extraction on Manager.

The dump function allows you to save metadata to a specified location using FTP or SFTP. The following scenarios are supported:

  • FTP-Server has been deployed in the cluster to dump metadata to HDFS of the cluster.
  • An FTP or SFTP server has been deployed outside the cluster to dump metadata to the local disk of an external server.

Prerequisites

You have logged in to Manager.

You have created a user who is associated with the hadoop user group and System_administrator role.

Procedure

  1. Log in to Manager using the created user and choose Cluster > Name of the desired cluster > Services > Metadata > MetadataServer. The Metadata web UI is displayed.
  2. Click Metadata Configuration and configure the parameters. See Table 1.

    Table 1 Parameters for configuring automatic metadata extraction

    Parameter

    Description

    Started

    Start time for automatically extracting metadata

    Interval (hours)

    Interval for executing automatic extraction tasks, in second

    Auto FTP Update

    Specifies whether to enable FTP dumping. Select Yes to enable it and No to disable it. If you select Yes, the FTP server configuration parameters are available.

    SFTP

    Indicates whether to use SFTP. Select Yes to enable it and No to disable it.

    This parameter is available only when Auto FTP Update is set to Yes.

    Service IP Address

    IP address used by the FTP dump server.

    This parameter is available only when Auto FTP Update is set to Yes.

    Service Port

    Port used by the FTP dump server. If the IP address of the FTP-Server in the cluster is specified, you need to enable the FTP protocol and use the FTP-Server control port.

    This parameter is available only when Auto FTP Update is set to Yes.

    Server Directory

    Dump directory of the FTP server. The directory must exist on the FTP server. If not, metadata fails to update.

    This parameter is available only when Auto FTP Update is set to Yes.

    Username

    Username for accessing the FTP dump server

    This parameter is available only when Auto FTP Update is set to Yes.

    Password

    Password for accessing the FTP dump server

    This parameter is available only when Auto FTP Update is set to Yes.

    • To manually extract metadata, click Update Metadata and select OK to update metadata. Determine whether to upload metadata to the configured FTP server as required.
    • If you disable FTPS when configuring metadata extraction parameters, FTP will be used, which results in potential security risks.