Updated on 2024-07-01 GMT+08:00

Creating a Hive Masking Task

You can mask sensitive Hive data.

This section describes how to create a data masking task for Hive.

Prerequisites

Constraints

Currently, only HIVE is supported.

Creating and Running a Hive Masking Task

  1. Log in to the management console.
  2. Click in the upper left corner and select a region or project.
  3. In the navigation tree on the left, click . Choose Security and Compliance > Data Security Center .
  4. In the left navigation pane, choose Data Asset Protection > Data Masking and click the Hive tab. The MRS masking page is displayed.
  5. On the Hive anonymization tab page, click and set Hive anonymization to enable.
  6. Click Create Task. On the displayed Configure Data Source page, configure parameters according to Table 1.

    Table 1 Parameter description

    Parameter

    Description

    Task Name

    You can customize the name of a data masking task.

    The task name must meet the following requirements:
    • Contain 1 to 255 characters.
    • Consist of letters, digits, underscores (_), and hyphens (-).

    Select Data Source

    Select a data source. Only Hive is supported.

    Data Source

    NOTE:

    If no Hive database is available, click Add Database to add database assets. For details, see Authorizing Access to Big Data Assets.

    Database instance: Select the database instance where the data you want to mask is located.

    Database: Select the name of the database where the data you want to mask is located.

    Table name: Select the name of the database table where the data you want to mask is located.

    If you select the check box, data in this column is copied to the Data Type column.

  7. Click Next.

    Figure 1 Setting a masking algorithm
    1. Select the data columns you want to mask.
    2. Select a data masking algorithm. For details about data masking algorithms, see Configuring a Data Masking Rule.

  8. Click Next.

    Figure 2 Masking period

    Select and set the execution period of a masking task.

    • Manual: Manually enable a masking task and execute it based on masking rules.
    • Hourly: Execute a data masking task every several hours.

      Example: If the masking task needs to be executed every two hours, set this parameter to 02:00.

    • Daily: Execute a data masking task at a specified time every day.

      Example: If the masking task needs to be executed at 12:00 every day, set this parameter to 12:00:00.

    • Weekly: Execute a data masking task at a specified time every week.

      Example: If the masking task needs to be executed at 12:00 every Monday, set this parameter to 12:00:00 every Monday.

    • Monthly: Execute a data masking task at a specified time on a specified day every month.

      Example: If the masking task needs to be executed at 12:00 on the 12th day of each month, set this parameter to 12:00:00 12th day of every month.

      If you want to execute a data masking task on the 31st day of each month, the system automatically executes the task on the last day of every month.

  9. Click Next.

    Figure 3 Setting target data
    1. Select a database instance and database name, and enter the database table name.

      If the entered data table name already exists, the system updates the data table in the target database.

      If the entered data table name does not exist, the system automatically creates a data table with the same name in the target database.

      Do not fill in an existing service data table. Otherwise, services may be affected.

    2. Set the column name of the target data type.

      By default, the system generates a name that is the same as the name of the data source column. You can retain the default name or change it as required.

  10. Click Finish.
  11. On the Hive page. In the Operation column of the target anonymization task, click Execute.
  12. The data masking task is executed as configured.

Checking the Running Status of a Hive Data Masking Task

  • On the Hive tab page, view the task running status, as shown in Running status of the Hive data masking task.

    The task statuses are described as follows:

    • Completed: The data masking task has been successfully executed.
    • Running: The data masking task is being executed.
    • Pending execution: The data masking task is not executed.
    • Stopped: The data masking task has been manually stopped.
    • Failed: The data masking task fails to be executed.
      Figure 4 Running status of the Hive data masking task

Editing and Deleting a Hive Data Masking Task

A masking task in the Waiting or Running status cannot be edited or deleted.

  • In the Hive data masking task list, click Edit in the Operation column. For details about how to configure masking task information, see Creating and Running a Hive Masking Task.
    Figure 5 Editing a Hive data masking task
  • In the Hive data masking task list, click Delete in the Operation column.
    Figure 6 Deleting a Hive data masking task

    Deleted masking tasks cannot be restored. Exercise caution when performing this operation.