Creating a Hive Masking Task
You can mask sensitive Hive data.
This section describes how to create a data masking task for Hive.
Prerequisites
- Access to cloud assets has been authorized. For details, see Allowing or Disallowing Access to Cloud Assets.
- Access to Hive databases has been authorized. For details, see Authorizing Access to Big Data Assets.
- Sensitive data has been identified by referring to Creating an Identification Task.
Constraints
Currently, only HIVE is supported.
Creating and Running a Hive Masking Task
- Log in to the management console.
- Click in the upper left corner and select a region or project.
- In the navigation tree on the left, click . Choose .
- In the left navigation pane, choose Data Asset Protection > Data Masking and click the Hive tab. The Hive masking page is displayed.
- Click and set Mask Sensitive Hive Data to to enable Hive masking.
- Click Create Task. On the displayed Configure Data Source page, configure parameters according to Table 1.
Table 1 Parameter description Parameter
Description
Task Name
You can customize the name of a data masking task.
The task name must meet the following requirements:- Contain 1 to 255 characters.
- Consist of letters, digits, underscores (_), and hyphens (-).
Select Data Source
Select a data source. Only Hive is supported.
Data Source
NOTE:If no Hive database is available, click Add Database to add database assets. For details, see Authorizing Access to Big Data Assets.
Database instance: Select the database instance where the data you want to mask is located.
Database: Select the name of the database where the data you want to mask is located.
Table name: Select the name of the database table where the data you want to mask is located.
If you select the check box, data in this column is copied to the Data Type column.
- Click Next.
Figure 1 Setting a masking algorithm
- Select the data columns you want to mask.
- Select a data masking algorithm. For details about data masking algorithms, see Configuring a Data Masking Rule.
- Click Next to switch to the Configure Data Masking Period page and configure the data masking period.
Select and set the execution period of a masking task.
- Manual: Manually enable a masking task and execute it based on masking rules.
- Hourly: Execute a data masking task every several hours.
Example: If the masking task needs to be executed every two hours, set this parameter to 02:00.
- Daily: Execute a data masking task at a specified time every day.
Example: If the masking task needs to be executed at 12:00 every day, set this parameter to 12:00:00.
- Weekly: Execute a data masking task at a specified time every week.
Example: If the masking task needs to be executed at 12:00 every Monday, set this parameter to 12:00:00 every Monday.
- Monthly: Execute a data masking task at a specified time on a specified day every month.
Example: If the masking task needs to be executed at 12:00 on the 12th day of each month, set this parameter to 12:00:00 12th day of every month.
If you want to execute a data masking task on the 31st day of each month, the system automatically executes the task on the last day of every month.
- Click Next.
Figure 2 Setting target data
- Select a database instance and database name, and enter the database table name.
If the entered data table name already exists, the system updates the data table in the target database.
If the entered data table name does not exist, the system automatically creates a data table with the same name in the target database.
Do not fill in an existing service data table. Otherwise, services may be affected.
- Set the column name of the target data type.
By default, the system generates a name that is the same as the name of the data source column. You can retain the default name or change it as required.
- Select a database instance and database name, and enter the database table name.
- Click Finish.
- On the Hive page. In the Operation column of the target anonymization task, click Execute.
- The data masking task is executed as configured.
Checking the Running Status of a Hive Data Masking Task
- On the Hive tab page, view the task running status, as shown in Running status of the Hive data masking task.
The task statuses are described as follows:
- Completed: The data masking task has been successfully executed.
- Running: The data masking task is being executed.
- Pending execution: The data masking task is not executed.
- Stopped: The data masking task has been manually stopped.
- Failed: The data masking task fails to be executed.
Editing and Deleting a Hive Data Masking Task
A masking task in the Waiting or Running status cannot be edited or deleted.
- In the Hive data masking task list, click Edit in the Operation column. For details about how to configure masking task information, see Creating and Running a Hive Masking Task.
Figure 4 Editing a Hive data masking task
- In the Hive data masking task list, click Delete in the Operation column.
Figure 5 Deleting a Hive data masking task
Deleted masking tasks cannot be restored. Exercise caution when performing this operation.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.