Creating an Identification Task
Based on the created identification task, DSC automatically identifies sensitive data in a specified database, OBS bucket, big data source, LTS, or MRS, and generates identification results and reports.
This topic describes how to create an identification task.
Prerequisites
- Access to cloud assets has been authorized. For details, see Allowing or Disallowing Access to Cloud Assets.
- Specific assets have been added or authorized. For details, see Asset Center.
Creating an Identification Task
- Log in to the management console.
- Click in the upper left corner of the management console and select a region or project.
- In the navigation tree on the left, click . Choose .
- In the navigation pane, choose Sensitive Data Identification > Identification Task, as shown in Figure 1.
- In the upper left corner of the task list, click Create Task.
- In the displayed dialog box, set required parameters based on Table 1.
Table 1 Parameter description Parameter
Description
Example Value
Task Name
You can customize the task name.
The task name must meet the following requirements:
- Contain 4 to 255 characters.
- Consist of letters, digits, underscores (_), and hyphens (-).
- The name must start with a letter.
- Be unique.
Test task_01
Sensitive Data
Type of data to be identified. You can select multiple types.
- OBS: DSC is authorized to access your Huawei Cloud OBS assets and identify sensitive data in the assets. For details about how to add OBS assets, see Adding OBS Assets.
- Database: DSC identifies sensitive data of authorized database assets. For details about how to authorize DSC to access your database assets, see Authorizing Access to Database Assets.
- Big data: DSC identifies sensitive data of authorized big data assets. For details about how to authorize DSC to access your big data assets, see Authorizing Access to Big Data Assets.
- MRS: DSC identifies sensitive data of authorized big data assets. For details about how to authorize DSC to access your MRS assets, see Authorizing Access to Big Data Assets.
- LTS: DSC identifies sensitive data of authorized LTS assets. For details about how to add log streams, see Adding a Log Stream.
Database
Identification Template
You can select a built-in or custom template. DSC displays data by level and category based on the template you select. For details about how to add a template, see Creating an Identification Template.
Huawei Cloud Data Security Classifying and Grading Template
Identification Scope
This parameter is displayed when Data Type is set to LTS. Set this parameter to 1 day, 2 days, or 3 days.
1 day
Identification Intensity
This parameter is displayed when Data Type is set to LTS. Select the log identification intensity, which can be High, Medium, or Low. A higher intensity indicates more sampled data.
Low
Identification Period
Set the execution policy of the data identification task.
- Once: The task will be executed once at a specified time.
- Daily: The task is executed at a fixed time every day.
- Weekly: The task is executed at a specified time every week.
- Monthly: The task is executed at a specified time every month.
Once
When to Execute
This parameter is displayed when Identification Period is set to Once.- Now: Select the option and click OK, the system executes the data identification task immediately.
- As scheduled: The task will be executed at a specified time.
Now
Start Time
This parameter is displayed when Identification Period is set to Daily, Weekly, or Monthly.
Select the time when the task is being executed. After the time is selected, the task is executed every day, every week, every month, or at the specified time.
(Optional) Topic
- Select an existing topic from the drop-down list or click View Topic to create a topic for receiving alarm notifications.
- If you do not configure a topic, you can view the identification result in the identification task list. For details, see Identification Results.
None
- (Optional) If you need to set the scan scope for the added assets, see section Adding an Identification Scope.
- Click OK. A message is displayed indicating the task is created successfully.
Adding an Identification Scope
By default, DSC performs a global scan on the selected assets. You can also add a scan scope by referring to this section.
- Log in to the management console.
- Click in the upper left corner of the management console and select a region or project.
- In the navigation tree on the left, click . Choose .
- In the navigation pane, choose Sensitive Data Identification > Identification Task, as shown in Figure 2.
- Click Create Task. The Create Task page is displayed.
- Select the data type, select the name of the asset to be scanned, and click OK.
- In the lower left corner of the page, click the button to add an identification scope. You can add multiple scopes at the same time. For details about the parameter settings, see Table 2.
Table 2 Parameters for configuring the scan scope Asset Type
Configuration Parameter
Description
OBS
Asset
Select the bucket to be scanned from the drop-down list. You can select multiple buckets.
Scan Scope
- File name prefix: For example, if the prefix is dsc_, all files with the prefix dsc_ are scanned.
A maximum of one inclusion condition can be added for the file name prefix.
- File name extension: The file name extension contains the file type following the dot (.). For example, the file name extension dsc_security.txt can be security.txt or .txt. Only the files that meet all the filtering conditions are scanned.
A maximum of one inclusion condition can be added for the file name extension.
- Directory name: Specifies the directory to be scanned. All files in the specified directory are scanned.
A maximum of one inclusion condition can be added for the directory.
After entering the file name prefix/suffix/directory name, click Add as Inclusion Condition to add it as an inclusion condition.
For example, if you select the File name prefix, enter the prefix dsc_, and click Add as Inclusion Condition, only the files whose file name prefix is dsc_ are scanned. If you click Add as Exclusion Condition to as the prefix as an exclusion condition, only files whose prefixes are not dsc_ are scanned.
Scan Depth
- Global Scan: If this parameter is selected, all data is scanned.
- Specify Scan Scope: You can also select Specify Scan Scope and enter the Scan Depth. The depth of the root directory starts at 1 and increases incrementally. However, it must not surpass a depth of 10.
Database/Big Data/MRS
Asset
Select an instance name from the drop-down list. You can select multiple instances.
Scan Scope
- Table name prefix: A maximum of one inclusion condition can be added for the table name prefix. For example, if you enter dsc_ as the prefix of a table name and click Add as Inclusion Condition only the table data whose prefix is dsc_ is scanned. If you click Add as Exclusion Condition to as the prefix as an exclusion condition, only tables whose prefixes are not dsc_ are scanned.
- Table name suffix: A maximum of one inclusion condition can be added for the table name suffix. The principle is the same as that of the prefix.
LTS
Asset
Select an instance name from the drop-down list. You can select multiple instances.
Scan Scope
- Key prefix: If this parameter is added as an inclusion condition, the log content that contains the key prefix is scanned. If this parameter is added as an exclusion condition, the log content except the key prefix is scanned.
- Key suffix: The principle is the same as that of the key prefix.
NOTE:
- A maximum of one inclusion condition can be added for each of the key prefix and suffix.
- A maximum of 10 exclusion conditions can be added for key prefixes and suffixes.
Figure 3 OBS scan scope configuration
- File name prefix: For example, if the prefix is dsc_, all files with the prefix dsc_ are scanned.
Follow-up Procedure
Viewing the Identification Result: After the sensitive data identification task is complete, you can click Identification Result in the Operation column of the row containing the target task, to view the total number of sensitive information items, risk level, and sensitive information classification and grading result of the data assets.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.