Updated on 2024-10-09 GMT+08:00

Creating an Identification Task

Based on the created identification task, DSC automatically identifies sensitive data in a specified database, OBS bucket, big data source, LTS, or MRS, and generates identification results and reports.

This topic describes how to create an identification task.

Prerequisites

Creating an Identification Task

  1. Log in to the management console.
  2. Click in the upper left corner of the management console and select a region or project.
  3. In the navigation tree on the left, click . Choose Security & Compliance > Data Security Center .
  4. In the navigation pane, choose Sensitive Data Identification > Identification Task, as shown in Figure 1.

    Figure 1 Identification tasks

  5. In the upper left corner of the task list, click Create Task.
  6. In the displayed dialog box, set required parameters based on Table 1.

    Table 1 Parameter description

    Parameter

    Description

    Example Value

    Task Name

    You can customize the task name.

    The task name must meet the following requirements:

    • Contain 4 to 255 characters.
    • Consist of letters, digits, underscores (_), and hyphens (-).
    • The name must start with a letter.
    • Be unique.

    Test task_01

    Sensitive Data

    Type of data to be identified. You can select multiple types.

    • OBS: DSC is authorized to access your Huawei Cloud OBS assets and identify sensitive data in the assets. For details about how to add OBS assets, see Adding OBS Assets.
    • Database: DSC identifies sensitive data of authorized database assets. For details about how to authorize DSC to access your database assets, see Authorizing Access to Database Assets.
    • Big data: DSC identifies sensitive data of authorized big data assets. For details about how to authorize DSC to access your big data assets, see Authorizing Access to Big Data Assets.
    • MRS: DSC identifies sensitive data of authorized big data assets. For details about how to authorize DSC to access your MRS assets, see Authorizing Access to Big Data Assets.
    • LTS: DSC identifies sensitive data of authorized LTS assets. For details about how to add log streams, see Adding a Log Stream.

    Database

    Identification Template

    You can select a built-in or custom template. DSC displays data by level and category based on the template you select. For details about how to add a template, see Creating an Identification Template.

    Huawei Cloud Data Security Classifying and Grading Template

    Identification Scope

    This parameter is displayed when Data Type is set to LTS. Set this parameter to 1 day, 2 days, or 3 days.

    1 day

    Identification Intensity

    This parameter is displayed when Data Type is set to LTS. Select the log identification intensity, which can be High, Medium, or Low. A higher intensity indicates more sampled data.

    Low

    Identification Period

    Set the execution policy of the data identification task.

    • Once: The task will be executed once at a specified time.
    • Daily: The task is executed at a fixed time every day.
    • Weekly: The task is executed at a specified time every week.
    • Monthly: The task is executed at a specified time every month.

    Once

    When to Execute

    This parameter is displayed when Identification Period is set to Once.
    • Now: Select the option and click OK, the system executes the data identification task immediately.
    • As scheduled: The task will be executed at a specified time.

    Now

    Start Time

    This parameter is displayed when Identification Period is set to Daily, Weekly, or Monthly.

    Select the time when the task is being executed. After the time is selected, the task is executed every day, every week, every month, or at the specified time.

    (Optional) Topic

    • Select an existing topic from the drop-down list or click View Topic to create a topic for receiving alarm notifications.
    • If you do not configure a topic, you can view the identification result in the identification task list. For details, see Identification Results.

    None

  7. (Optional) If you need to set the scan scope for the added assets, see section Adding an Identification Scope.
  8. Click OK. A message is displayed indicating the task is created successfully.

Adding an Identification Scope

By default, DSC performs a global scan on the selected assets. You can also add a scan scope by referring to this section.

  1. Log in to the management console.
  2. Click in the upper left corner of the management console and select a region or project.
  3. In the navigation tree on the left, click . Choose Security & Compliance > Data Security Center .
  4. In the navigation pane, choose Sensitive Data Identification > Identification Task, as shown in Figure 2.

    Figure 2 Identification tasks

  5. Click Create Task. The Create Task page is displayed.
  6. Select the data type, select the name of the asset to be scanned, and click OK.
  7. In the lower left corner of the page, click the button to add an identification scope. You can add multiple scopes at the same time. For details about the parameter settings, see Table 2.

    Table 2 Parameters for configuring the scan scope

    Asset Type

    Configuration Parameter

    Description

    OBS

    Asset

    Select the bucket to be scanned from the drop-down list. You can select multiple buckets.

    Scan Scope

    • File name prefix: For example, if the prefix is dsc_, all files with the prefix dsc_ are scanned.

      A maximum of one inclusion condition can be added for the file name prefix.

    • File name extension: The file name extension contains the file type following the dot (.). For example, the file name extension dsc_security.txt can be security.txt or .txt. Only the files that meet all the filtering conditions are scanned.

      A maximum of one inclusion condition can be added for the file name extension.

    • Directory name: Specifies the directory to be scanned. All files in the specified directory are scanned.

      A maximum of one inclusion condition can be added for the directory.

    After entering the file name prefix/suffix/directory name, click Add as Inclusion Condition to add it as an inclusion condition.

    For example, if you select the File name prefix, enter the prefix dsc_, and click Add as Inclusion Condition, only the files whose file name prefix is dsc_ are scanned. If you click Add as Exclusion Condition to as the prefix as an exclusion condition, only files whose prefixes are not dsc_ are scanned.

    Scan Depth

    • Global Scan: If this parameter is selected, all data is scanned.
    • Specify Scan Scope: You can also select Specify Scan Scope and enter the Scan Depth. The depth of the root directory starts at 1 and increases incrementally. However, it must not surpass a depth of 10.

    Database/Big Data/MRS

    Asset

    Select an instance name from the drop-down list. You can select multiple instances.

    Scan Scope

    • Table name prefix: A maximum of one inclusion condition can be added for the table name prefix. For example, if you enter dsc_ as the prefix of a table name and click Add as Inclusion Condition only the table data whose prefix is dsc_ is scanned. If you click Add as Exclusion Condition to as the prefix as an exclusion condition, only tables whose prefixes are not dsc_ are scanned.
    • Table name suffix: A maximum of one inclusion condition can be added for the table name suffix. The principle is the same as that of the prefix.

    LTS

    Asset

    Select an instance name from the drop-down list. You can select multiple instances.

    Scan Scope

    • Key prefix: If this parameter is added as an inclusion condition, the log content that contains the key prefix is scanned. If this parameter is added as an exclusion condition, the log content except the key prefix is scanned.
    • Key suffix: The principle is the same as that of the key prefix.
      NOTE:
      • A maximum of one inclusion condition can be added for each of the key prefix and suffix.
      • A maximum of 10 exclusion conditions can be added for key prefixes and suffixes.
    Figure 3 OBS scan scope configuration

Follow-up Procedure

Viewing the Identification Result: After the sensitive data identification task is complete, you can click Identification Result in the Operation column of the row containing the target task, to view the total number of sensitive information items, risk level, and sensitive information classification and grading result of the data assets.