Updated on 2024-04-29 GMT+08:00

Data Catalogs

You can search for and filter assets, and view asset details on the Data Catalog page.

  • Logical assets come from the logical entities and data tables defined and published in DataArts Architecture.
  • Data connections in technical assets come from the data connections in Management Center, and databases, tables, and columns come from metadata collection tasks in DataArts Catalog.
  • Metric assets come from the business metrics defined and published in DataArts Architecture.

Constraints

  • Logical assets and metric assets come from DataArts Architecture and are updated if data is synchronized from DataArts Architecture. However, they cannot be deleted directly in DataArts Architecture. Instead, you must locate and delete them in DataArts Catalog.
  • Data connections in technical assets come from Management Center and are updated if data is synchronized from Management Center. However, they cannot be deleted directly in Management Center. Instead, you must locate and delete them in DataArts Catalog.
  • Information such as databases, tables, and columns in technical assets come from metadata collection tasks. Whether to update and automatically delete such information depends on the parameter settings of metadata collection tasks. For details, see Task Management.
  • Data lineages in technical assets are updated by job scheduling. Data lineages are generated based on the latest job instances. To delete data lineages, you need to delete jobs or job metadata. Stopping jobs alone does not delete data lineages.

Searching for a Data Asset

An asset can be searched by its name, description, or attributes. Fuzzy search is supported.

  1. On the DataArts Studio console, locate an instance and click Access. On the displayed page, locate a workspace and click DataArts Catalog.
    Figure 1 DataArts Catalog

  1. In the left navigation pane, choose Data Map > Data Catalog. Click the Logical Assets, Technical Assets, and Metric Assets tabs as needed.
  2. In the search box, enter a keyword to search for your desired assets.
    • By their names and description
    • By their attributes, which are displayed on the asset details page
      • You can save the search criteria you set.
      • You can import the search criteria you need.

Filtering an Asset

Technical assets can be filtered by the following criteria:

  • Data connection: the data connection that your target asset uses.
  • Type: the type of your target asset.
  • Classification: the category that your asset is classified into.
  • Tag: the tag that your asset includes.
  • Security level: the security level of your target asset.

The following uses type as an example to demonstrate how to filter an asset.

  1. Select Table under Types. Table assets are displayed.
  2. In the Types area, Table, Column, Database, Bucket, and ColumnFamily are supported by default. If you select All, the system displays assets of all types.

Viewing the Details of an Asset

This section describes how to view data table details on the Technical Assets page.

  1. In the list of technical assets, select a table and click its name to access its details page.
  2. On the Details tab page, view the basic attributes of the technical metadata; edit the description; add or delete classifications, tags, and security levels for the table, table columns, or OBS objects.

    Figure 2 Details tab page

  3. On the Permission tab page, you can apply for data table permissions or grant permissions to other users.

    Figure 3 Permissions tab page

  4. On the Column Attributes tab page, view the column attributes of the table; add or delete classifications, tags, and security levels for the data columns; edit the description.

    Figure 4 Managing column attributes

  5. On the Lineage tab page, view table lineages and impacts. For details on how to set a data lineage, see Viewing Data Lineages Through the Data Map. If a node that supports automatic lineage is configured for a data development job or the lineage of a node is manually configured, the data lineage can be automatically parsed during job execution and displayed in the data catalog.
  6. On the Profile tab page, view the profile of the data table. (Currently, this function is available only for GaussDB(DWS), DLI, and OBS data tables. The profile sampling mode is subject to the metadata collection task configuration.)

    Click Update to update the table profile.

  7. On the Data Preview tab page, preview the business data in the current table. The data can be masked in real time based on the column classification information and the configuration in Masking Policies.

    • Data assets that use DWS, DLI, MRS Hive, and MySQL data connections can be previewed.
    • Column classification information can be automatically set when a collection task is created or manually added in the data classification menu. Automatic classification setting is available only for DWS and DLI data collections.

  8. On the Change History tab page, view the change history of the table.