Updated on 2024-04-29 GMT+08:00

Viewing Data Lineages

Before you can view data lineages, create a metadata collection task in DataArts Catalog. After a data development job that meets the automatic lineage parsing requirements, or for which lineages have been manually configured, is successfully scheduled, you can view its data lineages in DataArts Catalog.

Constraints

  • Data lineage updates depend on job scheduling. Data lineages are generated based on the latest job instances.
  • To delete data lineages, you need to delete jobs or job metadata. Stopping jobs alone does not delete data lineages.
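
The two constraints can be pictured with a small in-memory model. This is only an illustrative sketch, not the DataArts Studio implementation: the `LineageStore` class, its methods, and the job and table names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class LineageStore:
    """Toy model of the constraints above (hypothetical, not a product API)."""

    # job name -> (instance id, set of (source, target) edges)
    lineages: dict = field(default_factory=dict)
    stopped: set = field(default_factory=set)

    def record_instance(self, job, instance_id, edges):
        # Lineage reflects only the latest scheduled instance of a job.
        current = self.lineages.get(job)
        if current is None or instance_id > current[0]:
            self.lineages[job] = (instance_id, set(edges))

    def stop_job(self, job):
        # Stopping a job leaves its lineage in place.
        self.stopped.add(job)

    def delete_job(self, job):
        # Deleting the job (or its metadata) removes the lineage.
        self.lineages.pop(job, None)
        self.stopped.discard(job)

    def edges(self, job):
        entry = self.lineages.get(job)
        return set() if entry is None else entry[1]


store = LineageStore()
store.record_instance("etl_job", 1, [("src_a", "dst")])
store.record_instance("etl_job", 2, [("src_b", "dst")])  # newer instance wins
store.stop_job("etl_job")
after_stop = store.edges("etl_job")      # lineage still present
store.delete_job("etl_job")
after_delete = store.edges("etl_job")    # lineage gone
```

In this model, the lineage shown for a job always comes from its most recent instance, and only deletion clears it.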

Creating and Running a Metadata Collection Task

Create and run a metadata collection task by referring to Task Management. When creating the task, select the tables whose lineages you want to view.

If a task for collecting the metadata of these tables has been created and run, skip this part.

Starting Job Scheduling

After metadata is collected, the system generates data lineages based on the latest job instances.

  1. On the DataArts Studio console, locate a workspace and click DataArts Factory.
  2. In the navigation pane, locate the job for which lineages have been configured and double-click it to open it.
  3. Click Execute. The system starts parsing lineages of the job.

    If you click Test, the system will not parse lineages of the job.

    Figure 1 Starting job scheduling

  4. After the job is successfully executed, wait about 1 minute for the data lineage to be generated.
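
If you script around this procedure, the wait of about 1 minute can be handled with a simple polling loop rather than a fixed delay. The sketch below is generic: `has_lineage` is a hypothetical callable that you would supply yourself (this page does not describe any API for checking lineage), and the timeout and interval values are assumptions.

```python
import time


def wait_for_lineage(has_lineage, timeout_s=120.0, interval_s=10.0,
                     sleep=time.sleep, clock=time.monotonic):
    """Poll a caller-supplied check until it returns True or the timeout expires.

    has_lineage is a hypothetical zero-argument callable; nothing here
    calls a real DataArts Studio interface.
    """
    deadline = clock() + timeout_s
    while True:
        if has_lineage():
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)


# Example with a stand-in check that succeeds on the third attempt.
attempts = {"count": 0}


def fake_check():
    attempts["count"] += 1
    return attempts["count"] >= 3


found = wait_for_lineage(fake_check, sleep=lambda seconds: None)
```

The `sleep` and `clock` parameters exist only so that the loop can be exercised without real waiting.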

Viewing Data Lineages

  1. On the DataArts Studio console, locate an instance and click Access. On the displayed page, locate a workspace and click DataArts Catalog.

    Figure 2 DataArts Catalog

  2. In the navigation pane, choose Data Catalog. In the right pane, click the Technical Assets tab. On this page, you can query jobs, nodes, and tables.

    In the Types area, click Search All, select Job, Node, and Table, and click OK.

    Jobs do not belong to any data connection. If you select a data connection in the search filters, no result will be returned.

    Figure 3 Selecting types

  3. In the search results, click the name of an asset ending with _job to view its details. On the job details page, click the Job tab and then click Edit to go to the job editing page.

    Figure 4 Viewing job details

  4. In the data asset search results, click the name of an asset ending with _node to view its details. On the node details page, you can view the lineage information of the node.

    • Click the + or - icon beside the node to expand or collapse its upstream and downstream links.
    • Click a node to view its details.
    • Click the Job tab and then Edit to go to the job editing page.
    Figure 5 Viewing lineages of a node

  5. In the data asset search results, click the name of an asset whose icon is a table to view its details. On the table details page, you can view the lineages of the table.

    • Click the + or - icon beside the table to expand or collapse its upstream and downstream links.
    • Click a table to view its details.
    Figure 6 Viewing lineages of a table
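
Conceptually, expanding upstream and downstream links in the lineage view corresponds to walking a directed graph in both directions. The following is a minimal sketch under that assumption. The table names and the `build_graph` and `expand` helpers are hypothetical and are not part of DataArts Catalog.

```python
from collections import defaultdict, deque


def build_graph(edges):
    """Index (source, target) lineage edges in both directions."""
    upstream = defaultdict(set)
    downstream = defaultdict(set)
    for src, dst in edges:
        downstream[src].add(dst)
        upstream[dst].add(src)
    return upstream, downstream


def expand(start, neighbors):
    """Breadth-first walk that collects every asset reachable from start."""
    seen = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in neighbors.get(node, ()):
            if nxt not in seen and nxt != start:
                seen.add(nxt)
                queue.append(nxt)
    return seen


# Hypothetical lineage: two source tables feed one table, which feeds another.
edges = [
    ("ods_orders", "dwd_orders"),
    ("ods_users", "dwd_orders"),
    ("dwd_orders", "dws_sales"),
]
upstream, downstream = build_graph(edges)

all_upstream = expand("dws_sales", upstream)
all_downstream = expand("ods_orders", downstream)
```

Here `all_upstream` holds every table that `dws_sales` depends on, directly or indirectly, and `all_downstream` holds every table derived from `ods_orders`.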