Updated on 2024-04-03 GMT+08:00

DataArts Studio Introduction

DataArts Studio is a one-stop data operations platform that provides intelligent data lifecycle management. It supports intelligent construction of industrial knowledge libraries and incorporates data foundations such as big data storage, computing, and analysis engines. With DataArts Studio, your enterprise can easily construct end-to-end intelligent data systems. These systems can help eliminate data silos, unify data, accelerate data monetization, and promote digital transformation.

DataArts Studio Development Process

To use DataArts Studio, perform the following steps:
Table 1 DataArts Studio development process

| Process | Description | Task | Helpful Link |
| --- | --- | --- | --- |
| Process design | Before using DataArts Studio, analyze your business, clarify requirements, and design a process based on the capabilities that DataArts Studio provides. (1) Analyze requirements: analyze your business, clarify requirements, and obtain a data governance framework to guide the design of the data governance process. (2) Conduct a survey: determine the capability boundary of DataArts Studio and analyze the subsequent service load. (3) Design a process: design the data governance process based on the business status and the capabilities of DataArts Studio. The process covers all subsequent data governance operations. | (1) Requirement analysis (2) Business survey (3) Process design | The process design is closely related to your business. You can design a process by referring to Data Governance Based on Taxi Trip Data. You can learn more by contacting us. |
| Preparations | If you are accessing DataArts Studio for the first time, register an account with Huawei, buy a DataArts Studio instance, create a workspace and a user, grant the user DataArts Studio permissions, and add workspace members and roles. | Making preparations | Preparations |
| Management Center | Select cloud services for data storage, query, and analysis as required. Then create the data connections that these cloud services require. | Creating data connections | Managing Data Connections |
| DataArts Migration | Use DataArts Studio to upload data from data sources to the cloud. DataArts Migration migrates data between homogeneous and heterogeneous data sources, such as self-built and cloud-based file systems, relational databases, data warehouses, NoSQL databases, big data cloud services, and object storage. | Integrating data | Supported Data Sources; Creating a CDM Cluster; Creating a Link; Table/File Migration Jobs |
| DataArts Catalog (metadata collection) | Collect metadata of raw data for data management and monitoring. | Collecting metadata | Metadata Collection |
| DataArts Architecture | Use DataArts Architecture to create entity-relationship (ER) models and dimensional models. These models standardize and visualize data development and produce data governance methods that guide developers in their work. In DataArts Architecture, you can create dimensions, fact tables, summary tables, and metrics that fit your needs. | Adding reviewers | Adding a Reviewer |
| | | Managing Configuration Center | Managing the Configuration Center |
| | | Designing processes | Designing Processes |
| | | Designing subjects | Designing Subjects |
| | | Managing lookup tables | Creating Lookup Tables |
| | | Formulating data standards | Creating Data Standards |
| | | Creating ER models | ER Modeling |
| | | Dimensional modeling | Dimensional Modeling |
| | | Business metrics | Business Metrics |
| | | Technical metrics | Technical Metrics |
| | | Data mart building | Creating Summary Tables |
| DataArts Factory | Use DataArts Factory to manage diverse big data services. This one-stop big data development environment supports data management, data integration, script development, job development, job scheduling, O&M, and monitoring, which facilitates data analysis and processing. | Managing data | Data Management Process |
| | | Developing scripts | Script Development Process |
| | | Developing jobs | Job Development Process |
| | | Performing O&M and scheduling | Overview |
| DataArts Quality | Use DataArts Quality to monitor business and technical metrics. Screen out unqualified data in a single column or across columns, rows, and tables from the following perspectives: integrity, validity, timeliness, consistency, accuracy, and uniqueness. Apply the automatically generated quality rules repeatedly to standardize data. | Monitoring business metrics | Creating a Metric; Creating a Rule; Creating a Scenario |
| | | Monitoring data quality | Creating Rule Templates; Creating Quality Jobs; Creating a Comparison Job |
| DataArts Catalog (data map and permissions) | Use DataArts Catalog to manage data permissions. DataArts Catalog also provides data maps. | Data map | Overview |
| | | Data permissions | Overview |
| DataArts DataService | Use DataArts DataService to centrally manage API services, create data APIs based on tables, and register APIs with DataArts DataService for unified management and publication. | Developing APIs | Preparations; Creating an API; Debugging an API; Publishing an API |
| | | Managing APIs | Creating Throttling Policies |
| | | Calling APIs | Calling APIs |
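
As a rough illustration of the quality dimensions listed for DataArts Quality in Table 1, the following Python sketch checks three of them (integrity, validity, and uniqueness) on a few in-memory records. It is not DataArts Quality code and does not call any DataArts Studio API. The sample records, field names, and function names are assumptions made for this example only.

```python
from collections import Counter

# Hypothetical sample records; the field names are illustrative only.
rows = [
    {"trip_id": 1, "fare": 12.5},
    {"trip_id": 2, "fare": None},   # missing fare -> integrity issue
    {"trip_id": 2, "fare": 8.0},    # repeated trip_id -> uniqueness issue
    {"trip_id": 3, "fare": -4.0},   # negative fare -> validity issue
]


def integrity_violations(rows, column):
    """Return rows whose value in `column` is missing (None)."""
    return [r for r in rows if r.get(column) is None]


def validity_violations(rows, column, is_valid):
    """Return rows whose non-missing value in `column` fails `is_valid`."""
    return [
        r for r in rows
        if r.get(column) is not None and not is_valid(r[column])
    ]


def uniqueness_violations(rows, column):
    """Return rows whose value in `column` occurs more than once."""
    counts = Counter(r.get(column) for r in rows)
    return [r for r in rows if counts[r.get(column)] > 1]


def quality_report(rows):
    """Count violating rows per quality dimension."""
    return {
        "integrity": len(integrity_violations(rows, "fare")),
        "validity": len(validity_violations(rows, "fare", lambda v: v >= 0)),
        "uniqueness": len(uniqueness_violations(rows, "trip_id")),
    }


report = quality_report(rows)
print(report)  # {'integrity': 1, 'validity': 1, 'uniqueness': 2}
```

Each check returns the offending rows rather than a pass/fail flag, so the same functions can either count violations or screen the rows out. The other dimensions in the table (timeliness, consistency, accuracy) depend on reference data or timestamps and are left out of this sketch.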