Updated on 2024-08-30 GMT+08:00

DataArts Studio development process

DataArts Studio is a one-stop data operations platform that provides intelligent data lifecycle management. It supports intelligent construction of industrial knowledge libraries and incorporates data foundations such as big data storage, computing, and analysis engines. With DataArts Studio, your enterprise can easily construct end-to-end intelligent data systems. These systems can help eliminate data silos, unify data, accelerate data monetization, and promote digital transformation.

DataArts Studio Development Process

To use DataArts Studio, perform the following steps:
Table 1 DataArts Studio development process

Process | Description | Task | Helpful Link

Process design

Before using DataArts Studio, analyze your business, clarify requirements, and design a process based on the capabilities that DataArts Studio provides.
  1. Analyze requirements. Analyze your business, clarify requirements, and obtain the data governance framework to facilitate the design of a data governance process.
  2. Conduct a survey. Determine the capability boundary of DataArts Studio and analyze the subsequent service load.
  3. Design a process. Design the data governance process based on the business status and the capabilities of DataArts Studio. The process covers all the subsequent data governance operations.
  1. Requirement analysis
  2. Business survey
  3. Process design

The process design is closely related to your business. You can design a process by referring to Data Governance Based on Taxi Trip Data. You can learn more by contacting us.

Obtaining and configuring a DataArts Studio instance

If you are new to DataArts Studio, register an account with Huawei, buy a DataArts Studio instance, and create a workspace.

Obtaining and configuring a DataArts Studio instance

Buying and Configuring a DataArts Studio Instance

Creating an IAM user and assigning DataArts Studio permissions

If you want to authorize other IAM users to use DataArts Studio, you need to create users and assign DataArts Studio permissions to them.

Creating an IAM user and assigning DataArts Studio permissions

Authorizing Users to Use DataArts Studio

Management Center

Select cloud services for data storage, query, and analysis as required. Then, create data connections required for the cloud services.

Creating a data connection

Creating a DataArts Studio Data Connection

DataArts Migration

Use DataArts Studio to upload data from data sources to the cloud.

DataArts Migration migrates data between homogeneous and heterogeneous data sources, such as self-built and cloud-based file systems, relational databases, data warehouses, NoSQL databases, big data cloud services, and object storage.

Integrating data

Supported Data Sources

Creating a CDM Cluster

Creating a Link Between CDM and a Data Source

Table/File Migration Jobs

DataArts Catalog (metadata collection)

Collect metadata of raw data for data management and monitoring.

Collecting metadata

Collecting Metadata of Data Sources

DataArts Architecture

Use DataArts Architecture to create entity-relationship (ER) models and dimensional models. These models standardize and visualize data development, and they produce data governance methods that guide developers in their work.

In DataArts Architecture, you can create dimensions, fact tables, summary tables, and metrics that fit your needs.

Adding reviewers

Adding a Reviewer

Managing the configuration center

Managing the Configuration Center

Designing processes

Designing Processes

Designing subjects

Designing Subjects

Managing lookup tables

Creating a Lookup Table

Formulating data standards

Creating Data Standards

Creating ER models

ER Modeling

Dimensional modeling

Dimensional Modeling

Business metrics

Business Metrics

Technical metrics

Technical Metrics

Data mart building

Data Mart

DataArts Factory

Use DataArts Factory to manage diverse big data services.

The one-stop big data development environment enables a variety of operations such as data management, data integration, script development, job development, job scheduling, O&M, and monitoring, facilitating data analysis and processing.

Managing data

Data Management Process

Developing scripts

Script Development Process

Developing jobs

Job Development Process

Performing O&M and scheduling

Overview

DataArts Quality

Use DataArts Quality to monitor business and technical metrics. Screen out unqualified data in a single column or across columns, rows, and tables from the following perspectives: integrity, validity, timeliness, consistency, accuracy, and uniqueness. Use the automatically generated quality rules to standardize data on a recurring basis.

Monitoring business metrics

Creating a Metric

Creating a Rule

Creating a Scenario

Monitoring data quality

Creating a Data Quality Rule

Creating a Data Quality Job

Creating a Data Comparison Job

DataArts Catalog (data map and permissions)

Use DataArts Catalog to manage data permissions. DataArts Catalog also provides data maps for viewing data assets.

Data map

Viewing Data Assets in a Workspace

Data permissions

Overview

DataArts DataService

Use DataArts DataService to centrally manage API services, create data APIs based on tables, and register APIs with DataArts DataService itself for unified management and publication.

Developing APIs

Buying and Managing an Exclusive Cluster

Creating a Reviewer in DataArts DataService

Creating an API

Debugging an API

Publishing an API

Managing APIs

Orchestrating APIs

Configuring a Throttling Policy for API Calling

Authorizing API Calling

Calling APIs

Applying for API Authorization

Calling APIs Using Different Methods
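
The quality perspectives listed for DataArts Quality in Table 1 can be illustrated outside the product with a small sketch. The following Python is not the DataArts Quality API. The record layout, the field names `trip_id` and `fare`, and the function `check_quality` are hypothetical and chosen only for illustration. The sketch flags rows that fail three of the six perspectives: integrity (no missing values), validity (the fare is not negative), and uniqueness (no repeated `trip_id`).

```python
from collections import Counter

# Hypothetical sample records; the field names are assumptions for illustration.
RECORDS = [
    {"trip_id": "t1", "fare": 12.5},
    {"trip_id": "t2", "fare": None},   # fails integrity (missing value)
    {"trip_id": "t3", "fare": -4.0},   # fails validity (negative fare)
    {"trip_id": "t1", "fare": 8.0},    # fails uniqueness (repeats t1)
]


def check_quality(records):
    """Return a dict mapping each perspective to the indexes of failing rows."""
    id_counts = Counter(row["trip_id"] for row in records)
    failures = {"integrity": [], "validity": [], "uniqueness": []}
    for index, row in enumerate(records):
        if any(value is None for value in row.values()):
            failures["integrity"].append(index)
        elif row["fare"] < 0:
            # Validity is only checked when the value is present.
            failures["validity"].append(index)
        if id_counts[row["trip_id"]] > 1:
            # Every row that shares a duplicated key is reported.
            failures["uniqueness"].append(index)
    return failures


if __name__ == "__main__":
    print(check_quality(RECORDS))
```

In this sketch, every row that shares a duplicated key is reported, not only the later occurrence, so a reviewer can decide which copy to keep. In DataArts Studio itself, such checks would be configured as quality rules and quality jobs, as described in the linked topics.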