Creating a Table Group and Adding Tables to the Group
Group source tables before verifying their consistency.
Precautions
- A maximum of 10,000 tables can be imported at a time.
- The tables to be imported must come from the same metadata source.
- When a table is imported, the system does not verify how many table groups the table is added to or what verification rules are configured for those groups. You are advised to import a table into a maximum of three table groups with different verification rules.
- When you create a table group for verifying the consistency of data migrated from MaxCompute to DLI, you are advised to configure basic verification rules, such as count, sum, and allsum, for the group. After all the verification rules are met and the data volumes at the source and the target are the same, you can create a table group with the content verification rule to verify the content consistency.
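The staged approach in the last precaution can be expressed as a simple gate. The following is a minimal sketch, not an MgC API: the function name and the shape of its inputs are assumptions made for illustration, and the per-rule pass or fail results are assumed to be read manually from the verification results.

```python
def ready_for_content_verification(rule_passed, source_rows, target_rows):
    """Decide whether to move on to a content verification table group.

    rule_passed: hypothetical mapping of basic rule name -> True/False,
                 for example {"count": True, "sum": True, "allsum": True}.
    source_rows, target_rows: data volumes observed at the source and target.
    """
    # Every basic rule must have been run and met.
    basic_ok = bool(rule_passed) and all(rule_passed.values())
    # The data volumes at the source and the target must be the same.
    return basic_ok and source_rows == target_rows


if __name__ == "__main__":
    results = {"count": True, "sum": True, "allsum": False}
    print(ready_for_content_verification(results, 1000, 1000))
```

Only when the function returns True would you create the table group that uses the content verification rule.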
Prerequisites
- You have completed all preparations.
- A source connection has been created.
Creating a Table Group
- Sign in to the MgC console. In the navigation pane, under Project, select your big data migration project from the drop-down list.
- In the navigation pane, choose Migrate > Big Data Verification.
- In the Features area, click Table Management.
- Under Table Groups, click Create.
- Configure the parameters listed in Table 1.
Table 1 Parameters for creating a table group
Parameter | Description
Table Group | Enter a user-defined name for the table group.
Metadata Connection | Select the created source connection. CAUTION: A table group can only contain tables from the same metadata source.
Verification Rule | Select a rule that defines the method for verifying data consistency and the inconsistency tolerance. Click View More to see details about the verification rules provided by MgC.
Description (Optional) | Enter a description to identify the table group.
- Click Confirm. The table group is created and appears in the table group list, where you can view its information. Then do one of the following:
  - If all the tables to be verified are in the table list, add the tables to the group.
  - If none or only some of the tables are in the list, import the tables.
Adding Tables to the Table Group
- On the Table Management page, click the Tables tab.
- Select the data tables to be added to the table group and choose Option > Add Tables to Group above the list. In the displayed dialog box, select the table group to which the data tables are to be added and click Confirm.
Importing Tables to the Table Group
- On the Table Management page, click the Tables tab.
- Choose Table Management > Import above the list.
- Select a metadata connection and the table group to which the tables will be added.
- Click Download to download the import template to the local PC. Open the import template and fill in the information of the tables to be added.
- A maximum of 10,000 tables can be imported at a time.
- Tables in a table group must come from the same metadata source.
- Cells in the template must not contain formulas and must be in text format. Otherwise, the table parsing will fail.
- If a Delta Lake (with metadata) or Hudi (with metadata) connection is selected for Metadata Connection, the source_path parameter in the template is mandatory.
- If a Delta Lake (without metadata) or Hudi (without metadata) connection is selected for Metadata Connection, the source_path and target_path parameters in the template are mandatory.
- Go back to the console and click Select File to upload the populated template to MgC.
- Click Confirm. Then you can view the tables in the table list.
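The template rules in the import procedure above can be checked before uploading. The following is a minimal sketch, not part of MgC: it assumes the template rows have already been loaded into a list of dictionaries, and the function name, the table_name column, and the exact connection-type labels are hypothetical. Only source_path and target_path are column names taken from this page.

```python
MAX_TABLES_PER_IMPORT = 10_000  # limit stated in the precautions

# Assumed labels for the connection types; the console wording may differ.
NEEDS_SOURCE_PATH = {"Delta Lake (with metadata)", "Hudi (with metadata)"}
NEEDS_BOTH_PATHS = {"Delta Lake (without metadata)", "Hudi (without metadata)"}


def validate_rows(rows, connection_type):
    """Return a list of problems found in the template rows.

    rows: list of dicts mapping column name -> cell value.
    connection_type: label of the selected metadata connection type.
    """
    errors = []
    if len(rows) > MAX_TABLES_PER_IMPORT:
        errors.append(
            f"{len(rows)} rows exceed the limit of "
            f"{MAX_TABLES_PER_IMPORT} tables per import"
        )

    required = []
    if connection_type in NEEDS_SOURCE_PATH:
        required = ["source_path"]
    elif connection_type in NEEDS_BOTH_PATHS:
        required = ["source_path", "target_path"]

    for i, row in enumerate(rows, start=1):
        for column, value in row.items():
            # Cells must be text and must not contain formulas.
            if not isinstance(value, str):
                errors.append(f"row {i}: {column} is not text")
            elif value.startswith("="):
                errors.append(f"row {i}: {column} looks like a formula")
        for column in required:
            if not str(row.get(column) or "").strip():
                errors.append(
                    f"row {i}: {column} is mandatory for {connection_type}"
                )
    return errors


if __name__ == "__main__":
    sample = [{"table_name": "t1", "source_path": "", "target_path": ""}]
    for problem in validate_rows(sample, "Hudi (without metadata)"):
        print(problem)
```

An empty result means the rows satisfy the rules listed on this page; it does not guarantee that MgC will accept the file.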
Modifying a Table
- On the Table Management page, click the Tables tab.
- Locate the table you want to modify and click Modify in the Operation column.
- Update the table settings according to Table 2.
Table 2 Parameters for modifying a table
Parameter | Description
Table Group | Change the table groups to which the data table belongs. You are advised to add a data table to no more than three table groups and to avoid table groups that use the same verification rule.
Business Owner (Optional) | Enter the owner of the business to which the data table belongs. This helps with problem tracking and responsibility attribution.
Analysis Owner (Optional) | Enter the person responsible for analyzing the data table. This helps with communicating about and handling data problems.
Source Table Path (Optional) | Enter the storage path of the data table in the source system.
Target Table Path (Optional) | Enter the storage path of the data table in the target system.
Description (Optional) | Enter remarks.
Tags (Optional) | Mark the data table as a core table or an active table. After a verification task is complete, the system collects statistics on the verification fulfillment rates of core tables and active tables and displays them in a line chart in the Monitoring area on the Big Data Verification page.
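Because the system does not check how many table groups a table belongs to or which rules those groups use, the recommendation on group membership has to be checked by hand. The following is a minimal sketch of such a check. The function name and the group names are hypothetical; the rule names count, sum, and allsum come from the precautions on this page.

```python
from collections import Counter

MAX_GROUPS_PER_TABLE = 3  # advisory limit, not enforced by the system


def check_group_membership(table_groups, group_rules):
    """Return advisory warnings for one table.

    table_groups: names of the table groups the table belongs to.
    group_rules: mapping of table group name -> verification rule name.
    """
    warnings = []
    if len(table_groups) > MAX_GROUPS_PER_TABLE:
        warnings.append(
            f"table is in {len(table_groups)} groups; "
            f"at most {MAX_GROUPS_PER_TABLE} are advised"
        )
    # Count how many of the table's groups use each verification rule.
    rule_counts = Counter(group_rules[g] for g in table_groups)
    for rule, count in sorted(rule_counts.items()):
        if count > 1:
            warnings.append(
                f"{count} groups share the verification rule '{rule}'"
            )
    return warnings


if __name__ == "__main__":
    rules = {"g1": "count", "g2": "count", "g3": "sum", "g4": "allsum"}
    for warning in check_group_membership(["g1", "g2", "g3", "g4"], rules):
        print(warning)
```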
Exporting Data Tables
You can export information about tables from Delta Lake (with metadata) and Hudi (with metadata) to CSV files. Information about data tables from sources without metadata cannot be exported.
- On the Table Management page, click the Tables tab.
- Choose Export > Export Data Table above the list.
- Select the table groups that contain the data tables to be exported and click Confirm.
- After the export is complete, choose Export > Manage Exports.
- Click Download in the Operation column to download the data table information to the local PC.