Processing Data
DRS processes synchronized objects and allows you to add rules for selected objects. The processing rules supported by each data flow type are different. Currently, only some data flow types support data processing. For details, see Table 1.
Synchronization Direction |
Data Flow |
Data Filtering |
Additional Column |
Column Processing |
---|---|---|---|---|
To the cloud |
MySQL->MySQL |
Supported |
Supported |
Supported |
To the cloud |
MySQL -> GaussDB distributed |
Supported |
Supported |
Supported |
To the cloud |
MySQL -> GaussDB primary/standby |
Supported |
Supported |
Supported |
To the cloud |
MySQL->GaussDB(DWS) |
Supported |
Supported |
Not supported |
To the cloud |
MySQL->GaussDB(for MySQL) |
Supported |
Supported |
Supported |
To the cloud |
MySQL->MariaDB |
Supported |
Supported |
Supported |
To the cloud |
DDM->MySQL |
Not supported |
Not supported |
Supported |
To the cloud |
DDM->GaussDB(DWS) |
Not supported |
Supported |
Not supported |
To the cloud |
Oracle->GaussDB(DWS) |
Supported |
Supported |
Not supported |
To the cloud |
Oracle->MySQL |
Supported |
Not supported |
Not supported |
To the cloud |
Oracle->GaussDB(for MySQL) |
Supported |
Not supported |
Not supported |
To the cloud |
Oracle -> GaussDB primary/standby |
Supported |
Not supported |
Supported |
To the cloud |
Oracle -> GaussDB distributed |
Supported |
Not supported |
Supported |
To the cloud |
DB2 for LUW -> GaussDB primary/standby |
Supported |
Not supported |
Not supported |
To the cloud |
DB2 for LUW -> GaussDB distributed |
Supported |
Not supported |
Not supported |
To the cloud |
MariaDB->MariaDB |
Supported |
Not supported |
Not supported |
To the cloud |
MariaDB->MySQL |
Supported |
Supported |
Supported |
To the cloud |
MariaDB->GaussDB(for MySQL) |
Supported |
Supported |
Supported |
From the cloud |
MySQL->MySQL |
Supported |
Supported |
Supported |
From the cloud |
MySQL->Kafka |
Not supported |
Not supported |
Supported |
From the cloud |
MySQL->CSS/ES |
Supported |
Not supported |
Supported |
From the cloud |
MySQL->Oracle |
Supported |
Not supported |
Not supported |
From the cloud |
MySQL->MariaDB |
Supported |
Supported |
Supported |
From the cloud |
DDM->MySQL |
Not supported |
Not supported |
Supported |
From the cloud |
GaussDB primary/standby -> MySQL |
Supported |
Not supported |
Not supported |
From the cloud |
GaussDB primary/standby -> Oracle |
Supported |
Not supported |
Supported |
From the cloud |
GaussDB primary/standby -> Kafka |
Not supported |
Not supported |
Supported |
From the cloud |
GaussDB primary/standby -> GaussDB(DWS) |
Supported |
Not supported |
Not supported |
From the cloud |
GaussDB primary/standby -> GaussDB distributed |
Supported |
Not supported |
Supported |
From the cloud |
GaussDB primary/standby -> GaussDB primary/standby |
Supported |
Not supported |
Supported |
From the cloud |
GaussDB distributed -> MySQL |
Supported |
Not supported |
Not supported |
From the cloud |
GaussDB distributed -> Oracle |
Supported |
Not supported |
Supported |
From the cloud |
GaussDB distributed -> GaussDB(DWS) |
Supported |
Not supported |
Not supported |
From the cloud |
GaussDB distributed -> Kafka |
Not supported |
Not supported |
Supported |
From the cloud |
GaussDB distributed -> GaussDB distributed |
Supported |
Not supported |
Supported |
From the cloud |
GaussDB distributed->GaussDB primary/standby |
Supported |
Not supported |
Supported |
From the cloud |
GaussDB(for MySQL)->MySQL |
Supported |
Supported |
Not supported |
From the cloud |
GaussDB(for MySQL)->GaussDB(DWS) |
Not supported |
Supported |
Not supported |
From the cloud |
GaussDB(for MySQL)->CSS/ES |
Supported |
Not supported |
Supported |
From the cloud |
MariaDB->MariaDB |
Supported |
Not supported |
Not supported |
Self-built -> Self-built |
MySQL->Kafka |
Not supported |
Not supported |
Supported |
Self-built -> Self-built |
MySQL->CSS/ES |
Supported |
Not supported |
Supported |
Self-built -> Self-built |
MySQL -> GaussDB Distributed |
Supported |
Supported |
Supported |
Self-built -> Self-built |
MySQL -> GaussDB Primary/Standby |
Supported |
Supported |
Supported |
Self-built -> Self-built |
Oracle -> GaussDB primary/standby |
Supported |
Not supported |
Supported |
Self-built -> Self-built |
Oracle -> GaussDB distributed |
Supported |
Not supported |
Supported |
Self-built -> Self-built |
GaussDB primary/standby -> Kafka |
Not supported |
Not supported |
Supported |
Self-built -> Self-built |
GaussDB distributed -> Kafka |
Not supported |
Not supported |
Supported |
Self-built -> Self-built |
DB2 for LUW -> GaussDB primary/standby |
Supported |
Not supported |
Not supported |
Self-built -> Self-built |
DB2 for LUW -> GaussDB distributed |
Supported |
Not supported |
Not supported |
Adding Additional Columns
- On the Process Data page of the real-time synchronization task, click Additional Columns, locate the table to be processed, and click Add in the Operation column.
Figure 1 Additional columns
- In the displayed Add dialog box, specify the column name, operation type, and field type. Click OK.
Figure 2 Operation types
- In many-to-one mapping scenarios, additional columns for data processing are required to avoid data conflicts.
- The following operation types are supported:
- Default: Use the default value to fill in the new column.
- Use the create_time column and update_time column as an example to fill the new column with the data creation time and data update time.
- Expression: Use the concat(_current_database, '@',_current_table) expression to fill in the new column. You cannot manually enter an expression.
- If you fill in the new column in serverName@database@table format, you need to enter a server name and then the database name and table name will be automatically filled in.
- Value: Select a value, for example, synchronization time.
- You can apply the additional column information of the first editable table to all editable tables in batches.
- During MySQL to GaussDB(for MySQL) synchronization, if the number of columns in a single table exceeds 500, the number of additional columns added to the table may exceed the upper limit. As a result, the task fails.
- If serverName@database@table is used to add an additional column, this additional column will be used on the destination database as an implicit filtering condition for row comparison and value comparison by default.
- For a table with additional columns, the DDL operations of dropping a table and then creating a table are not supported in the incremental synchronization phase.
- During task editing in the many-to-one mapping scenario, if the new table has been synchronized, many-to-one mapping has been performed, and additional columns have been set, you need to reset the additional columns for the table. Otherwise, the additional column settings in the last synchronization are retained by default.
- Click Next.
Filtering Data
- The filter criteria are met after the update. You need to continue the synchronization and perform the same update operation on the destination database. If no data is matched, the operation will be ignored, causing data inconsistency.
- The filter criteria are not met after the update. You need to continue the synchronization and perform the same update operation on the destination database.
- On the Processing Data page, set Processing Type to Data filtering.
Figure 3 Filtering data
- In the Object area, select the table to be processed.
- In the Filtering Criteria area, enter the filter criteria (only the part after WHERE in the SQL statement, for example, id=1), and click Verify.
- Each table has only one verification rule.
- Up to 512 tables can be filtered at a time. If there are more than 512 tables, perform rule verifications in batches.
- The filter expression cannot use the package, function, variable, or constant of a specific DB engine. It must comply with the general SQL standard. Enter the part following WHERE in the SQL statement (excluding WHERE and semicolons), for example, sid > 3 and sname like "G %". A maximum of 512 characters are allowed.
- In SQL statements for setting filter criteria, keywords must be enclosed in backquotes, and the value of datetime (including date and time) and character string type must be enclosed in single quotation marks, for example, `update` > '2022-07-13 00:00:00' and age >10, `update` ='abc'.
- If the TIMESTAMP type is used as a filtering condition, the time of the character type must be set to the time value in the UTC time zone. For example, in MySQL, the TIMESTAMP data is stored based on the UTC time zone. You need to use the time value in the UTC time zone for comparison.
- Implicit conversion rules are not supported. Enter filtering criteria of a valid data type. For example, if column c of an Oracle database uses characters of the varchar2 type, the filtering criteria must be set to c > '10' instead of c > 10.
- Filter criteria cannot be configured for large objects, such as CLOB, BLOB, and BYTEA.
- Filtering rules cannot be set for objects whose database names and table names contain newline characters.
- The syntax of row-level locks, such as for update, cannot be used as filtering criteria.
- Function operations cannot be performed on column names. If function operations are performed, data may be inconsistent.
- You are not advised to set filter criteria for fields of approximate numeric types, such as FLOAT, DECIMAL, and DOUBLE.
- Do not use fields containing special characters as a filter condition.
- You are advised not to perform DDL operations on columns involved in filter criteria. Otherwise, task exceptions may occur.
- You are not advised to use non-idempotent expressions or functions as data processing conditions, such as SYSTIMESTAMP and SYSDATE, because the returned result may be different each time the function is called.
- The filtering rules for a synchronized table cannot be modified.
- During data filtering for real-time synchronization with Oracle serving as the source database, the fixed-length character types NCHAR and CHAR must be matched using complete fixed-length characters.
- After the verification is successful, click Generate Processing Rule. The rule is displayed.
- Click Next.
Advanced Settings for Data Filtering
If you need to query an association table, you can use the advanced settings of data processing.
- On the Process Data page of the real-time synchronization task, set Processing Type to Data filtering.
- In the Object area, select the table to be processed.
- In the Filtering Criteria area, specify the filtering criteria, for example, id1 in (select id from db1.tab1 where id >=3 and id <10), and click Verify.
- Each table has only one verification rule.
- Up to 512 tables can be filtered at a time. If there are more than 512 tables, perform rule verifications in batches.
- The filter expression cannot use the package, function, variable, or constant of a specific DB engine. It must comply with the general SQL standard. Enter the part following WHERE in the SQL statement (excluding WHERE and semicolons), for example, sid > 3 and sname like "G %". A maximum of 512 characters are allowed.
- Implicit conversion rules are not supported. Enter filtering criteria of a valid data type. For example, if column c of an Oracle database uses characters of the varchar2 type, the filtering criteria must be set to c > '10' instead of c > 10.
- Filter criteria cannot be configured for large objects, such as CLOB, BLOB, and BYTEA.
- Filtering rules cannot be set for objects whose database names and table names contain newline characters.
- The syntax of row-level locks, such as for update, cannot be used as filtering criteria.
- Data changes in a referenced table are not supported, which may cause data inconsistency during synchronization.
- You are not advised to set filter criteria for fields of approximate numeric types, such as FLOAT, DECIMAL, and DOUBLE.
- Do not use fields containing special characters as a filter condition.
- You are not advised to use non-idempotent expressions or functions as data processing conditions, such as SYSTIMESTAMP and SYSDATE, because the returned result may be different each time the function is called.
- During data filtering for real-time synchronization with Oracle serving as the source database, the fixed-length character types NCHAR and CHAR must be matched using complete fixed-length characters.
- After the verification is successful, click Generate Processing Rule. The rule is displayed.
- In the Advanced Settings area, specify the configuration condition and rule for the association table to help you filter data.
Figure 4 Advanced settings
- In the Configuration Condition area, enter the association table information entered in 3.
Database Name, Table Name, Column Name, Primary Key, Index, and Filter Criteria are mandatory. If the table does not have an index, enter its primary key.
Filter Criteria is the filter condition of the association table information entered in 3.
- Then, click Verify.
- After the verification is successful, click Generate Configuration Rule. The rule is displayed in the Configuration Rule area.
To filter data in multiple association tables, repeat 5.
Configuration rules can be deleted.
- In the Configuration Condition area, enter the association table information entered in 3.
- Click Next.
Processing Columns
- On the Process Data page of the real-time synchronization task, select Processing Columns.
- Select a column processing mode.
Only MySQL-to-GaussDB and Oracle-to-GaussDB synchronization tasks support column processing by importing files. For other tasks, column processing is performed by selecting objects by default.
- Select Objects
- In the Object area, select the objects to be processed.
Figure 5 Processing columns
- Click Edit to the right of the selected object.
- In the Edit Column dialog box, select the columns to be mapped and enter new column names.
Figure 6 Editing a column
- You can query or filter columns or create new column names.
- After the column name is edited, the column name of the destination database is changed to the new name.
- The new column name cannot be the same as the original column name or an existing column name.
- Columns whose database names or table names contain newline characters cannot be mapped.
- The column name in the synchronized table cannot be modified.
- Only selected columns are synchronized. Newly-added columns are not included in column processing.
- The partitioned table does not support column mapping or column filtering.
- In the incremental phase, DDL operations cannot be performed on filtered, mapped, additional columns in a table.
- For a table on which column filtering, column mapping, and additional column adding are performed, the DDL operations of dropping a table and then creating a table are not supported in the incremental synchronization phase.
- If the source database is MySQL or GaussDB(for MySQL), column filtering and mapping are not supported for columns that have function-based indexes.
- Click OK.
- In the Object area, select the objects to be processed.
- Import object file
- On the Process Data page of the real-time synchronization task, choose Processing Columns > Import object file.
- Click Download Template.
Figure 7 Processing columns
- In the downloaded Excel file, enter information about the objects to be imported.
- Click Select File. In the displayed dialog box, select the edited template.
- Click Upload.
- Select Objects
- Click Next.
Viewing Data Filtering Results
- On the Data Synchronization Management page, click the task to be processed.
- Click the Process Data tab to view data filtering records. Click in the upper right corner to refresh the record list.
View Column Processing
- On the Data Synchronization Management page, click the target synchronization task name in the Task Name/ID column.
- In the navigation pane on the left, choose Synchronization Mapping. In the upper right corner, and select Columns to view column mapping records. Click in the upper right corner to refresh the record list.
Figure 8 Viewing column mappings
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot