Data Consistency
- The preCombine field must be configured for Hudi tables.
You can specify the field via preCombineField when creating a Spark table.
- The partition formats of the Hudi tables must be unified.
If the same data is written in different partitioning styles, two different partitions are generated.
hoodie.datasource.write.hive_style_partitioning = false;

hoodie.datasource.write.hive_style_partitioning = true;

- The KeyGenerators of Hudi tables must be the same.
You can check whether the KeyGenerators are the same based on the value of _hoodie_record_key in each data record.
org.apache.hudi.keygen.ComplexKeyGenerator

org.apache.hudi.keygen.SimpleKeyGenerator

- If multiple engines are used to read data from and write data to Hudi tables, you are advised to use lowercase letters for Hudi tables.
- Execute the table service based on the specifications in Selecting a Proper Table Service Execution Mode. Do not change the specifications without permission.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot
