Spark Read/Write Hudi Development Specifications
Parameter specifications for the write modes Spark uses to write to Hudi
Type | Description | Enable parameter | Scenario selection | Features |
---|---|---|---|---|
upsert | Update plus insert. This is the default Hudi write type and supports updates. | Enabled by default; no parameter needs to be set. | The default choice. | Pros: Disadvantages: |
append | Writes data directly, without updates. | | High-throughput scenarios with no data updates. | Pros: Disadvantages: |
delete | Deletes data. | No parameter is required. Use the delete syntax directly: `delete from tableName where primaryKey='id1';` | Deleting data with SQL statements. | Same as the upsert type. |
insert overwrite | Overwrites a partition. | No parameter is required. Use the insert overwrite syntax directly: `insert overwrite table tableName partition (dt = '2021-01-04') select * from srcTable;` | Rewriting data at the partition level. | Overwrites the partition. |
insert overwrite table | Overwrites the entire table. | No parameter is required. Use the insert overwrite syntax directly: `insert overwrite table tableName select * from srcTable;` | Rewriting all data in the table. | Overwrites the entire table. |
bulk_insert | Batch import. | | Recommended for table initialization and table migration. | Same as the append type. |
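
When writing through the Spark DataFrame API rather than SQL, the write type is usually selected with the Hudi option `hoodie.datasource.write.operation`. The following is a minimal sketch, not a definitive implementation: the helper `hudi_write_options` and the field name `ts` are hypothetical, and mapping the "append" type in the table to the Hudi operation `insert` is an assumption that should be checked against the Hudi version in use.

```python
# Sketch: map the write types from the table above to Hudi writer options.
# Assumption: the "append" type corresponds to the Hudi operation "insert".
WRITE_OPERATIONS = {
    "upsert": "upsert",                                  # default write type
    "append": "insert",                                  # assumed mapping
    "delete": "delete",
    "insert overwrite": "insert_overwrite",              # one partition
    "insert overwrite table": "insert_overwrite_table",  # whole table
    "bulk_insert": "bulk_insert",
}


def hudi_write_options(write_type, table_name, record_key, precombine_field):
    """Build an options dict for a Hudi DataFrame write (hypothetical helper)."""
    key = write_type.strip().lower()
    if key not in WRITE_OPERATIONS:
        raise ValueError(f"unknown write type: {write_type!r}")
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.write.precombine.field": precombine_field,
        "hoodie.datasource.write.operation": WRITE_OPERATIONS[key],
    }


if __name__ == "__main__":
    # Table and column names here are placeholders for illustration only.
    opts = hudi_write_options("bulk_insert", "tableName", "primaryKey", "ts")
    for name, value in sorted(opts.items()):
        print(f"{name}={value}")
```

In a PySpark job, a dict like this could be passed to the writer, for example `df.write.format("hudi").options(**opts).mode("append").save(path)`, where `df` and `path` are supplied by the job and `mode("append")` is the Spark save mode, not the Hudi write type.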