To Hive
If the destination link of a job is the Link to Hive, configure the destination job parameters based on Table 1.
Parameter |
Description |
Example Value |
---|---|---|
Database Name |
Database name. Click the icon next to the text box. The dialog box for selecting the database is displayed. |
default |
Auto Table Creation |
This parameter is displayed only when the source is a relational database. The options are as follows:
|
Non-auto creation |
Table Name |
Destination table name. Click the icon next to the text box. The dialog box for selecting the table is displayed. This parameter can be configured as a macro variable of date and time and a path name can contain multiple macro variables. When the macro variable of date and time works with a scheduled job, the incremental data can be synchronized periodically. For details, see Incremental Synchronization Using the Macro Variables of Date and Time.
NOTE:
If you have configured a macro variable of date and time and schedule a CDM job through DataArts Studio DataArts Factory, the system replaces the macro variable of date and time with (Planned start time of the data development job – Offset) rather than (Actual start time of the CDM job – Offset). |
TBL_X |
Clear Data Before Import |
Whether the data in the destination table is cleared before data import. The options are as follows:
|
Yes |
Partition to Clear |
This parameter is available when Clear Data Before Import is set to Yes. When you enter the information about the partitions to be cleared, the data in the partitions will be cleared. |
Single partition: year=2020,location=sun Multiple partitions: ['year=2020,location=sun', 'year=2021,location=earth'] |
- When Hive serves as the destination end, a table whose storage format is ORC is automatically created.
- When Hive serves as the migration destination, if the storage format is TEXTFILE, delimiters must be explicitly specified in the statement for creating Hive tables. The following gives an example:
CREATE TABLE csv_tbl( smallint_value smallint, tinyint_value tinyint, int_value int, bigint_value bigint, float_value float, double_value double, decimal_value decimal(9, 7), timestmamp_value timestamp, date_value date, varchar_value varchar(100), string_value string, char_value char(20), boolean_value boolean, binary_value binary, varchar_null varchar(100), string_null string, char_null char(20), int_null int ) ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde' WITH SERDEPROPERTIES ( "separatorChar" = "\t", "quoteChar" = "'", "escapeChar" = "\\" ) STORED AS TEXTFILE;
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.