Exporting Search Results
Function
This statement writes query results directly to a specified directory. The results can be stored in CSV, Parquet, ORC, JSON, or Avro format.
Syntax
INSERT OVERWRITE DIRECTORY path
USING file_format
[OPTIONS(key1=value1)]
select_statement;
Keyword
- USING: Specifies the storage format.
- OPTIONS: Specifies export attributes, such as the delimiter for CSV files. This parameter is optional.
Parameter
Parameter | Description
---|---
path | The OBS path to which the query result is written.
file_format | Format of the file to be written. The value can be CSV, Parquet, ORC, JSON, or Avro.

If the file format is set to CSV, see Table 3 for the OPTIONS parameters.
Precautions
- You can configure the spark.sql.shuffle.partitions parameter to control the number of files written to the OBS bucket. In addition, to avoid data skew, you can append distribute by rand() to the INSERT statement to increase the number of concurrent jobs. The following is an example:
insert into table table_target select * from table_source distribute by cast(rand() * N as int);
- You can specify a separator with the configuration item OPTIONS('DELIMITER'=','). The default value is a comma (,).
For CSV data, the following delimiters are supported:
- Tab character, for example, 'DELIMITER'='\t'.
- Any binary character, for example, 'DELIMITER'='\u0001(^A)'.
- Single quotation mark ('). A single quotation mark must be enclosed in double quotation marks (" "). For example, 'DELIMITER'= "'".
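The delimiter options above can be combined with an export statement. A minimal sketch, assuming a hypothetical bucket path and table name:

```sql
-- Export the table as tab-separated CSV files.
-- 'obs://my-bucket/output/tsv' and db1.tb1 are placeholder names.
INSERT OVERWRITE DIRECTORY 'obs://my-bucket/output/tsv'
USING csv
OPTIONS('DELIMITER'='\t')
SELECT * FROM db1.tb1;
```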
Example
INSERT OVERWRITE DIRECTORY 'obs://bucket/dir'
USING csv
OPTIONS(key1=value1)
select * from db1.tb1;
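For formats other than CSV, the OPTIONS clause can be omitted. A fuller sketch combining a Parquet export with the distribute by technique from Precautions to control file count (the bucket path, table, and column names are hypothetical):

```sql
-- Export selected columns as Parquet, spreading rows across
-- roughly 10 concurrent tasks to avoid data skew.
INSERT OVERWRITE DIRECTORY 'obs://my-bucket/output/parquet'
USING parquet
SELECT id, name
FROM db1.tb1
DISTRIBUTE BY cast(rand() * 10 as int);
```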