What Should I Do If the MapReduce Engine Cannot Query the Data Written by the Union Statement Running on Tez?

Question

Hive uses the Tez engine to execute union-related statements to write data. After Hive is switched to the MapReduce engine for query, no data is found.

Answer

When Hive uses the Tez engine to execute the union-related statement, the generated output file is stored in the HIVE_UNION_SUBDIR directory. After Hive is switched back to the MapReduce engine, files in the directory are not read by default. Therefore, data in the HIVE_UNION_SUBDIR directory is not read.

In this case, you can set mapreduce.input.fileinputformat.input.dir.recursive to true to enable union optimization and determine whether to read data in the directory.

Parent topic: Common Issues About Hive

Previous topic: Does the Location of a Hive Table Support Cross-OBS and Cross-HDFS Paths?

Next topic: Does Hive Support Concurrent Data Writing to the Same Table or Partition?

Feedback

Was this page helpful?

Helpful Not helpful

Provide feedback

Thank you very much for your feedback. We will continue working to improve the documentation.

The system is busy. Please try again later.