Updated on 2023-03-08 GMT+08:00

What Are the Differences Between GaussDB(DWS) and Hive in Functions?

GaussDB(DWS) and Hive differ in the following respects:

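  1. Hive is a data warehouse built on Hadoop; it compiles queries into batch jobs for an execution engine such as MapReduce (Tez and Spark are also supported). GaussDB(DWS) is a data warehouse built on PostgreSQL with a massively parallel processing (MPP) architecture.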
  2. Hive data is stored on HDFS. GaussDB(DWS) data can be stored on the cluster's local disks, or kept on OBS and accessed through foreign tables.
  3. Hive has no general-purpose index support (its limited indexing feature was removed in Hive 3.0). GaussDB(DWS) supports indexes, which can make selective queries faster.
  4. HiveQL does not support stored procedures natively (procedural logic requires the separate HPL/SQL tool). GaussDB(DWS) supports stored procedures, so it covers a wider range of application scenarios.
  5. Hive's SQL coverage is narrower than that of GaussDB(DWS), including in built-in functions, user-defined functions, and stored procedures.
  6. Hive offers only limited transaction support (ACID semantics apply only to tables created as transactional). GaussDB(DWS) supports full ACID transactions.
  7. Both Hive and GaussDB(DWS) support backups, so they offer comparable data reliability.
  8. GaussDB(DWS) generally delivers much better query performance than Hive, particularly for interactive queries.
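
Items 3 and 6 can be illustrated with a minimal sketch. It uses Python's built-in sqlite3 module as a local stand-in engine, an assumption made only so the code runs without a cluster; it is neither GaussDB(DWS) nor Hive. The table name orders and the index name idx_orders_region are hypothetical. The sketch creates a secondary index, then shows that a multi-statement transaction can be rolled back as a unit.

```python
import sqlite3

# Stand-in engine (assumption): in-memory SQLite, NOT GaussDB(DWS) or Hive.
conn = sqlite3.connect(":memory:")
conn.isolation_level = None  # autocommit; transactions are issued explicitly below

# Hypothetical table used only for illustration.
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, region TEXT, amount INTEGER)"
)
conn.executemany(
    "INSERT INTO orders (id, region, amount) VALUES (?, ?, ?)",
    [(1, "north", 100), (2, "south", 250), (3, "north", 75)],
)

# Item 3: a secondary index on a column that queries filter on.
conn.execute("CREATE INDEX idx_orders_region ON orders (region)")
index_names = [
    row[0]
    for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'index'")
]

# Ask the engine how it would run a filter on the indexed column.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM orders WHERE region = 'north'"
).fetchall()

# Item 6: two updates inside one transaction, undone together by ROLLBACK.
conn.execute("BEGIN")
conn.execute("UPDATE orders SET amount = amount - 50 WHERE id = 1")
conn.execute("UPDATE orders SET amount = amount + 50 WHERE id = 2")
conn.execute("ROLLBACK")

# After the rollback, the amounts are the ones originally inserted.
amounts = [row[0] for row in conn.execute("SELECT amount FROM orders ORDER BY id")]

print(index_names)
print(plan)
print(amounts)
```

On a GaussDB(DWS) cluster, comparable CREATE INDEX and BEGIN ... ROLLBACK statements would be sent through a PostgreSQL-compatible client; the exact syntax and options should be checked against the GaussDB(DWS) SQL reference.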

Given these differences, Hive is suited to offline (batch) analysis, while GaussDB(DWS) is suited to both online analysis and ad hoc queries.