Updated on 2022-12-08 GMT+08:00

Introduction to ClickHouse

ClickHouse

ClickHouse is a column-based database oriented to online analysis and processing. It supports SQL query and provides good query performance. The aggregation analysis and query performance based on large and wide tables is excellent, which is one order of magnitude faster than other analytical databases.

Advantages for ClickHouse:

  • High data compression ratio
  • Multi-core parallel computing
  • Vectorized computing engine
  • Supporting nested data structure
  • Supporting sparse indexes
  • Supporting INSERT and UPDATE

ClickHouse application scenarios:

  • Real-time data warehouse

    The streaming computing engine (such as Flink) is used to write real-time data to ClickHouse. With the excellent query performance of ClickHouse, Multi-dimensional and multi-mode real-time query and analysis requests can be responded within subseconds.

  • Offline query

    Large-scale service data is imported to ClickHouse and constructs a large wide table with hundreds of millions to tens of billions of records and hundreds of dimensions. It supports personalized statistics collection and continuous exploratory query and analysis at any time to assist business decision-making and provides excellent query experience.

Introduction to the ClickHouse Development Interface

ClickHouse is developed using C++ and positioned as a DBMS. It supports HTTP and Native TCP network interface protocols and multiple driver modes such as JDBC and ODBC. You are advised to use clickhouse-jdbc of the community version for application development.