Updated on 2022-12-09 GMT+08:00

Alluxio

Alluxio is data orchestration technology for analytics and AI for the cloud. In the MRS big data ecosystem, Alluxio lies between computing and storage. It provides a data abstraction layer for computing frameworks including Apache Spark, Presto, MapReduce, and Apache Hive, so that upper-layer computing applications can access persistent storage systems including HDFS and OBS through unified client APIs and a global namespace. In this way, computing and storage are separated.

Figure 1 Alluxio architecture

Advantages:

  • Provides in-memory I/O throughput, and makes elastically scale data-driven applications cost effective.
  • Simplified cloud and object storage access
  • Simplified data management and a single point of access to multiple data sources
  • Easy application deployment

For details about Alluxio, visit https://docs.alluxio.io/os/user/stable/en/Overview.html.