Help Center/ MapReduce Service/ User Guide (Ankara Region)/ Overview/ Components/ MemArtsCC/ Relationships Between MemArtsCC and Other Components
Updated on 2024-11-29 GMT+08:00

Relationships Between MemArtsCC and Other Components

OBS

OBS provides a new InputStream: OBSMemArtsCCInputStream. This InputStream reads data from the MemArtsCC cluster deployed on the compute side to reduce OBS server pressure and improve data read performance.

MemArtsCC persistently stores data to the storage (SSD) on the compute side. OBS interconnects with MemArtsCC to:

  1. Improve the data access performance of the architecture where storage and compute are decoupled.

    The local storage of MemArtsCC avoids the cross-network access of hotspot data. This accelerates the data reads of OBS upper-layer applications.

  1. Reduce the pressure on the OBS server.

    MemArtsCC stores hotspot data in the compute cluster to reduce the bandwidth pressure of the OBS server.

Spark

Spark reads data from OBS. OBS reads data from MemArtsCC. If data is hit in the local cache, the data is read directly. Otherwise, the data is prefetched.

Hive

Hive reads data from OBS. OBS reads data from MemArtsCC. If data is hit in the local cache, the data is read directly. Otherwise, the data is prefetched.

HetuEngine

HetuEngine reads data from OBS. OBS reads data from MemArtsCC. If data is hit in the local cache, the data is read directly. Otherwise, the data is prefetched.