Updated on 2024-10-09 GMT+08:00

Configuring Local Disk Cache for JobHistory

Scenarios

JobHistory can use local disks to cache historical data of Spark applications to prevent large volumes of application data from being loaded to the JobHistory memory and reduce memory usage. In addition, the cached data can be reused to accelerate access to the same application.

Parameters

Log in to FusionInsight Manager, choose Cluster > Services > Spark2x > Configurations, click All Configurations, and search for the following parameters.

Parameter

Description

Default Value

spark.history.store.path

Local directory for JobHistory to cache historical data. If this parameter is configured, JobHistory caches historical application data in local disks instead of the memory.

${BIGDATA_HOME}/tmp/spark2x_JobHistory

spark.history.store.maxDiskUsage

Maximum available space for JobHistory to caching data in local disks

10g