Updated on 2024-10-09 GMT+08:00

Configuring SparkSQL for Hudi Schema Evolution

  • Schema evolution cannot be disabled once being enabled.
  • This topic is available for MRS 3.2.0 and earlier versions only.
  • To use spark-beeline, log in to FusionInsight Manager, choose Cluster > Services > Spark2x, and click the Configurations tab then the All Configurations sub-tab.

    Search for spark.sql.extensions in the search box and change its value of JDBCServer to org.apache.spark.sql.hive.FISparkSessionExtension,org.apache.spark.sql.hudi.HoodieSparkSessionExtension,org.apache.spark.sql.hive.CarbonInternalExtensions.

  • For SQL operations, run the following command before running any SQL statements:
    set hoodie.schema.evolution.enable=true
  • For API calls, specify the following parameter in DataFrame options:
    hoodie.schema.evolution.enable -> true