Configuring Hive Transform
Scenario
The Hive Transform function allows users to embed external scripts (such as Python and Java) in Hive queries to perform complex data conversion and processing. This enables users to execute custom data processing logic that cannot be implemented using standard Hive functions. The Transform operation transfers data to external scripts for processing and then returns the results to Hive, implementing data cleaning, conversion, and enrichment. This function extends the capabilities of Hive to meet more complex ETL requirements.
The Transform function is disabled on the open-source Hive. MRS Hive supports the configuration of the Transform function. The function is disabled by default, which is the same as that of the open-source community version. You can toggle the switch to enable the Transform function. Exercise caution when enabling the Transform function because it may pose security risks.
Notes and Constraints
Transform can only be disabled if Kerberos authentication is enabled for the cluster (in security mode).
Procedure
- Log in to FusionInsight Manager and choose Cluster > Services > Hive. Click Configurations then All Configurations.
- Search for hive.security.transform.disallow in the search box, change its value to true or false, and click Save.
- If this parameter is set to true, the Transform function is disabled, which is consistent with that in the open-source Hive.
- If this parameter is set to false, the Transform function is enabled, which may pose security risks. Exercise caution when enabling this function.
- Click the Instances tab, select all HiveServer instances, click More > Restart Instance, enter the user password, and click OK to restart all HiveServer instances.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot