Accessing Hive Using Python 3

Use Python 3 to connect to Hive to execute data analysis tasks.

You can execute sample analysis tasks provided in the hive-examples/python3-examples/pyCLI_sec.py file.

Import Hive classes.
```
from pyhive import hive
```
Create a JDBC connection.
```
connection = hive.Connection(host='hiveserverIp', port=hiveserverPort, username='hive', database='default', auth='KERBEROS', kerberos_service_name="hive", krbhost='hadoop.hadoop.com')
```
Modify the following parameters based on the site requirements:
- hiveserverIp: Replace it with the IP address of the HiveServer node you want to connect. You can log in to FusionInsight Manager and choose Cluster > Service > Hive and click the Instances tab to view the IP address.
- hiveserverPort: Replace it with the port of the Hive service. To view the port number, log in to FusionInsight Manager, choose Cluster > Service > Hive and click the Configuration tab. Search for hive.server2.thrift.port. The default value is 10000.
Run the statement. The sample code only queries all tables. You can modify the HiveQL statements as you need.
```
cursor = connection.cursor()
cursor.execute('show tables')
```

Obtain and output the result.

for result in cursor.fetchall():
    print(result)

Parent topic: Developing an Application

Thank you very much for your feedback. We will continue working to improve the documentation.

The system is busy. Please try again later.

Which of the following issues have you encountered?

Content is inconsistent with the product UI

Unclear descriptions

Lack of examples or code

Incorrect steps

Can't find what I need

Lack of best practices

Feedback (optional)

0/500

Select at least one type of issue, and enter your comments or suggestions.

Enter a maximum of 500 characters.

Submit Cancel