Typical Scenario Description
Scenarios
A user develops a Hive data analysis application for managing employee information described in Table 1 and Table 2.
Procedure
- Prepare data.
- Create three tables: employee information table employees_info, contact table employees_contact, and extended employee information table employees_info_extended.
- Fields in the employees_info table include the employee ID, name, salary currency, salary, tax category, work place, and hiring date. R indicates RMB, and D indicates USD.
- Fields in the employees_contact table include the employee ID, mobile phone number, and e-mail address.
- Fields in the employees_info_extended table include the employee ID, name, mobile phone number, e-mail address, salary currency, salary, tax category, and work place. The partition field is the hiring date.
For table creation codes, see Creating a Table.
- Load employee information to employees_info.
For data loading codes, see Loading Data.
Table 1 describes employee information.
Table 1 Employee information ID
Name
Salary Currency
Salary
Tax Category
Work Place
Hiring Date
1
Wang
R
8000.01
personal income tax&0.05
China:Shenzhen
2014
3
Tom
D
12000.02
personal income tax&0.09
America:NewYork
2014
4
Jack
D
24000.03
personal income tax&0.09
America:Manhattan
2014
6
Linda
D
36000.04
personal income tax&0.09
America:NewYork
2014
8
Zhang
R
9000.05
personal income tax&0.05
China:Shanghai
2014
- Load employee contact information to employees_contact.
Table 2 describes employee contact information.
- Create three tables: employee information table employees_info, contact table employees_contact, and extended employee information table employees_info_extended.
- Analyze data.
For data analysis codes, see Querying Data.
- Query contact information of employees whose salaries are paid in USD.
- Query the IDs and names of employees who were hired in 2014, and load query results to the partition with the hiring time of 2014 in employees_info_extended.
- Collect statistics for the number of records in the employees_info table.
- Query information about employees whose email addresses end with "cn".
- Submit a data analysis task to collect statistics for the number of records in the employees_info table.
For details about the implementation, see Example Program Guide.
Last Article: Developing an Application
Next Article: Example Codes
Did this article solve your problem?
Thank you for your score!Your feedback would help us improve the website.