Importing and Configuring Spark Sample Projects
Overview
Spark provides sample projects for different scenarios, including Java and Scala projects, to help users quickly understand Spark projects.
The import methods for Java and Scala projects are the same Python sample projects do not need to be imported and you only need to open the Python file (*.py).
The import of Java sample codes is used as a sample in the following procedure. Figure 1 shows the procedure of importing sample projects.
Procedure
- Obtain multiple sample projects such as Scala and Spark Streaming in the sparknormal-examples folder within the spark-examples directory where the sample code is decompressed. For details, see Obtaining the MRS Application Development Sample Project.
- After installing the IntelliJ IDEA and JDK, configure the JDK in IntelliJ IDEA.
- Start IntelliJ IDEA and select Configure.
Figure 2 Quick Start
- Choose Project Defaults from the Configure drop-down list.
Figure 3 Configure
- Select Project Structure from Project Defaults.
Figure 4 Project Defaults
- On the Project Structure page, select SDKs and click the plus sign to add the JDK.
Figure 5 Adding the JDK
- On the displayed Select Home Directory for JDK page, select the JDK directory and click OK.
Figure 6 Selecting a home directory for the JDK
- After selecting the JDK, click OK to complete the configuration.
Figure 7 Completing the configuration
- Start IntelliJ IDEA and select Configure.
- (Optional) If the Scala development environment is used, install the Scala plugin in IntelliJ IDEA.
- Select Plugins from the Configure drop-down list.
Figure 8 Plugins
- On the Plugins page, select Install plugin from disk.
Figure 9 Install plugin from disk
- On the Choose Plugin File page, select the Scala plugin file of the corresponding version and click OK.
- On the Plugins page, click Apply to install the Scala plugin.
- On the displayed Plugins Changed page, click Restart for the configuration to take effect.
Figure 10 Plugins Changed
- Select Plugins from the Configure drop-down list.
- Import the Java sample projects to IntelliJ IDEA.
- Start IntelliJ IDEA. On the Quick Start page, select Import Project.
Alternatively, for the used IDEA tool, add the project directly from the IDEA home page. Selectto import projects.Figure 11 Importing a project (on the Quick Start page)
- Select the path and folder for the to-be-imported project and click OK.
Figure 12 Select File or Directory to Import
- Confirm the location and name of the project and click Next.
Figure 13 Import Project from Maven
- Select the project to be imported and click Next.
- Confirm the project JDK and click Next.
Figure 14 Select project SDK
- Confirm the project name and project file location, and click Finish to complete the import.
Figure 15 Confirm the project name and file location
- After the import, the imported projects are displayed on the IDEA home page.
Figure 16 Imported projects
- Start IntelliJ IDEA. On the Quick Start page, select Import Project.
- (Optional) If a sample project developed in Scala is imported, configure the language for the project.
- On the main page of the IDEA, choose Project Structure page. to access the
- Choose Modules, right-click the project name, and choose .
Figure 17 Selecting the Scala language
- Wait until IDEA identifies Scala SDK, select the dependency JAR packages in the Add Scala Support dialog box, and click OK.
Figure 18 Add Scala Support
- If IDEA fails to identify Scala SDK, create a Scala SDK.
- Click Create....
Figure 19 Create...
- On the Select JAR's for the new Scala SDK page, click Browse....
Figure 20 Select JAR's for the new Scala SDK
- On the Scala SDK files page, select the scala sdk directory, and click OK.
Figure 21 Scala SDK files
- Click Create....
- Click OK.
Figure 22 Successful configuration
- Set the file encoding format of IntelliJ IDEA to avoid garbled characters.
- On the IDEA homepage, choose Figure 23 Choosing Settings
.
- Configure encoding.
- On the Settings page, choose Editor > File Encodings.
- In the Global Encoding and Project Encoding drop-down lists, select UTF-8, respectively.
- Click Apply.
- Click OK.
- On the IDEA homepage, choose
Sample Code Path Description
Code Project |
Name |
Development Language |
---|---|---|
SparkJavaExample |
Spark Core Project |
Java |
SparkScalaExample |
Spark Core Project |
Scala |
SparkPyhtonExample |
Spark Core Project |
Python |
SparkSQLJavaExample |
Spark SQL Project |
Java |
SparkSQLScalaExample |
Spark SQL Project |
Scala |
SparkSQLPythonExample |
Spark SQL Project |
Python |
SparkThriftServerJavaExample |
Accessing the Spark SQL Through JDBC |
Java |
SparkThriftServerScalaExample |
Accessing the Spark SQL Through JDBC |
Scala |
SparkOnHbaseJavaExample-AvroSource |
Spark on HBase-Performing Operations on Data in Avro Format |
Java |
SparkOnHbaseScalaExample-AvroSource |
Spark on HBase-Performing Operations on Data in Avro Format |
Scala |
SparkOnHbasePythonExample-AvroSource |
Spark on HBase-Performing Operations on Data in Avro Format |
Python |
SparkOnHbaseJavaExample-HbaseSource |
Spark on HBase-Performing Operations on the HBase Data Source |
Java |
SparkOnHbaseScalaExample-HbaseSource |
Spark on HBase-Performing Operations on the HBase Data Source |
Scala |
SparkOnHbasePythonExample-HbaseSource |
Spark on HBase-Performing Operations on the HBase Data Source |
Python |
SparkOnHbaseJavaExample-JavaHBaseBulkPutExample |
Spark on HBase-Using the BulkPut API |
Java |
SparkOnHbaseScalaExample-HBaseBulkPutExample |
Spark on HBase-Using the BulkPut API |
Scala |
SparkOnHbasePythonExample-HBaseBulkPutExample |
Spark on HBase-Using the BulkPut API |
Python |
SparkOnHbaseJavaExample-JavaHBaseBulkGetExample |
Spark on HBase-Using the BulkGet API |
Java |
SparkOnHbaseScalaExample-HBaseBulkGetExample |
Spark on HBase-Using the BulkGet API |
Scala |
SparkOnHbasePythonExample-HBaseBulkGetExample |
Spark on HBase-Using the BulkGet API |
Python |
SparkOnHbaseJavaExample-JavaHBaseBulkDeleteExample |
Spark on HBase-Using the BulkDelete API |
Java |
SparkOnHbaseScalaExample-HBaseBulkDeleteExample |
Spark on HBase-Using the BulkDelete API |
Scala |
SparkOnHbasePythonExample-HBaseBulkDeleteExample |
Spark on HBase-Using the BulkDelete API |
Python |
SparkOnHbaseJavaExample-JavaHBaseBulkLoadExample |
Spark on HBase-Using the BulkLoad API |
Java |
SparkOnHbaseScalaExample-HBaseBulkLoadExample |
Spark on HBase-Using the BulkLoad API |
Scala |
SparkOnHbasePythonExample-HBaseBulkLoadExample |
Spark on HBase-Using the BulkLoad API |
Python |
SparkOnHbaseJavaExample-JavaHBaseForEachPartitionExample |
Spark on HBase-Using the foreachPartition API |
Java |
SparkOnHbaseScalaExample-HBaseForEachPartitionExample |
Spark on HBase-Using the foreachPartition API |
Scala |
SparkOnHbasePythonExample-HBaseForEachPartitionExample |
Spark on HBase-Using the foreachPartition API |
Python |
SparkOnHbaseJavaExample-JavaHBaseDistributedScanExample |
Spark on HBase-Distributedly Scanning HBase Tables |
Java |
SparkOnHbaseScalaExample-HBaseDistributedScanExample |
Spark on HBase-Distributedly Scanning HBase Tables |
Scala |
SparkOnHbasePythonExample-HBaseDistributedScanExample |
Spark on HBase-Distributedly Scanning HBase Tables |
Python |
SparkOnHbaseJavaExample-JavaHBaseMapPartitionExample |
Spark on HBase-Using the mapPartition API |
Java |
SparkOnHbaseScalaExample-HBaseMapPartitionExample |
Spark on HBase-Using the mapPartition API |
Scala |
SparkOnHbasePythonExample-HBaseMapPartitionExample |
Spark on HBase-Using the mapPartition API |
Python |
SparkOnHbaseJavaExample-JavaHBaseStreamingBulkPutExample |
Spark on HBase-Writing Data to HBase Tables In Batches Using SparkStreaming |
Java |
SparkOnHbaseScalaExample-HBaseStreamingBulkPutExample |
Spark on HBase-Writing Data to HBase Tables In Batches Using SparkStreaming |
Scala |
SparkOnHbasePythonExample-HBaseStreamingBulkPutExample |
Spark on HBase-Writing Data to HBase Tables In Batches Using SparkStreaming |
Python |
SparkHbasetoHbaseJavaExample |
Reading Data from HBase and Write It Back to HBase |
Java |
SparkHbasetoHbaseScalaExample |
Reading Data from HBase and Write It Back to HBase |
Scala |
SparkHbasetoHbasePythonExample |
Reading Data from HBase and Write It Back to HBase |
Python |
SparkHivetoHbaseJavaExample |
Reading Data from Hive and Write It to HBase |
Java |
SparkHivetoHbaseScalaExample |
Reading Data from Hive and Write It to HBase |
Scala |
SparkHivetoHbasePythonExample |
Reading Data from Hive and Write It to HBase |
Python |
SparkStreamingKafka010JavaExample |
Connecting Spark Streaming to Kafka0-10 |
Java |
SparkStreamingKafka010ScalaExample |
Connecting Spark Streaming to Kafka0-10 |
Scala |
SparkStructuredStreamingJavaExample |
Structured Streaming Project |
Java |
SparkStructuredStreamingScalaExample |
Structured Streaming Project |
Scala |
SparkStructuredStreamingPythonExample |
Structured Streaming Project |
Python |
StructuredStreamingADScalaExample |
Structured Streaming Stream Join |
Scala |
StructuredStreamingStateScalaExample |
Structured Streaming Status Operation |
Scala |
SparkOnHudiJavaExample |
Using Spark to Perform Basic Hudi Operations |
Java |
SparkOnHudiPythonExample |
Using Spark to Perform Basic Hudi Operations |
Python |
SparkOnHudiScalaExample |
Using Spark to Perform Basic Hudi Operations |
Scala |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot