Spark Core Sample Projects (Scala)
Function
Collects the information of female netizens who spend more than 2 hours in online shopping on the weekend from the log files.
Sample Code
The following code segment is only an example. For details, see the com.huawei.bigdata.spark.examples.FemaleInfoCollection class.
Example: CollectMapper class
val spark = SparkSession .builder() .appName("CollectFemaleInfo") .config("spark.some.config.option", "some-value") .getOrCreate() // Read data. This code indicates the data path that the input parameter args(0) specifies. val text = spark.sparkContext.textFile(args(0)) // Filter the data information about the time that female netizens spend online. val data = text.filter(_.contains("female")) // Aggregate the time that each female netizen spends online. val femaleData:RDD[(String,Int)] = data.map{line => val t= line.split(',') (t(0),t(2).toInt) }.reduceByKey(_ + _) // Filter the information about female netizens who spend more than 2 hours online, and export the results. val result = femaleData.filter(line => line._2 > 120) result.collect().map(x => x._1 + ',' + x._2).foreach(println) spark.stop()
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot