Scala Sample Code
Function
Collects the information of female netizens who spend more than 2 hours in online shopping on the weekend from the log files.
Sample Code
The following code segment is only an example. For details, see the com.huawei.bigdata.spark.examples.FemaleInfoCollection class.
Example: CollectMapper class
val spark = SparkSession .builder() .appName("CollectFemaleInfo") .config("spark.some.config.option", "some-value") .getOrCreate() // Read data. This code indicates the data path that the input parameter args(0) specifies. val text = spark.sparkContext.textFile(args(0)) // Filter the data information about the time that female netizens spend online. val data = text.filter(_.contains("female")) // Aggregate the time that each female netizen spends online. val femaleData:RDD[(String,Int)] = data.map{line => val t= line.split(',') (t(0),t(2).toInt) }.reduceByKey(_ + _) // Filter the information about female netizens who spend more than 2 hours online, and export the results. val result = femaleData.filter(line => line._2 > 120) result.collect().map(x => x._1 + ',' + x._2).foreach(println) spark.stop()
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.