spark(3)-wordcount原理解析

1. WordCount Examples详解

1.1 Word Count流程示意图


JavaRDD<String> textFile = sc.textFile("hdfs://...");
JavaPairRDD<String, Integer> counts = textFile
    .flatMap(s -> Arrays.asList(s.split(" ")).iterator())
    .mapToPair(word -> new Tuple2<>(word, 1))
    .reduceByKey((a, b) -> a + b);
counts.saveAsTextFile("hdfs://...");


spark(3)-wordcount原理解析_第1张图片
spark(3)-wordcount原理解析_第2张图片

你可能感兴趣的:(spark)