PaperReading20200424

CanChen


 

MapReduce

[Figure 1]
The MapReduce library in the user program first splits the input files into M pieces and then starts copies of the program on the machines in the cluster. One copy is special and is called the master; the others are workers. The master assigns the M map tasks to workers, and each of these workers processes its piece of the input and writes the results to its local disk. The written data is partitioned into R regions, and each worker that receives a reduce task remotely reads its region of the buffered data from the map workers' disks. The outputs of the R reduce tasks are then written out as the final result. In my opinion, these R outputs should still be reduced further into a single, real final result.
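The flow described above can be sketched in a single process. This is only a minimal illustration, not the paper's implementation: word count is an assumed example job, the function names (`split_input`, `map_task`, `partition`, `reduce_task`, `run_mapreduce`, `merge_outputs`) are hypothetical, and partitioning by `zlib.crc32(key) % R` is an assumption standing in for whatever partitioning function a real system uses. There is no master, no network, and no disk here; lists play the role of the local and remote files.

```python
import zlib
from collections import defaultdict


def split_input(lines, m):
    """Split the input into at most m roughly equal parts, one per map task."""
    size = max(1, -(-len(lines) // m))  # ceiling division
    return [lines[i:i + size] for i in range(0, len(lines), size)]


def map_task(part):
    """Assumed example job (word count): emit (word, 1) for every word."""
    return [(word, 1) for line in part for word in line.split()]


def partition(pairs, r):
    """Split one map task's output into r regions by a hash of the key."""
    regions = [[] for _ in range(r)]
    for key, value in pairs:
        regions[zlib.crc32(key.encode("utf-8")) % r].append((key, value))
    return regions


def reduce_task(pairs):
    """Sum the values of each key in one region."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(sorted(totals.items()))


def run_mapreduce(lines, m, r):
    """Run the map tasks, then r reduce tasks; return the r separate outputs."""
    map_outputs = [partition(map_task(part), r) for part in split_input(lines, m)]
    # Reduce task j reads region j from the output of every map task.
    return [
        reduce_task([pair for regions in map_outputs for pair in regions[j]])
        for j in range(r)
    ]


def merge_outputs(outputs):
    """Combine the r outputs into one result (the extra step suggested above)."""
    merged = {}
    for output in outputs:
        merged.update(output)  # keys are disjoint across regions
    return merged


if __name__ == "__main__":
    outputs = run_mapreduce(["a b a", "b c", "a"], m=2, r=2)
    print(outputs)
    print(merge_outputs(outputs))
```

Because every key is hashed to exactly one region, the R outputs in this sketch never share a key, so combining them is a plain union rather than another reduction. Whether that extra merge is worth doing depends on how the outputs are consumed afterwards.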

 
