Use reduceByKey instead of groupByKey

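groupByKey shuffles every key-value pair across the network before any aggregation happens, which hurts performance on large datasets. reduceByKey first combines the values for each key within each partition and shuffles only those partial results, so far less data is moved. Note that the function passed to reduceByKey must be associative and commutative.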
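To make the difference concrete, here is a minimal pure-Python sketch that imitates the two strategies and counts how many records would cross the shuffle. It is not Spark code: the `partitions` data and the `group_by_key_sim` / `reduce_by_key_sim` helpers are hypothetical names made up for illustration, and the model ignores details such as serialization and partitioning.

```python
from collections import defaultdict

# Hypothetical input: two partitions of (word, 1) pairs, as in a word count.
partitions = [
    [("a", 1), ("b", 1), ("a", 1), ("a", 1)],
    [("b", 1), ("a", 1), ("b", 1)],
]


def group_by_key_sim(parts):
    """Imitate groupByKey: every record crosses the shuffle unchanged."""
    shuffled = [pair for part in parts for pair in part]
    grouped = defaultdict(list)
    for key, value in shuffled:
        grouped[key].append(value)
    # Aggregation only happens after all values have been moved.
    totals = {key: sum(values) for key, values in grouped.items()}
    return totals, len(shuffled)


def reduce_by_key_sim(parts, func):
    """Imitate reduceByKey: combine inside each partition, then shuffle."""
    shuffled = []
    for part in parts:
        local = {}
        for key, value in part:
            local[key] = func(local[key], value) if key in local else value
        # Only one partial result per key leaves this partition.
        shuffled.extend(local.items())
    totals = {}
    for key, value in shuffled:
        totals[key] = func(totals[key], value) if key in totals else value
    return totals, len(shuffled)


grouped_totals, grouped_shuffled = group_by_key_sim(partitions)
reduced_totals, reduced_shuffled = reduce_by_key_sim(partitions, lambda x, y: x + y)

print(grouped_totals, grouped_shuffled)
print(reduced_totals, reduced_shuffled)
```

Both strategies give the same totals, but the second one moves fewer records. In PySpark the corresponding calls would be `rdd.groupByKey().mapValues(sum)` and `rdd.reduceByKey(lambda x, y: x + y)`.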

https://blog.csdn.net/qq_17685725/article/details/123033552

http://example.com/2022/03/04/reduceByKey/

Author

Lavine Hu

Posted on

2022-03-04

Updated on

2022-03-04
