Spark Programming Guide(四)

Shuffle operations Certain operations within Spark trigger an event known as the shuffle. The shuffle is Spark’s mechanism for re-distributing data so that it’s grouped differently across partitions.
相关文章
相关标签/搜索