Add a shuffle parameter to coalesce.
This is useful when you want just one output file (part-00000) but still want the upstream RDD to be computed in parallel.
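
A minimal sketch of the intended usage follows. It assumes the new parameter is a Boolean named `shuffle` passed as the second argument to `coalesce`, and it uses the `spark` package name from the file paths in this commit (later releases use `org.apache.spark`). The object name, the local master string, and the sample data are hypothetical.

```scala
import spark.SparkContext

// Hypothetical driver; assumes coalesce(numPartitions, shuffle = ...) as described above.
object CoalesceShuffleSketch {
  def main(args: Array[String]): Unit = {
    // Local master with 4 worker threads (illustrative value).
    val sc = new SparkContext("local[4]", "CoalesceShuffleSketch")
    try {
      // Upstream RDD with 10 partitions.
      val transformed = sc.parallelize(1 to 1000, 10).map(x => x * 2)

      // Without shuffle: the map is pulled into the single coalesced task,
      // so the upstream work is computed by one task.
      val narrow = transformed.coalesce(1)

      // With shuffle: the map still runs as 10 parallel tasks, and a
      // shuffle stage then gathers the results into one partition.
      val shuffled = transformed.coalesce(1, shuffle = true)

      // glom() turns each partition into an array, so the length of the
      // collected result is the number of partitions.
      println("narrow partitions:   " + narrow.glom().collect().length)
      println("shuffled partitions: " + shuffled.glom().collect().length)
    } finally {
      sc.stop()
    }
  }
}
```

Calling `saveAsTextFile` on the shuffled RDD would then write a single part file. The cost is the extra shuffle, so this is mainly worthwhile when the upstream computation is expensive enough to benefit from parallelism.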
Showing
- core/src/main/scala/spark/RDD.scala: 9 additions, 1 deletion
- core/src/main/scala/spark/api/java/JavaDoubleRDD.scala: 6 additions, 0 deletions
- core/src/main/scala/spark/api/java/JavaPairRDD.scala: 7 additions, 1 deletion
- core/src/main/scala/spark/api/java/JavaRDD.scala: 6 additions, 0 deletions
- core/src/test/scala/spark/RDDSuite.scala: 5 additions, 1 deletion