Merge pull request #201 from rxin/mappartitions
Use the proper partition index in mapPartitionsWithIndex

mapPartitionsWithIndex uses TaskContext.partitionId as the partition index. TaskContext.partitionId used to be identical to the partition index in an RDD. However, pull request #186 introduced a scenario (partition pruning) in which the two can differ. This pull request uses the right partition index in all mapPartitionsWithIndex-related calls.

It also removes the extra MapPartitionsWithContextRDD and puts all the mapPartitions-related functionality in MapPartitionsRDD.
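The mismatch described above can be illustrated with a small, self-contained sketch. This is a toy model, not Spark's code: `Partition`, `TaskContext`, and `PartitionIndexSketch` are hypothetical stand-ins defined here, and the sketch assumes that a task over a pruned dataset carries the renumbered (pruned) index in its context while the parent partition keeps its original index.

```scala
// Toy model only: these types are hypothetical stand-ins, not Spark's classes.
final case class Partition(index: Int)
final case class TaskContext(partitionId: Int)

object PartitionIndexSketch {
  // Parent dataset with four partitions, indexed 0 to 3.
  val parentPartitions: Seq[Partition] = (0 until 4).map(Partition(_))

  // Pruning keeps parent partitions 2 and 3.
  val kept: Seq[Partition] = parentPartitions.filter(_.index >= 2)

  // Assumption: one task per surviving partition, with the context
  // numbered by position in the pruned dataset (0, 1), not by parent index.
  val tasks: Seq[(TaskContext, Partition)] =
    kept.zipWithIndex.map { case (parent, prunedIdx) =>
      (TaskContext(prunedIdx), parent)
    }

  // Index as seen through the task context: reflects the pruned numbering.
  val fromContext: Seq[Int] = tasks.map { case (ctx, _) => ctx.partitionId }

  // Index read from the partition itself: reflects the parent's numbering.
  val fromSplit: Seq[Int] = tasks.map { case (_, split) => split.index }

  def main(args: Array[String]): Unit = {
    assert(fromContext == Seq(0, 1))
    assert(fromSplit == Seq(2, 3))
    println(s"index taken from task context: $fromContext")
    println(s"index taken from partition:    $fromSplit")
  }
}
```

In this model, a function applied to the parent's partitions would be handed indices 0 and 1 if it read the task context, but 2 and 3 if it read the partition's own index, which is the distinction the commit message draws.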
Changed files:
- core/src/main/scala/org/apache/spark/rdd/MapPartitionsRDD.scala (4 additions, 6 deletions)
- core/src/main/scala/org/apache/spark/rdd/MapPartitionsWithContextRDD.scala (0 additions, 41 deletions)
- core/src/main/scala/org/apache/spark/rdd/RDD.scala (18 additions, 21 deletions)
- core/src/test/scala/org/apache/spark/CheckpointSuite.scala (0 additions, 2 deletions)