Skip to content
Snippets Groups Projects
  1. Dec 10, 2013
  2. Dec 09, 2013
  3. Dec 07, 2013
  4. Dec 06, 2013
  5. Dec 02, 2013
  6. Dec 01, 2013
  7. Nov 29, 2013
  8. Nov 28, 2013
  9. Nov 27, 2013
  10. Nov 26, 2013
  11. Nov 25, 2013
    • Holden Karau's avatar
      Fix the test · 7222ee29
      Holden Karau authored
      7222ee29
    • Matei Zaharia's avatar
      Merge pull request #204 from rxin/hash · 0e2109dd
      Matei Zaharia authored
      OpenHashSet fixes
      
      Incorporated ideas from pull request #200.
      - Use Murmur Hash 3 finalization step to scramble the bits of HashCode
        instead of the simpler version in java.util.HashMap; the latter one
        had trouble with ranges of consecutive integers. Murmur Hash 3 is used
        by fastutil.
      - Don't check keys for equality when re-inserting due to growing the
        table; the keys will already be unique.
      - Remember the grow threshold instead of recomputing it on each insert
      
      Also added unit tests for size estimation for specialized hash sets and maps.
      0e2109dd
    • Matei Zaharia's avatar
      Merge pull request #206 from ash211/patch-2 · c46067f0
      Matei Zaharia authored
      Update tuning.md
      
      Clarify when serializer is used based on recent user@ mailing list discussion.
      c46067f0
    • Matei Zaharia's avatar
      Merge pull request #201 from rxin/mappartitions · 14bb465b
      Matei Zaharia authored
      Use the proper partition index in mapPartitionsWIthIndex
      
      mapPartitionsWithIndex uses TaskContext.partitionId as the partition index. TaskContext.partitionId used to be identical to the partition index in a RDD. However, pull request #186 introduced a scenario (with partition pruning) that the two can be different. This pull request uses the right partition index in all mapPartitionsWithIndex related calls.
      
      Also removed the extra MapPartitionsWIthContextRDD and put all the mapPartitions related functionality in MapPartitionsRDD.
      14bb465b
    • Andrew Ash's avatar
      Update tuning.md · 08afef37
      Andrew Ash authored
      Clarify when serializer is used based on recent user@ mailing list discussion.
      08afef37
    • Matei Zaharia's avatar
      Merge pull request #101 from colorant/yarn-client-scheduler · eb4296c8
      Matei Zaharia authored
      For SPARK-527, Support spark-shell when running on YARN
      
      sync to trunk and resubmit here
      
      In current YARN mode approaching, the application is run in the Application Master as a user program thus the whole spark context is on remote.
      
      This approaching won't support application that involve local interaction and need to be run on where it is launched.
      
      So In this pull request I have a YarnClientClusterScheduler and backend added.
      
      With this scheduler, the user application is launched locally,While the executor will be launched by YARN on remote nodes with a thin AM which only launch the executor and monitor the Driver Actor status, so that when client app is done, it can finish the YARN Application as well.
      
      This enables spark-shell to run upon YARN.
      
      This also enable other Spark applications to have the spark context to run locally with a master-url "yarn-client". Thus e.g. SparkPi could have the result output locally on console instead of output in the log of the remote machine where AM is running on.
      
      Docs also updated to show how to use this yarn-client mode.
      eb4296c8
    • Prashant Sharma's avatar
      Merge branch 'master' into scala-2.10-wip · 44fd30d3
      Prashant Sharma authored
      Conflicts:
      	core/src/main/scala/org/apache/spark/rdd/RDD.scala
      	project/SparkBuild.scala
      44fd30d3
    • Prashant Sharma's avatar
    • Reynold Xin's avatar
      Incorporated ideas from pull request #200. · 466fd064
      Reynold Xin authored
      - Use Murmur Hash 3 finalization step to scramble the bits of HashCode
        instead of the simpler version in java.util.HashMap; the latter one
        had trouble with ranges of consecutive integers. Murmur Hash 3 is used
        by fastutil.
      
      - Don't check keys for equality when re-inserting due to growing the
        table; the keys will already be unique
      
      - Remember the grow threshold instead of recomputing it on each insert
      466fd064
    • Reynold Xin's avatar
Loading