Skip to content
Snippets Groups Projects
  1. Feb 06, 2014
    • Prashant Sharma's avatar
      Merge pull request #498 from ScrapCodes/python-api. Closes #498. · 084839ba
      Prashant Sharma authored
      Python api additions
      
      Author: Prashant Sharma <prashant.s@imaginea.com>
      
      == Merge branch commits ==
      
      commit 8b51591f1a7a79a62c13ee66ff8d83040f7eccd8
      Author: Prashant Sharma <prashant.s@imaginea.com>
      Date:   Fri Jan 24 11:50:29 2014 +0530
      
          Josh's and Patricks review comments.
      
      commit d37f9677838e43bef6c18ef61fbf08055ba6d1ca
      Author: Prashant Sharma <prashant.s@imaginea.com>
      Date:   Thu Jan 23 17:27:17 2014 +0530
      
          fixed doc tests
      
      commit 27cb54bf5c99b1ea38a73858c291d0a1c43d8b7c
      Author: Prashant Sharma <prashant.s@imaginea.com>
      Date:   Thu Jan 23 16:48:43 2014 +0530
      
          Added keys and values methods for PairFunctions in python
      
      commit 4ce76b396fbaefef2386d7a36d611572bdef9b5d
      Author: Prashant Sharma <prashant.s@imaginea.com>
      Date:   Thu Jan 23 13:51:26 2014 +0530
      
          Added foreachPartition
      
      commit 05f05341a187cba829ac0e6c2bdf30be49948c89
      Author: Prashant Sharma <prashant.s@imaginea.com>
      Date:   Thu Jan 23 13:02:59 2014 +0530
      
          Added coalesce fucntion to python API
      
      commit 6568d2c2fa14845dc56322c0f39ba2e13b3b26dd
      Author: Prashant Sharma <prashant.s@imaginea.com>
      Date:   Thu Jan 23 12:52:44 2014 +0530
      
          added repartition function to python API.
      084839ba
  2. Jan 28, 2014
    • Josh Rosen's avatar
      Switch from MUTF8 to UTF8 in PySpark serializers. · 1381fc72
      Josh Rosen authored
      This fixes SPARK-1043, a bug introduced in 0.9.0
      where PySpark couldn't serialize strings > 64kB.
      
      This fix was written by @tyro89 and @bouk in #512.
      This commit squashes and rebases their pull request
      in order to fix some merge conflicts.
      1381fc72
  3. Jan 23, 2014
  4. Jan 18, 2014
  5. Jan 14, 2014
  6. Jan 13, 2014
  7. Jan 12, 2014
    • Matei Zaharia's avatar
      Log Python exceptions to stderr as well · 5741078c
      Matei Zaharia authored
      This helps in case the exception happened while serializing a record to
      be sent to Java, leaving the stream to Java in an inconsistent state
      where PythonRDD won't be able to read the error.
      5741078c
    • Matei Zaharia's avatar
      Update some Python MLlib parameters to use camelCase, and tweak docs · 4c28a2ba
      Matei Zaharia authored
      We've used camel case in other Spark methods so it felt reasonable to
      keep using it here and make the code match Scala/Java as much as
      possible. Note that parameter names matter in Python because it allows
      passing optional parameters by name.
      4c28a2ba
    • Matei Zaharia's avatar
      Add Naive Bayes to Python MLlib, and some API fixes · 9a0dfdf8
      Matei Zaharia authored
      - Added a Python wrapper for Naive Bayes
      - Updated the Scala Naive Bayes to match the style of our other
        algorithms better and in particular make it easier to call from Java
        (added builder pattern, removed default value in train method)
      - Updated Python MLlib functions to not require a SparkContext; we can
        get that from the RDD the user gives
      - Added a toString method in LabeledPoint
      - Made the Python MLlib tests run as part of run-tests as well (before
        they could only be run individually through each file)
      9a0dfdf8
  8. Jan 06, 2014
  9. Jan 04, 2014
  10. Jan 03, 2014
  11. Jan 02, 2014
  12. Jan 01, 2014
  13. Dec 30, 2013
  14. Dec 29, 2013
  15. Dec 28, 2013
  16. Dec 25, 2013
  17. Dec 24, 2013
  18. Dec 22, 2013
  19. Dec 21, 2013
Loading