  1. Nov 08, 2016
  2. Jul 15, 2016
    • [SPARK-14817][ML][MLLIB][DOC] Made DataFrame-based API primary in MLlib guide · 5ffd5d38
      Joseph K. Bradley authored
      ## What changes were proposed in this pull request?
      
      Made DataFrame-based API primary
      * Spark doc menu bar and other places now link to ml-guide.html, not mllib-guide.html
      * mllib-guide.html keeps RDD-specific list of features, with a link at the top redirecting people to ml-guide.html
      * ml-guide.html includes a "maintenance mode" announcement about the RDD-based API
        * **Reviewers: please check this carefully**
      * (minor) Titles for DF API no longer include "- spark.ml" suffix.  Titles for RDD API have "- RDD-based API" suffix
      * Moved migration guide to ml-guide from mllib-guide
        * Also moved past guides from mllib-migration-guides to ml-migration-guides, with a redirect link on mllib-migration-guides
        * **Reviewers**: I did not change any of the content of the migration guides.
      
      Reorganized DataFrame-based guide:
      * ml-guide.html mimics the old mllib-guide.html page in terms of content: overview, migration guide, etc.
      * Moved Pipeline description into ml-pipeline.html and moved tuning into ml-tuning.html
        * **Reviewers**: I did not change the content of these guides, except some intro text.
      * Sidebar remains the same, but with pipeline and tuning sections added
      
      Other:
      * ml-classification-regression.html: Moved text about linear methods to new section in page
      
      ## How was this patch tested?
      
      Generated docs locally
      
      Author: Joseph K. Bradley <joseph@databricks.com>
      
      Closes #14213 from jkbradley/ml-guide-2.0.
      5ffd5d38
  3. May 20, 2016
    • [SPARK-15394][ML][DOCS] User guide typos and grammar audit · 5e203505
      sethah authored
      ## What changes were proposed in this pull request?
      
      Correct some typos and incorrectly worded sentences.
      
      ## How was this patch tested?
      
      Doc changes only.
      
      Note that many of these changes were identified by whomfire01
      
      Author: sethah <seth.hendrickson16@gmail.com>
      
      Closes #13180 from sethah/ml_guide_audit.
      5e203505
  4. May 17, 2016
    • [SPARK-14434][ML] User guide doc and examples for GaussianMixture in spark.ml · 4134ff0c
      wm624@hotmail.com authored
      ## What changes were proposed in this pull request?
      
      Add guide doc and examples for GaussianMixture in spark.ml in Java, Scala and Python.
      
      ## How was this patch tested?
      
      Manually compiled and tested all examples.
      
      Author: wm624@hotmail.com <wm624@hotmail.com>
      
      Closes #12788 from wangmiao1981/example.
      4134ff0c
  5. May 16, 2016
  6. May 11, 2016
    • [SPARK-15150][EXAMPLE][DOC] Update LDA examples · d88afabd
      Zheng RuiFeng authored
      ## What changes were proposed in this pull request?
      1. Create a libsvm-format dataset for LDA: `data/mllib/sample_lda_libsvm_data.txt`
      2. Add a Python example
      3. Read the data file directly in the examples
      4. Also switch to `SparkSession` in `aft_survival_regression.py`
      
      ## How was this patch tested?
      manual tests
      `./bin/spark-submit examples/src/main/python/ml/lda_example.py`
      
      Author: Zheng RuiFeng <ruifengz@foxmail.com>
      
      Closes #12927 from zhengruifeng/lda_pe.
      d88afabd
    • [SPARK-15149][EXAMPLE][DOC] update kmeans example · 8beae591
      Zheng RuiFeng authored
      ## What changes were proposed in this pull request?
      A Python example for ml.kmeans already exists but is not included in the user guide.
      1. Small changes, e.g. `example_on` `example_off`
      2. Add it to the user guide
      3. Update the examples to read the data file directly
      
      ## How was this patch tested?
      manual tests
      `./bin/spark-submit examples/src/main/python/ml/kmeans_example.py`
      
      Author: Zheng RuiFeng <ruifengz@foxmail.com>
      
      Closes #12925 from zhengruifeng/km_pe.
      8beae591
    • [SPARK-14340][EXAMPLE][DOC] Update Examples and User Guide for ml.BisectingKMeans · cef73b56
      Zheng RuiFeng authored
      ## What changes were proposed in this pull request?
      
      1. Add BisectingKMeans to ml-clustering.md
      2. Add the missing Scala BisectingKMeansExample
      3. Create a new data file `data/mllib/sample_kmeans_data.txt`
      
      ## How was this patch tested?
      
      manual tests
      
      Author: Zheng RuiFeng <ruifengz@foxmail.com>
      
      Closes #11844 from zhengruifeng/doc_bkm.
      cef73b56
  7. Dec 16, 2015
  8. Dec 10, 2015
    • [SPARK-12212][ML][DOC] Clarifies the difference between spark.ml, spark.mllib and mllib in the documentation. · 2ecbe02d
      Timothy Hunter authored
      
      Replaces a number of occurrences of `MLlib` in the documentation that were meant to refer to the `spark.mllib` package instead. This should clarify for new users the difference between `spark.mllib` (the package) and MLlib (the umbrella project for ML in Spark).
      
      It also removes some files that I forgot to delete with #10207
      
      Author: Timothy Hunter <timhunter@databricks.com>
      
      Closes #10234 from thunterdb/12212.
      2ecbe02d
  9. Dec 08, 2015
  10. Nov 30, 2015
  11. Nov 20, 2015