Skip to content
  • Manish Amde's avatar
    f269b016
    SPARK-1544 Add support for deep decision trees. · f269b016
    Manish Amde authored
    @etrain and I came with a PR for arbitrarily deep decision trees at the cost of multiple passes over the data at deep tree levels.
    
    To summarize:
    1) We take a parameter that indicates the amount of memory users want to reserve for computation on each worker (and 2x that at the driver).
    2) Using that information, we calculate two things - the maximum depth to which we train as usual (which is, implicitly, the maximum number of nodes we want to train in parallel), and the size of the groups we should use in the case where we exceed this depth.
    
    cc: @atalwalkar, @hirakendu, @mengxr
    
    Author: Manish Amde <manish9ue@gmail.com>
    Author: manishamde <manish9ue@gmail.com>
    Author: Evan Sparks <sparks@cs.berkeley.edu>
    
    Closes #475 from manishamde/deep_tree and squashes the following commits:
    
    968ca9d [Manish Amde] merged master
    7fc9545 [Manish Amde] added docs
    ce004a1 [Manish Amde] minor formatting
    b27ad2c [Manish Amde] formatting
    426bb28 [Manish Amde] programming guide blurb
    8053fed [Manish Amde] more formatting
    5eca9e4 [Manish Amde] grammar
    4731cda [Manish Amde] formatting
    5e82202 [Manish Amde] added documentation, fixed off by 1 error in max level calculation
    cbd9f14 [Manish Amde] modified scala.math to math
    dad9652 [Manish Amde] removed unused imports
    e0426ee [Manish Amde] renamed parameter
    718506b [Manish Amde] added unit test
    1517155 [Manish Amde] updated documentation
    9dbdabe [Manish Amde] merge from master
    719d009 [Manish Amde] updating user documentation
    fecf89a [manishamde] Merge pull request #6 from etrain/deep_tree
    0287772 [Evan Sparks] Fixing scalastyle issue.
    2f1e093 [Manish Amde] minor: added doc for maxMemory parameter
    2f6072c [manishamde] Merge pull request #5 from etrain/deep_tree
    abc5a23 [Evan Sparks] Parameterizing max memory.
    50b143a [Manish Amde] adding support for very deep trees
    f269b016
    SPARK-1544 Add support for deep decision trees.
    Manish Amde authored
    @etrain and I came with a PR for arbitrarily deep decision trees at the cost of multiple passes over the data at deep tree levels.
    
    To summarize:
    1) We take a parameter that indicates the amount of memory users want to reserve for computation on each worker (and 2x that at the driver).
    2) Using that information, we calculate two things - the maximum depth to which we train as usual (which is, implicitly, the maximum number of nodes we want to train in parallel), and the size of the groups we should use in the case where we exceed this depth.
    
    cc: @atalwalkar, @hirakendu, @mengxr
    
    Author: Manish Amde <manish9ue@gmail.com>
    Author: manishamde <manish9ue@gmail.com>
    Author: Evan Sparks <sparks@cs.berkeley.edu>
    
    Closes #475 from manishamde/deep_tree and squashes the following commits:
    
    968ca9d [Manish Amde] merged master
    7fc9545 [Manish Amde] added docs
    ce004a1 [Manish Amde] minor formatting
    b27ad2c [Manish Amde] formatting
    426bb28 [Manish Amde] programming guide blurb
    8053fed [Manish Amde] more formatting
    5eca9e4 [Manish Amde] grammar
    4731cda [Manish Amde] formatting
    5e82202 [Manish Amde] added documentation, fixed off by 1 error in max level calculation
    cbd9f14 [Manish Amde] modified scala.math to math
    dad9652 [Manish Amde] removed unused imports
    e0426ee [Manish Amde] renamed parameter
    718506b [Manish Amde] added unit test
    1517155 [Manish Amde] updated documentation
    9dbdabe [Manish Amde] merge from master
    719d009 [Manish Amde] updating user documentation
    fecf89a [manishamde] Merge pull request #6 from etrain/deep_tree
    0287772 [Evan Sparks] Fixing scalastyle issue.
    2f1e093 [Manish Amde] minor: added doc for maxMemory parameter
    2f6072c [manishamde] Merge pull request #5 from etrain/deep_tree
    abc5a23 [Evan Sparks] Parameterizing max memory.
    50b143a [Manish Amde] adding support for very deep trees
Loading