    [SPARK-1672][MLLIB] Separate user and product partitioning in ALS · d9203350
    Tor Myklebust authored
    Some cleanup work following #593.
    
    1. Allow setting different numbers of user blocks and product blocks in `ALS`.
    2. Update `MovieLensALS` to reflect the change.
    
    Author: Tor Myklebust <tmyklebu@gmail.com>
    Author: Xiangrui Meng <meng@databricks.com>
    
    Closes #1014 from mengxr/SPARK-1672 and squashes the following commits:
    
    0e910dd [Xiangrui Meng] change private[this] to private[recommendation]
    36420c7 [Xiangrui Meng] set exclusion rules for ALS
    9128b77 [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into SPARK-1672
    294efe9 [Xiangrui Meng] Merge remote-tracking branch 'apache/master' into SPARK-1672
    9bab77b [Xiangrui Meng] clean up add numUserBlocks and numProductBlocks to MovieLensALS
    84c8e8c [Xiangrui Meng] Merge branch 'master' into SPARK-1672
    d17a8bf [Xiangrui Meng] merge master
    a4925fd [Tor Myklebust] Style.
    bd8a75c [Tor Myklebust] Merge branch 'master' of github.com:apache/spark into alsseppar
    021f54b [Tor Myklebust] Separate user and product blocks.
    dcf583a [Tor Myklebust] Remove the partitioner member variable; instead, thread that needle everywhere it needs to go.
    23d6f91 [Tor Myklebust] Stop making the partitioner configurable.
    495784f [Tor Myklebust] Merge branch 'master' of https://github.com/apache/spark
    674933a [Tor Myklebust] Fix style.
    40edc23 [Tor Myklebust] Fix missing space.
    f841345 [Tor Myklebust] Fix daft bug creating 'pairs', also for -> foreach.
    5ec9e6c [Tor Myklebust] Clean a couple of things up using 'map'.
    36a0f43 [Tor Myklebust] Make the partitioner private.
    d872b09 [Tor Myklebust] Add negative id ALS test.
    df27697 [Tor Myklebust] Support custom partitioners.  Currently we use the same partitioner for users and products.
    c90b6d8 [Tor Myklebust] Scramble user and product ids before bucketing.
    c774d7d [Tor Myklebust] Make the partitioner a member variable and use it instead of modding directly.
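
The two ideas named in the commits above, scrambling ids before bucketing and using separate block counts for users and products, can be sketched in a few lines. This is only an illustration in plain Python, not the code from this change: the names `scramble`, `block_of`, and `partition_ratings` are hypothetical, and the multiplicative mixing step is an assumed stand-in, since the commit message does not say which hash the partitioner uses.

```python
from typing import List, Tuple

Rating = Tuple[int, int, float]  # (user id, product id, rating)


def scramble(id_: int) -> int:
    """Mix the bits of a 32-bit id (assumed hash, chosen for illustration).

    Masking first maps negative ids to non-negative values, so the
    modulo in block_of never sees a negative number.
    """
    x = id_ & 0xFFFFFFFF
    x = (x * 0x9E3779B1) & 0xFFFFFFFF  # multiplicative mixing step
    x ^= x >> 16                       # fold high bits into low bits
    return x


def block_of(id_: int, num_blocks: int) -> int:
    """Return the block index in [0, num_blocks) for an id."""
    return scramble(id_) % num_blocks


def partition_ratings(
    ratings: List[Rating], num_user_blocks: int, num_product_blocks: int
) -> Tuple[List[List[Rating]], List[List[Rating]]]:
    """Bucket ratings twice: once by user and once by product.

    The two block counts are independent, which is the point of
    separating user and product partitioning.
    """
    user_blocks: List[List[Rating]] = [[] for _ in range(num_user_blocks)]
    product_blocks: List[List[Rating]] = [[] for _ in range(num_product_blocks)]
    for rating in ratings:
        user, product, _ = rating
        user_blocks[block_of(user, num_user_blocks)].append(rating)
        product_blocks[block_of(product, num_product_blocks)].append(rating)
    return user_blocks, product_blocks


if __name__ == "__main__":
    sample = [(-3, 10, 4.0), (0, 11, 1.5), (7, -2, 3.0), (8, 10, 5.0)]
    by_user, by_product = partition_ratings(sample, 3, 2)
    print([len(b) for b in by_user], [len(b) for b in by_product])
```

Keeping the two block counts independent lets a dataset with many more users than products (or the reverse) be split unevenly on each side, and bucketing on the scrambled id rather than the raw id avoids skew when ids share a common stride or are negative.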