2cd1bfa4 [SPARK-4563][CORE] Allow driver to advertise a different network address.
    Marcelo Vanzin authored
    The goal of this feature is to allow the Spark driver to run in an
    isolated environment, such as a docker container, and be able to use
    the host's port forwarding mechanism to be able to accept connections
    from the outside world.
    
    The change is restricted to the driver: there is no support for achieving
    the same thing on executors (or the YARN AM for that matter). Those still
    need full access to the outside world so that, for example, connections
    can be made to an executor's block manager.
    
The core of the change is simple: add a new configuration that
specifies the address the driver should bind to, which can be
different from the address it advertises to executors
(spark.driver.host). Everything else is plumbing the new
configuration to where it's needed.
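
As a sketch only (this message does not name the new key;
spark.driver.bindAddress is an assumption, and the host name is a
placeholder), the driver-side setup would look roughly like:

 // Sketch: bind to the container-local interface while advertising
 // the host's address to executors. spark.driver.bindAddress is an
 // assumed key name; spark.driver.host is the existing setting.
 import org.apache.spark.SparkConf

 val conf = new SparkConf()
   .set("spark.driver.bindAddress", "0.0.0.0")    // address to bind to
   .set("spark.driver.host", "host.example.com")  // address advertised
   .set("spark.driver.port", "38000")             // forwarded port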
    
To use the feature, the host starting the container needs to set up
the driver's ports to fall into a range that is being forwarded; this
required adding a block manager port setting specific to the driver,
which falls back to the existing spark.blockManager.port when not
set. This way, users can modify the driver's ports without affecting
the executors; it would theoretically be nice to also have different
retry counts for driver and executors, but given that docker (at
least) allows forwarding port ranges, we can probably live without
that for now.
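
A minimal sketch of that fallback, assuming a SparkConf-style lookup
(the helper below is hypothetical; the two keys are the ones named
above):

 // Sketch: the driver prefers spark.driver.blockManager.port and
 // falls back to the shared spark.blockManager.port; executors only
 // read the shared key.
 import org.apache.spark.SparkConf

 def driverBlockManagerPort(conf: SparkConf): Int =
   conf.getOption("spark.driver.blockManager.port")
     .orElse(conf.getOption("spark.blockManager.port"))
     .map(_.toInt)
     .getOrElse(0)  // 0 means pick an ephemeral port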
    
Because of the nature of the feature, it's hard to add unit tests; I
just added a simple one to make sure the configuration works.
    
    This was tested with a docker image running spark-shell with the following
    command:
    
     docker blah blah blah \
       -p 38000-38100:38000-38100 \
       [image] \
       spark-shell \
         --num-executors 3 \
         --conf spark.shuffle.service.enabled=false \
         --conf spark.dynamicAllocation.enabled=false \
         --conf spark.driver.host=[host's address] \
         --conf spark.driver.port=38000 \
         --conf spark.driver.blockManager.port=38020 \
         --conf spark.ui.port=38040
    
Ran on YARN; verified that the driver works, that executors start up
and listen on ephemeral ports (instead of using the driver's config),
and that caching and shuffling (without the shuffle service) work.
Clicked through the UI to make sure all pages (including executor
thread dumps) worked. Also tested apps without docker, and ran unit
tests.
    
    Author: Marcelo Vanzin <vanzin@cloudera.com>
    
    Closes #15120 from vanzin/SPARK-4563.