Skip to content
Snippets Groups Projects
  • Andrew Or's avatar
    09f7e458
    [SPARK-2157] Enable tight firewall rules for Spark · 09f7e458
    Andrew Or authored
    The goal of this PR is to allow users of Spark to write tight firewall rules for their clusters. This is currently not possible because Spark uses random ports in many places, notably the communication between executors and drivers. The changes in this PR are based on top of ash211's changes in #1107.
    
    The list covered here may or may not be the complete set of port needed for Spark to operate perfectly. However, as of the latest commit there are no known sources of random ports (except in tests). I have not documented a few of the more obscure configs.
    
    My spark-env.sh looks like this:
    ```
    export SPARK_MASTER_PORT=6060
    export SPARK_WORKER_PORT=7070
    export SPARK_MASTER_WEBUI_PORT=9090
    export SPARK_WORKER_WEBUI_PORT=9091
    ```
    and my spark-defaults.conf looks like this:
    ```
    spark.master spark://andrews-mbp:6060
    spark.driver.port 5001
    spark.fileserver.port 5011
    spark.broadcast.port 5021
    spark.replClassServer.port 5031
    spark.blockManager.port 5041
    spark.executor.port 5051
    ```
    
    Author: Andrew Or <andrewor14@gmail.com>
    Author: Andrew Ash <andrew@andrewash.com>
    
    Closes #1777 from andrewor14/configure-ports and squashes the following commits:
    
    621267b [Andrew Or] Merge branch 'master' of github.com:apache/spark into configure-ports
    8a6b820 [Andrew Or] Use a random UI port during tests
    7da0493 [Andrew Or] Fix tests
    523c30e [Andrew Or] Add test for isBindCollision
    b97b02a [Andrew Or] Minor fixes
    c22ad00 [Andrew Or] Merge branch 'master' of github.com:apache/spark into configure-ports
    93d359f [Andrew Or] Executors connect to wrong port when collision occurs
    d502e5f [Andrew Or] Handle port collisions when creating Akka systems
    a2dd05c [Andrew Or] Patrick's comment nit
    86461e2 [Andrew Or] Remove spark.executor.env.port and spark.standalone.client.port
    1d2d5c6 [Andrew Or] Fix ports for standalone cluster mode
    cb3be88 [Andrew Or] Various doc fixes (broken link, format etc.)
    e837cde [Andrew Or] Remove outdated TODOs
    bfbab28 [Andrew Or] Merge branch 'master' of github.com:apache/spark into configure-ports
    de1b207 [Andrew Or] Update docs to reflect new ports
    b565079 [Andrew Or] Add spark.ports.maxRetries
    2551eb2 [Andrew Or] Remove spark.worker.watcher.port
    151327a [Andrew Or] Merge branch 'master' of github.com:apache/spark into configure-ports
    9868358 [Andrew Or] Add a few miscellaneous ports
    6016e77 [Andrew Or] Add spark.executor.port
    8d836e6 [Andrew Or] Also document SPARK_{MASTER/WORKER}_WEBUI_PORT
    4d9e6f3 [Andrew Or] Fix super subtle bug
    3f8e51b [Andrew Or] Correct erroneous docs...
    e111d08 [Andrew Or] Add names for UI services
    470f38c [Andrew Or] Special case non-"Address already in use" exceptions
    1d7e408 [Andrew Or] Treat 0 ports specially + return correct ConnectionManager port
    ba32280 [Andrew Or] Minor fixes
    6b550b0 [Andrew Or] Assorted fixes
    73fbe89 [Andrew Or] Move start service logic to Utils
    ec676f4 [Andrew Or] Merge branch 'SPARK-2157' of github.com:ash211/spark into configure-ports
    038a579 [Andrew Ash] Trust the server start function to report the port the service started on
    7c5bdc4 [Andrew Ash] Fix style issue
    0347aef [Andrew Ash] Unify port fallback logic to a single place
    24a4c32 [Andrew Ash] Remove type on val to match surrounding style
    9e4ad96 [Andrew Ash] Reformat for style checker
    5d84e0e [Andrew Ash] Document new port configuration options
    066dc7a [Andrew Ash] Fix up HttpServer port increments
    cad16da [Andrew Ash] Add fallover increment logic for HttpServer
    c5a0568 [Andrew Ash] Fix ConnectionManager to retry with increment
    b80d2fd [Andrew Ash] Make Spark's block manager port configurable
    17c79bb [Andrew Ash] Add a configuration option for spark-shell's class server
    f34115d [Andrew Ash] SPARK-1176 Add port configuration for HttpBroadcast
    49ee29b [Andrew Ash] SPARK-1174 Add port configuration for HttpFileServer
    1c0981a [Andrew Ash] Make port in HttpServer configurable
    09f7e458
    History
    [SPARK-2157] Enable tight firewall rules for Spark
    Andrew Or authored
    The goal of this PR is to allow users of Spark to write tight firewall rules for their clusters. This is currently not possible because Spark uses random ports in many places, notably the communication between executors and drivers. The changes in this PR are based on top of ash211's changes in #1107.
    
    The list covered here may or may not be the complete set of port needed for Spark to operate perfectly. However, as of the latest commit there are no known sources of random ports (except in tests). I have not documented a few of the more obscure configs.
    
    My spark-env.sh looks like this:
    ```
    export SPARK_MASTER_PORT=6060
    export SPARK_WORKER_PORT=7070
    export SPARK_MASTER_WEBUI_PORT=9090
    export SPARK_WORKER_WEBUI_PORT=9091
    ```
    and my spark-defaults.conf looks like this:
    ```
    spark.master spark://andrews-mbp:6060
    spark.driver.port 5001
    spark.fileserver.port 5011
    spark.broadcast.port 5021
    spark.replClassServer.port 5031
    spark.blockManager.port 5041
    spark.executor.port 5051
    ```
    
    Author: Andrew Or <andrewor14@gmail.com>
    Author: Andrew Ash <andrew@andrewash.com>
    
    Closes #1777 from andrewor14/configure-ports and squashes the following commits:
    
    621267b [Andrew Or] Merge branch 'master' of github.com:apache/spark into configure-ports
    8a6b820 [Andrew Or] Use a random UI port during tests
    7da0493 [Andrew Or] Fix tests
    523c30e [Andrew Or] Add test for isBindCollision
    b97b02a [Andrew Or] Minor fixes
    c22ad00 [Andrew Or] Merge branch 'master' of github.com:apache/spark into configure-ports
    93d359f [Andrew Or] Executors connect to wrong port when collision occurs
    d502e5f [Andrew Or] Handle port collisions when creating Akka systems
    a2dd05c [Andrew Or] Patrick's comment nit
    86461e2 [Andrew Or] Remove spark.executor.env.port and spark.standalone.client.port
    1d2d5c6 [Andrew Or] Fix ports for standalone cluster mode
    cb3be88 [Andrew Or] Various doc fixes (broken link, format etc.)
    e837cde [Andrew Or] Remove outdated TODOs
    bfbab28 [Andrew Or] Merge branch 'master' of github.com:apache/spark into configure-ports
    de1b207 [Andrew Or] Update docs to reflect new ports
    b565079 [Andrew Or] Add spark.ports.maxRetries
    2551eb2 [Andrew Or] Remove spark.worker.watcher.port
    151327a [Andrew Or] Merge branch 'master' of github.com:apache/spark into configure-ports
    9868358 [Andrew Or] Add a few miscellaneous ports
    6016e77 [Andrew Or] Add spark.executor.port
    8d836e6 [Andrew Or] Also document SPARK_{MASTER/WORKER}_WEBUI_PORT
    4d9e6f3 [Andrew Or] Fix super subtle bug
    3f8e51b [Andrew Or] Correct erroneous docs...
    e111d08 [Andrew Or] Add names for UI services
    470f38c [Andrew Or] Special case non-"Address already in use" exceptions
    1d7e408 [Andrew Or] Treat 0 ports specially + return correct ConnectionManager port
    ba32280 [Andrew Or] Minor fixes
    6b550b0 [Andrew Or] Assorted fixes
    73fbe89 [Andrew Or] Move start service logic to Utils
    ec676f4 [Andrew Or] Merge branch 'SPARK-2157' of github.com:ash211/spark into configure-ports
    038a579 [Andrew Ash] Trust the server start function to report the port the service started on
    7c5bdc4 [Andrew Ash] Fix style issue
    0347aef [Andrew Ash] Unify port fallback logic to a single place
    24a4c32 [Andrew Ash] Remove type on val to match surrounding style
    9e4ad96 [Andrew Ash] Reformat for style checker
    5d84e0e [Andrew Ash] Document new port configuration options
    066dc7a [Andrew Ash] Fix up HttpServer port increments
    cad16da [Andrew Ash] Add fallover increment logic for HttpServer
    c5a0568 [Andrew Ash] Fix ConnectionManager to retry with increment
    b80d2fd [Andrew Ash] Make Spark's block manager port configurable
    17c79bb [Andrew Ash] Add a configuration option for spark-shell's class server
    f34115d [Andrew Ash] SPARK-1176 Add port configuration for HttpBroadcast
    49ee29b [Andrew Ash] SPARK-1174 Add port configuration for HttpFileServer
    1c0981a [Andrew Ash] Make port in HttpServer configurable