Skip to content
Snippets Groups Projects
Commit 58a7d320 authored by Patrick Wendell's avatar Patrick Wendell
Browse files

Inclue packaging and launching pyspark in guide.

It's nicer if all the commands you need are made explicit.
parent 86057ec7
No related branches found
No related tags found
No related merge requests found
...@@ -67,8 +67,14 @@ The script automatically adds the `pyspark` package to the `PYTHONPATH`. ...@@ -67,8 +67,14 @@ The script automatically adds the `pyspark` package to the `PYTHONPATH`.
# Interactive Use # Interactive Use
The `pyspark` script launches a Python interpreter that is configured to run PySpark jobs. The `pyspark` script launches a Python interpreter that is configured to run PySpark jobs. To use `pyspark` interactively, first build Spark, then launch it directly from the command line without any options:
When run without any input files, `pyspark` launches a shell that can be used explore data interactively, which is a simple way to learn the API:
{% highlight bash %}
$ sbt/sbt package
$ ./pyspark
{% endhighlight %}
The Python shell can be used explore data interactively and is a simple way to learn the API:
{% highlight python %} {% highlight python %}
>>> words = sc.textFile("/usr/share/dict/words") >>> words = sc.textFile("/usr/share/dict/words")
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment