Skip to content
Snippets Groups Projects
Commit c46067f0 authored by Matei Zaharia's avatar Matei Zaharia
Browse files

Merge pull request #206 from ash211/patch-2

Update tuning.md

Clarify when serializer is used based on recent user@ mailing list discussion.
parents 14bb465b 08afef37
No related branches found
No related tags found
No related merge requests found
...@@ -39,7 +39,8 @@ in your operations) and performance. It provides two serialization libraries: ...@@ -39,7 +39,8 @@ in your operations) and performance. It provides two serialization libraries:
for best performance. for best performance.
You can switch to using Kryo by calling `System.setProperty("spark.serializer", "org.apache.spark.serializer.KryoSerializer")` You can switch to using Kryo by calling `System.setProperty("spark.serializer", "org.apache.spark.serializer.KryoSerializer")`
*before* creating your SparkContext. The only reason it is not the default is because of the custom *before* creating your SparkContext. This setting configures the serializer used for not only shuffling data between worker
nodes but also when serializing RDDs to disk. The only reason Kryo is not the default is because of the custom
registration requirement, but we recommend trying it in any network-intensive application. registration requirement, but we recommend trying it in any network-intensive application.
Finally, to register your classes with Kryo, create a public class that extends Finally, to register your classes with Kryo, create a public class that extends
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment