- Mar 20, 2013
-
-
Reynold Xin authored
CoGroupedRDD.
-
- Mar 16, 2013
-
-
Mark Hamstra authored
-
- Mar 11, 2013
-
-
Mark Hamstra authored
-
Mark Hamstra authored
-
- Mar 05, 2013
-
-
Mark Hamstra authored
-
Mark Hamstra authored
-
- Mar 04, 2013
-
-
Mark Hamstra authored
-
- Feb 18, 2013
-
-
Matei Zaharia authored
-
- Feb 05, 2013
-
-
Stephen Haberman authored
-
- Feb 04, 2013
-
-
Matei Zaharia authored
-
- Feb 01, 2013
-
-
Matei Zaharia authored
-
Matei Zaharia authored
These operations used to wait for all the results to be available in an array on the driver program before merging them. They now merge values incrementally as they arrive.
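The difference between the two behaviours can be sketched as follows. This is an illustrative model only, not the code from this commit: the function names `merge_after_collecting`, `merge_incrementally`, and `partial_sums` are hypothetical, and a plain Python iterator stands in for task results arriving at the driver.

```python
from functools import reduce
from typing import Callable, Iterable, Iterator, TypeVar

T = TypeVar("T")


def merge_after_collecting(results: Iterable[T], merge: Callable[[T, T], T]) -> T:
    """Buffer every partial result first, then merge them all at the end."""
    buffered = list(results)  # all partial results are held in memory at once
    return reduce(merge, buffered)


def merge_incrementally(results: Iterable[T], merge: Callable[[T, T], T]) -> T:
    """Fold each partial result into a running value as soon as it arrives."""
    running = None
    seen_any = False
    for partial in results:
        if seen_any:
            running = merge(running, partial)
        else:
            running = partial
            seen_any = True
    if not seen_any:
        raise ValueError("no results to merge")
    return running


def partial_sums() -> Iterator[int]:
    """Hypothetical stand-in for per-partition results reaching the driver."""
    for chunk in ([1, 2, 3], [4, 5], [6]):
        yield sum(chunk)


total = merge_incrementally(partial_sums(), lambda a, b: a + b)
```

Both functions return the same value when the merge function is associative and, if results can arrive in any order, commutative. The incremental version keeps only one running value instead of an array of all partial results.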
-
- Jan 24, 2013
-
-
Stephen Haberman authored
-
Reynold Xin authored
PartitionPruningRDD.
-
- Jan 23, 2013
-
-
Reynold Xin authored
-
Reynold Xin authored
-
Reynold Xin authored
-
- Jan 06, 2013
-
-
Stephen Haberman authored
-
- Jan 05, 2013
-
-
Stephen Haberman authored
-
Stephen Haberman authored
-
- Jan 01, 2013
-
-
Tathagata Das authored
-
Charles Reiss authored
-
- Dec 24, 2012
-
-
Mark Hamstra authored
-
- Nov 28, 2012
-
-
Matei Zaharia authored
partitions and number of elements in each partition)
-
- Oct 30, 2012
-
-
Tathagata Das authored
-
- Oct 07, 2012
-
-
Matei Zaharia authored
-
- Oct 05, 2012
-
-
Andy Konwinski authored
package spark to package spark.rdd and updates all references to them.
-
- Sep 30, 2012
-
-
Matei Zaharia authored
- SizeEstimator was following through a ClassLoader field of Hadoop JobConfs, which referenced the whole interpreter, Scala compiler, etc. Chaos ensued, giving an estimated size in the tens of gigabytes.
- Broadcast variables in local mode were only stored as MEMORY_ONLY and never made accessible over a server, so they fell out of the cache when they were deemed too large and couldn't be reloaded.
-
- Sep 29, 2012
-
-
Matei Zaharia authored
-
Matei Zaharia authored
-
- Sep 26, 2012
-
-
Reynold Xin authored
-
- Aug 23, 2012
-
-
Matei Zaharia authored
The broadcast fix is to store values as MEMORY_ONLY_DESER instead of MEMORY_ONLY, which will save substantial time on serialization.
-
- Aug 03, 2012
-
-
Shivaram Venkataraman authored
-
- Jul 27, 2012
-
-
Matei Zaharia authored
-
Tathagata Das authored
-
- Jul 23, 2012
- Jul 18, 2012
-
-
Denny authored
-
- Nov 30, 2011
-
-
Matei Zaharia authored
merge results into rather than requiring a new object allocation for each element merged. Fixes #95.
-