Commits · fe3eceab5724bec0103471eb905bb9701120b04a · cs525-sp18-g07 / spark

Jan 31, 2013

Remove activation of profiles by default · fe3eceab

Mikhail Bautin authored 12 years ago

See the discussion at https://github.com/mesos/spark/pull/355 for why
default profile activation is a problem.

fe3eceab

Jan 30, 2013
- Merge pull request #430 from pwendell/pyspark-guide · 55327a28
  Matei Zaharia authored 12 years ago
  
  Minor improvements to PySpark docs
  55327a28
- Make module help available in python shell. · 3f945e3b
  Patrick Wendell authored 12 years ago
  
  Also, adds a line in doc explaining how to use.
  3f945e3b
- Inclue packaging and launching pyspark in guide. · 58a7d320
  Patrick Wendell authored 12 years ago
  
  It's nicer if all the commands you need are made explicit.
  58a7d320
- Merge pull request #426 from woggling/conn-manager-ips · d12330bd
  Matei Zaharia authored 12 years ago
  
  Remember ConnectionManagerId used to initiate SendingConnections
  d12330bd
- Merge pull request #428 from woggling/mesos-exec-id · 612a9fee
  Matei Zaharia authored 12 years ago
  
  Make ExecutorIDs include SlaveIDs when running Mesos
  612a9fee
- Merge pull request #429 from stephenh/includemessage · dfb721b9
  Matei Zaharia authored 12 years ago
  
  Include message and exitStatus if availalbe.
  dfb721b9
- Include message and exitStatus if availalbe. · 871476d5
  Stephen Haberman authored 12 years ago
  
  871476d5
- Remove remants of attempt to use slaveId-executorId in MesosExecutorBackend · 252845d3
  Charles Reiss authored 12 years ago
  
  252845d3
- Use Mesos ExecutorIDs to hold SlaveIDs. Then we can safely use · f7de6978
  Charles Reiss authored 12 years ago
  
  the Mesos ExecutorID as a Spark ExecutorID.
  f7de6978
Jan 29, 2013

Remember ConnectionManagerId used to initiate SendingConnections. · 16a0789e

Charles Reiss authored 12 years ago

This prevents ConnectionManager from getting confused if a machine
has multiple host names and the one getHostName() finds happens
not to be the one that was passed from, e.g., the BlockManagerMaster.

16a0789e

Merge remote-tracking branch 'stephenh/removefailedjob' · d54b10b6
Matei Zaharia authored 12 years ago
```
Conflicts:
	core/src/main/scala/spark/deploy/master/Master.scala
```
d54b10b6
Merge pull request #425 from stephenh/toDebugString · ccb67ff2
Matei Zaharia authored 12 years ago
```
Add RDD.toDebugString.
```
ccb67ff2
Merge pull request #415 from stephenh/driver · 9ae11603
Matei Zaharia authored 12 years ago
```
Replace old 'master' term with 'driver'.
```
9ae11603

Simplify checkpointing code and RDD class a little: · 64ba6a8c

Matei Zaharia authored 12 years ago

- RDD's getDependencies and getSplits methods are now guaranteed to be
  called only once, so subclasses can safely do computation in there
  without worrying about caching the results.

- The management of a "splits_" variable that is cleared out when we
  checkpoint an RDD is now done in the RDD class.

- A few of the RDD subclasses are simpler.

- CheckpointRDD's compute() method no longer assumes that it is given a
  CheckpointRDDSplit -- it can work just as well on a split from the
  original RDD, because it only looks at its index. This is important
  because things like UnionRDD and ZippedRDD remember the parent's
  splits as part of their own and wouldn't work on checkpointed parents.

- RDD.iterator can now reuse cached data if an RDD is computed before it
  is checkpointed. It seems like it wouldn't do this before (it always
  called iterator() on the CheckpointRDD, which read from HDFS).

64ba6a8c

Fix code that depended on metadata cleaner interval being in minutes · b29599e5
Matei Zaharia authored 12 years ago

b29599e5
Include name, if set, in RDD.toString(). · cbf72bff
Stephen Haberman authored 12 years ago

cbf72bff
Add number of splits. · 3cda14af
Stephen Haberman authored 12 years ago

3cda14af
Merge branch 'master' of github.com:mesos/spark · a1ecec8d
Matei Zaharia authored 12 years ago

a1ecec8d
Add JavaRDDLike.toDebugString(). · 951cfd9b
Stephen Haberman authored 12 years ago

951cfd9b
Merge pull request #413 from pwendell/stage-logging · f6eb1f08
Matei Zaharia authored 12 years ago
```
SPARK-658: Adding logging of stage duration
```
f6eb1f08

Jan 28, 2013
- Add RDD.toDebugString. · b45857c9
  Stephen Haberman authored 12 years ago
  
  Original idea by Nathan Kronenfeld.
  b45857c9
- Units from ms -> s · 7ee824e4
  Patrick Wendell authored 12 years ago
  
  7ee824e4
- Merge branch 'master' into driver · 13368818
  Stephen Haberman authored 12 years ago
  
  Conflicts: core/src/main/scala/spark/SparkContext.scala core/src/main/scala/spark/SparkEnv.scala core/src/main/scala/spark/deploy/LocalSparkCluster.scala core/src/main/scala/spark/executor/StandaloneExecutorBackend.scala core/src/main/scala/spark/scheduler/cluster/SparkDeploySchedulerBackend.scala core/src/main/scala/spark/scheduler/cluster/StandaloneClusterMessage.scala core/src/main/scala/spark/scheduler/cluster/StandaloneSchedulerBackend.scala core/src/main/scala/spark/storage/BlockManagerMaster.scala core/src/main/scala/spark/storage/ThreadingTest.scala core/src/test/scala/spark/MapOutputTrackerSuite.scala
  13368818
- Merge pull request #424 from pwendell/logging-cleanup · dda2ce01
  Matei Zaharia authored 12 years ago
  
  Some DEBUG-level log cleanup.
  dda2ce01
- Merge pull request #423 from squito/long_float_accums · 8160f03a
  Matei Zaharia authored 12 years ago
  
  add long and float accumulatorparams
  8160f03a
- Some DEBUG-level log cleanup. · 1f9b486a
  Patrick Wendell authored 12 years ago
  
  A few changes to make the DEBUG-level logs less noisy and more readable. - Moved a few very frequent messages to Trace - Changed some BlockManger log messages to make them more understandable SPARK-666 #resolve
  1f9b486a
- add long and float accumulatorparams · efff7bfb
  Imran Rashid authored 12 years ago
  
  efff7bfb
- Making submission time a field · 501433f1
  Patrick Wendell authored 12 years ago
  
  501433f1
- Renaming stage finished function · c423be7d
  Patrick Wendell authored 12 years ago
  
  c423be7d
- SPARK-658: Adding logging of stage duration · 07f568e1
  Patrick Wendell authored 12 years ago
  
  07f568e1
- Change time unit in MetadataCleaner to seconds · 286f8f87
  Matei Zaharia authored 12 years ago
  
  286f8f87
- Clean up BlockManagerUI a little (make it not be an object, merge with · f03d9760
  Matei Zaharia authored 12 years ago
  
  Directives, and bind to a random port)
  f03d9760
- Rename more things from slave to executor · 90985072
  Matei Zaharia authored 12 years ago
  
  90985072
Jan 27, 2013
- Track workers by executor ID instead of hostname to allow multiple · 44b4a0f8
  Matei Zaharia authored 12 years ago
  
  executors per machine and remove the need for multiple IP addresses in unit tests.
  44b4a0f8
- Merge pull request #419 from shivaram/ec2-ip-change · b9e2d9ef
  Matei Zaharia authored 12 years ago
  
  Detect whether we run on EC2 using ec2-metadata as well
  b9e2d9ef
- Merge pull request #401 from squito/blockmanager_ui · 6ad8540b
  Matei Zaharia authored 12 years ago
  
  Blockmanager ui
  6ad8540b
- Detect whether we run on EC2 using ec2-metadata as well · 717b221c
  Shivaram Venkataraman authored 12 years ago
  
  717b221c
Jan 26, 2013
- Merge pull request #418 from woggling/reregister-deadlock · 49f6472c
  Matei Zaharia authored 12 years ago
  
  Fix BlockManager reregistration deadlock; do BlockManager reregistration more asynchronously
  49f6472c
- Handle duplicate registrations better. · 58fc6b2b
  Charles Reiss authored 12 years ago
  
  58fc6b2b