Commits · 3e63d98f09065386901d78c141b0da93cdce0f76 · cs525-sp18-g07 / spark

Mar 09, 2014

SPARK-782 Clean up for ASM dependency. · b9be1609

Patrick Wendell authored 11 years ago

This makes two changes.

1) Spark uses the shaded version of asm that is (conveniently) published
   with Kryo.
2) Existing exclude rules around asm are updated to reflect the new groupId
   of `org.ow2.asm`. This made all of the old rules not work with newer Hadoop
   versions that pull in new asm versions.

Author: Patrick Wendell <pwendell@gmail.com>

Closes #100 from pwendell/asm and squashes the following commits:

9235f3f [Patrick Wendell] SPARK-782 Clean up for ASM dependency.

b9be1609

Mar 08, 2014

SPARK-1193. Fix indentation in pom.xmls · a99fb374

Sandy Ryza authored 11 years ago

Author: Sandy Ryza <sandy@cloudera.com>

Closes #91 from sryza/sandy-spark-1193 and squashes the following commits:

a878124 [Sandy Ryza] SPARK-1193. Fix indentation in pom.xmls

a99fb374

Mar 06, 2014

SPARK-1189: Add Security to Spark - Akka, Http, ConnectionManager, UI use servlets · 7edbea41

Thomas Graves authored 11 years ago

resubmit pull request. was https://github.com/apache/incubator-spark/pull/332.

Author: Thomas Graves <tgraves@apache.org>

Closes #33 from tgravescs/security-branch-0.9-with-client-rebase and squashes the following commits:

dfe3918 [Thomas Graves] Fix merge conflict since startUserClass now using runAsUser
05eebed [Thomas Graves] Fix dependency lost in upmerge
d1040ec [Thomas Graves] Fix up various imports
05ff5e0 [Thomas Graves] Fix up imports after upmerging to master
ac046b3 [Thomas Graves] Merge remote-tracking branch 'upstream/master' into security-branch-0.9-with-client-rebase
13733e1 [Thomas Graves] Pass securityManager and SparkConf around where we can. Switch to use sparkConf for reading config whereever possible. Added ConnectionManagerSuite unit tests.
4a57acc [Thomas Graves] Change UI createHandler routines to createServlet since they now return servlets
2f77147 [Thomas Graves] Rework from comments
50dd9f2 [Thomas Graves] fix header in SecurityManager
ecbfb65 [Thomas Graves] Fix spacing and formatting
b514bec [Thomas Graves] Fix reference to config
ed3d1c1 [Thomas Graves] Add security.md
6f7ddf3 [Thomas Graves] Convert SaslClient and SaslServer to scala, change spark.authenticate.ui to spark.ui.acls.enable, and fix up various other things from review comments
2d9e23e [Thomas Graves] Merge remote-tracking branch 'upstream/master' into security-branch-0.9-with-client-rebase_rework
5721c5a [Thomas Graves] update AkkaUtilsSuite test for the actorSelection changes, fix typos based on comments, and remove extra lines I missed in rebase from AkkaUtils
f351763 [Thomas Graves] Add Security to Spark - Akka, Http, ConnectionManager, UI to use servlets

7edbea41

Mar 02, 2014

SPARK-1121: Include avro for yarn-alpha builds · c3f5e075

Patrick Wendell authored 11 years ago

This lets us explicitly include Avro based on a profile for 0.23.X
builds. It makes me sad how convoluted it is to express this logic
in Maven. @tgraves and @sryza curious if this works for you.

I'm also considering just reverting to how it was before. The only
real problem was that Spark advertised a dependency on Avro
even though it only really depends transitively on Avro through
other deps.

Author: Patrick Wendell <pwendell@gmail.com>

Closes #49 from pwendell/avro-build-fix and squashes the following commits:

8d6ee92 [Patrick Wendell] SPARK-1121: Add avro to yarn-alpha profile

c3f5e075

Remove remaining references to incubation · 1fd2bfd3

Patrick Wendell authored 11 years ago

This removes some loose ends not caught by the other (incubating -> tlp) patches. @markhamstra this updates the version as you mentioned earlier.

Author: Patrick Wendell <pwendell@gmail.com>

Closes #51 from pwendell/tlp and squashes the following commits:

d553b1b [Patrick Wendell] Remove remaining references to incubation

1fd2bfd3

Feb 27, 2014

SPARK 1084.1 (resubmitted) · 12bbca20

Sean Owen authored 11 years ago

(Ported from https://github.com/apache/incubator-spark/pull/637 )

Author: Sean Owen <sowen@cloudera.com>

Closes #31 from srowen/SPARK-1084.1 and squashes the following commits:

6c4a32c [Sean Owen] Suppress warnings about legitimate unchecked array creations, or change code to avoid it
f35b833 [Sean Owen] Fix two misc javadoc problems
254e8ef [Sean Owen] Fix one new style error introduced in scaladoc warning commit
5b2fce2 [Sean Owen] Fix scaladoc invocation warning, and enable javac warnings properly, with plugin config updates
007762b [Sean Owen] Remove dead scaladoc links
b8ff8cb [Sean Owen] Replace deprecated Ant <tasks> with <target>

12bbca20

[SPARK-1089] fix the regression problem on ADD_JARS in 0.9 · 345df5f4

CodingCat authored 11 years ago

https://spark-project.atlassian.net/browse/SPARK-1089

copied from JIRA, reported by @ash211

"Using the ADD_JARS environment variable with spark-shell used to add the jar to both the shell and the various workers. Now it only adds to the workers and importing a custom class in the shell is broken.
The workaround is to add custom jars to both ADD_JARS and SPARK_CLASSPATH.
We should fix ADD_JARS so it works properly again.
See various threads on the user list:
https://mail-archives.apache.org/mod_mbox/incubator-spark-user/201402.mbox/%3CCAJbo4neMLiTrnm1XbyqomWmp0m+EUcg4yE-txuRGSVKOb5KLeA@mail.gmail.com%3E
(another one that doesn't appear in the archives yet titled "ADD_JARS not working on 0.9")"

The reason of this bug is two-folds

in the current implementation of SparkILoop.scala, the settings.classpath is not set properly when the process() method is invoked

the weird behaviour of Scala 2.10, (I personally thought it is a bug)

if we simply set value of a PathSettings object (like settings.classpath), the isDefault is not set to true (this is a flag showing if the variable is modified), so it makes the PathResolver loads the default CLASSPATH environment variable value to calculated the path (see https://github.com/scala/scala/blob/2.10.x/src/compiler/scala/tools/util/PathResolver.scala#L215)

what we have to do is to manually make this flag set, (https://github.com/CodingCat/incubator-spark/blob/e3991d97ddc33e77645e4559b13bf78b9e68239a/repl/src/main/scala/org/apache/spark/repl/SparkILoop.scala#L884)

Author: CodingCat <zhunansjtu@gmail.com>

Closes #13 from CodingCat/SPARK-1089 and squashes the following commits:

8af81e7 [CodingCat] impose non-null settings
9aa2125 [CodingCat] code cleaning
ce36676 [CodingCat] code cleaning
e045582 [CodingCat] fix the regression problem on ADD_JARS in 0.9

345df5f4

Feb 23, 2014

SPARK-1071: Tidy logging strategy and use of log4j · c0ef3afa

Sean Owen authored 11 years ago

Prompted by a recent thread on the mailing list, I tried and failed to see if Spark can be made independent of log4j. There are a few cases where control of the underlying logging is pretty useful, and to do that, you have to bind to a specific logger.

Instead I propose some tidying that leaves Spark's use of log4j, but gets rid of warnings and should still enable downstream users to switch. The idea is to pipe everything (except log4j) through SLF4J, and have Spark use SLF4J directly when logging, and where Spark needs to output info (REPL and tests), bind from SLF4J to log4j.

This leaves the same behavior in Spark. It means that downstream users who want to use something except log4j should:

- Exclude dependencies on log4j, slf4j-log4j12 from Spark
- Include dependency on log4j-over-slf4j
- Include dependency on another logger X, and another slf4j-X
- Recreate any log config that Spark does, that is needed, in the other logger's config

That sounds about right.

Here are the key changes:

- Include the jcl-over-slf4j shim everywhere by depending on it in core.
- Exclude dependencies on commons-logging from third-party libraries.
- Include the jul-to-slf4j shim everywhere by depending on it in core.
- Exclude slf4j-* dependencies from third-party libraries to prevent collision or warnings
- Added missing slf4j-log4j12 binding to GraphX, Bagel module tests

And minor/incidental changes:

- Update to SLF4J 1.7.5, which happily matches Hadoop 2’s version and is a recommended update over 1.7.2
- (Remove a duplicate HBase dependency declaration in SparkBuild.scala)
- (Remove a duplicate mockito dependency declaration that was causing warnings and bugging me)

Author: Sean Owen <sowen@cloudera.com>

Closes #570 from srowen/SPARK-1071 and squashes the following commits:

52eac9f [Sean Owen] Add slf4j-over-log4j12 dependency to core (non-test) and remove it from things that depend on core.
77a7fa9 [Sean Owen] SPARK-1071: Tidy logging strategy and use of log4j

c0ef3afa

Feb 17, 2014

[SPARK-1090] improvement on spark_shell (help information, configure memory) · e0d49ad2

CodingCat authored 11 years ago

https://spark-project.atlassian.net/browse/SPARK-1090

spark-shell should print help information about parameters and should allow user to configure exe memory
there is no document about hot to set --cores/-c in spark-shell

and also

users should be able to set executor memory through command line options

In this PR I also check the format of the options passed by the user

Author: CodingCat <zhunansjtu@gmail.com>

Closes #599 from CodingCat/spark_shell_improve and squashes the following commits:

de5aa38 [CodingCat] add parameter to set driver memory
915cbf8 [CodingCat] improvement on spark_shell (help information, configure memory)

e0d49ad2

Feb 09, 2014

Merge pull request #557 from ScrapCodes/style. Closes #557. · b69f8b2a

Patrick Wendell authored 11 years ago

SPARK-1058, Fix Style Errors and Add Scala Style to Spark Build.

Author: Patrick Wendell <pwendell@gmail.com>
Author: Prashant Sharma <scrapcodes@gmail.com>

== Merge branch commits ==

commit 1a8bd1c059b842cb95cc246aaea74a79fec684f4
Author: Prashant Sharma <scrapcodes@gmail.com>
Date:   Sun Feb 9 17:39:07 2014 +0530

    scala style fixes

commit f91709887a8e0b608c5c2b282db19b8a44d53a43
Author: Patrick Wendell <pwendell@gmail.com>
Date:   Fri Jan 24 11:22:53 2014 -0800

    Adding scalastyle snapshot

b69f8b2a

Feb 08, 2014

Merge pull request #542 from markhamstra/versionBump. Closes #542. · c2341c92

Mark Hamstra authored 11 years ago

Version number to 1.0.0-SNAPSHOT

Since 0.9.0-incubating is done and out the door, we shouldn't be building 0.9.0-incubating-SNAPSHOT anymore.

@pwendell

Author: Mark Hamstra <markhamstra@gmail.com>

== Merge branch commits ==

commit 1b00a8a7c1a7f251b4bb3774b84b9e64758eaa71
Author: Mark Hamstra <markhamstra@gmail.com>
Date:   Wed Feb 5 09:30:32 2014 -0800

    Version number to 1.0.0-SNAPSHOT

c2341c92

Jan 14, 2014
- Add missing header files · 23034798
  Patrick Wendell authored 11 years ago
  
  23034798
Jan 12, 2014
- Removing mentions in tests · 0bb33076
  Patrick Wendell authored 11 years ago
  
  0bb33076
Jan 10, 2014

Revert GraphX changes to SparkILoopInit · 0ca18b8b

Ankur Dave authored 11 years ago

The changes were to support a custom banner in spark-shell for use by
graphx-shell, but once GraphX is merged into Spark, a separate shell
will be unnecessary.

0ca18b8b

Jan 07, 2014
- Added license header and removed @author tag · 4689ce29
  Luca Rosellini authored 11 years ago
  
  4689ce29
Jan 03, 2014

Added ‘-i’ command line option to spark REPL. · 0b6db8c1

Luca Rosellini authored 11 years ago

We had to create a new implementation of both scala.tools.nsc.CompilerCommand and scala.tools.nsc.Settings, because using scala.tools.nsc.GenericRunnerSettings would bring in other options (-howtorun, -save and -execute) which don’t make sense in Spark.
Any new Spark specific command line option could now be added to org.apache.spark.repl.SparkRunnerSettings class.

Since the behavior of loading a script from the command line should be the same as loading it using the “:load” command inside the shell, the script should be loaded when the SparkContext is available, that’s why we had to move the call to ‘loadfiles(settings)’ _after_ the call to postInitialization(). This still doesn’t work if ‘isAsync = true’.

0b6db8c1

fixed review comments · 94f2fffa
Prashant Sharma authored 11 years ago

94f2fffa

Jan 01, 2014

Miscellaneous fixes from code review. · e2c68642

Matei Zaharia authored 11 years ago

Also replaced SparkConf.getOrElse with just a "get" that takes a default
value, and added getInt, getLong, etc to make code that uses this
simpler later on.

e2c68642

Dec 31, 2013
- Removing initLogging entirely · 18181e6c
  Patrick Wendell authored 11 years ago
  
  18181e6c
Dec 30, 2013

SPARK-1008: Logging improvments · cffe1c1d

Patrick Wendell authored 11 years ago

1. Adds a default log4j file that gets loaded if users haven't specified a log4j file.
2. Isolates use of the tools assembly jar. I found this produced SLF4J warnings
after building with SBT (and I've seen similar warnings on the mailing list).

cffe1c1d

Dec 28, 2013

Various fixes to configuration code · 642029e7

Matei Zaharia authored 11 years ago

- Got rid of global SparkContext.globalConf
- Pass SparkConf to serializers and compression codecs
- Made SparkConf public instead of private[spark]
- Improved API of SparkContext and SparkConf
- Switched executor environment vars to be passed through SparkConf
- Fixed some places that were still using system properties
- Fixed some tests, though others are still failing

This still fails several tests in core, repl and streaming, likely due
to properties not being set or cleared correctly (some of the tests run
fine in isolation).

642029e7

Dec 24, 2013
- spark-544, introducing SparkConf and related configuration overhaul. · 2573add9
  Prashant Sharma authored 11 years ago
  
  2573add9
Dec 15, 2013
- Use scala.binary.version in POMs · 09ed7ddf
  Mark Hamstra authored 11 years ago
  
  09ed7ddf
Dec 13, 2013
- Review comments on the PR for scala 2.10 migration. · a854cc53
  Prashant Sharma authored 11 years ago
  
  a854cc53
Dec 10, 2013
- Style fixes and addressed review comments at #221 · 17db6a90
  Prashant Sharma authored 11 years ago
  
  17db6a90
Dec 07, 2013
- Incorporated Patrick's feedback comment on #211 and made maven... · 7ad6921a
  Prashant Sharma authored 11 years ago
  
  Incorporated Patrick's feedback comment on #211 and made maven build/dep-resolution atleast a bit faster.
  7ad6921a
Nov 26, 2013
- Fixed compile time warnings and formatting post merge. · d092a8cc
  Prashant Sharma authored 11 years ago
  
  d092a8cc
Nov 15, 2013

Various merge corrections · f629ba95

Aaron Davidson authored 11 years ago

I've diff'd this patch against my own -- since they were both created
independently, this means that two sets of eyes have gone over all the
merge conflicts that were created, so I'm feeling significantly more
confident in the resulting PR.

@rxin has looked at the changes to the repl and is resoundingly
confident that they are correct.

f629ba95

Nov 09, 2013
- Propagate the SparkContext local property from the thread that calls the... · 31929994
  Reynold Xin authored 11 years ago
  
  Propagate the SparkContext local property from the thread that calls the spark-repl to the actual execution thread.
  31929994
Nov 04, 2013

This commit adds a new graphx-shell which is essentially the same as · 3c37928f

Joseph E. Gonzalez authored 11 years ago

the spark shell but with GraphX packages automatically imported
and with Kryo serialization enabled for GraphX types.

In addition the graphx-shell has a nifty new logo.

To make these changes minimally invasive in the SparkILoop.scala
I added some additional environment variables:

   SPARK_BANNER_TEXT: If set this string is displayed instead
   of the spark logo

   SPARK_SHELL_INIT_BLOCK: if set this expression is evaluated in the
   spark shell after the spark context is created.

3c37928f

Oct 24, 2013
- Makes Spark SIMR ready. · 05a0df2b
  Ali Ghodsi authored 11 years ago
  
  05a0df2b
Oct 17, 2013

Spark shell exits if it cannot create SparkContext · 74737264

Aaron Davidson authored 11 years ago

Mainly, this occurs if you provide a messed up MASTER url (one that doesn't match one
of our regexes). Previously, we would default to Mesos, fail, and then start the shell
anyway, except that any Spark command would fail.

74737264

Oct 12, 2013
- deprecate "spark" script and SPAKR_CLASSPATH environment variable · 52ccf4f8
  Andrew xia authored 11 years ago
  
  52ccf4f8
Oct 06, 2013
- Merging build changes in from 0.8 · aa9fb849
  Patrick Wendell authored 11 years ago
  
  aa9fb849
Sep 26, 2013
- Bug fix in master build · e2ff59af
  Patrick Wendell authored 11 years ago
  
  e2ff59af
- fixed maven build for scala 2.10 · 7ff4c2d3
  Prashant Sharma authored 11 years ago
  
  7ff4c2d3
Sep 24, 2013
- Update build version in master · 6079721f
  Patrick Wendell authored 11 years ago
  
  6079721f
Sep 15, 2013
- ported repl improvements from master · 69fd42ae
  Prashant Sharma authored 11 years ago
  
  69fd42ae
- Fixed repl suite · 20c65bc3
  Prashant Sharma authored 11 years ago
  
  20c65bc3
Sep 10, 2013
- Few more fixes to tests broken during merge · 6fcfefcb
  Prashant Sharma authored 11 years ago
  
  6fcfefcb