Add the remaining odds and ends to Execution[T] #985

johnynek · 2014-07-30T22:35:28Z

I think Execution is in a usable state, and the preferred way to write Library code that needs to do multistep operation.

Please review these last changes.

johnynek · 2014-07-31T00:00:44Z

This is a very strange failure:

[error]     cascading.tuple.hadoop.TupleSerialization cannot be cast to org.apache.hadoop.io.serializer.Serialization (SerializationFactory.java:64)
[error]     org.apache.hadoop.io.serializer.SerializationFactory.add(SerializationFactory.java:64)
[error]     org.apache.hadoop.io.serializer.SerializationFactory.<init>(SerializationFactory.java:54)
[error]

But, cascading.tuple.hadoop.TupleSerialization does extend Serialization:

https://github.com/cwensel/cascading/blob/wip-2.6/cascading-hadoop/src/main/shared/cascading/tuple/hadoop/TupleSerialization.java#L85

johnynek · 2014-07-31T00:05:44Z

BTW, this also passes locally for me.

ianoc · 2014-07-31T00:58:08Z

Fails locally for me:

[error] x An ExecutionJob should
[error] x run correctly
[error] could not build flow from assembly: cascading.tuple.hadoop.TupleSerialization cannot be cast to org.apache.hadoop.io.serializer.Serialization
[error] cascading.flow.planner.FlowPlanner.handleExceptionDuringPlanning(FlowPlanner.java:577)
[error] cascading.flow.hadoop.planner.HadoopPlanner.buildFlow(HadoopPlanner.java:286)
[error] cascading.flow.hadoop.planner.HadoopPlanner.buildFlow(HadoopPlanner.java:80)
[error] cascading.flow.FlowConnector.connect(FlowConnector.java:459)
[error] com.twitter.scalding.ExecutionContext$class.buildFlow(ExecutionContext.scala:46)
[error] com.twitter.scalding.ExecutionContext$$anon$1.buildFlow(ExecutionContext.scala:83)
[error] com.twitter.scalding.ExecutionContext$class.run(ExecutionContext.scala:57)
[error] com.twitter.scalding.ExecutionContext$$anon$1.run(ExecutionContext.scala:83)
[error] com.twitter.scalding.Execution$FlowDefExecution$$anonfun$runStats$13$$anonfun$apply$7.apply(Execution.scala:222)

johnynek · 2014-07-31T01:34:30Z

Current theory: using Execution, which is using threads due to the scala.concurrent.ExecutionContext can expose some classloader issues. I'm trying to explicitly set the classloader. This is a tricky one for me. I have not really debugged classpath issues before.

@cwensel any comments? Is calling cascading from multiple threads going to work given all the classloader work going on internally?

johnynek · 2014-07-31T01:41:30Z

Keeps passing for me. :/ @ianoc , can you try this patch?

cwensel · 2014-07-31T02:10:16Z

Cascading has been embedded in a few multithreaded contexts. Spring source guys requested some minor changes years ago, and Lingual JDBC runs under threads (connection pooling etc).

Make sure you are setting the Context ClassLoader where appropriate. Cascading either calls it directly, or Hadoop itself when doing class loading (where we delegate too) relies on it.

johnynek · 2014-07-31T03:59:20Z

@cwensel can you take a look at this stack:
https://travis-ci.org/twitter/scalding/jobs/31297341#L3977

(line 3977)

We don't do much reflection at all (two calls, both get the context loader) in our code. What is looks to me like is that the classloader that hadoop is using is somehow inconsistent. I've tried a few ideas to fix it, but nothing is working out.

The context here is that the job itself is being started in a different thread than all the taps were created.

cwensel · 2014-07-31T05:12:04Z

you can try messing around with having your own UnitOfWorkExecutorStrategy on the Flow and own the context loader the thread executor utilizes.

but to have the problem you are seeing, you would need two peer classloaders loading the same classpath independently. classloaders aren't child first.

maybe try loading org.apache.hadoop.mapred.OutputFormat statically early and force it into a parent to see what happens.

or, you have some messed up dependencies.

fwiw, Lingual jumps through a bunch of hoops to allow for dynamic classloading of Tap/Schemes from remote sources. things work great, even embedded (which is why we have the capability).

ckw

On Jul 30, 2014, at 8:59 PM, P. Oscar Boykin notifications@github.com wrote:

@cwensel can you take a look at this stack:
https://travis-ci.org/twitter/scalding/jobs/31297341#L3977

(line 3977)

We don't do much reflection at all (two calls, both get the context loader) in our code. What is looks to me like is that the classloader that hadoop is using is somehow inconsistent. I've tried a few ideas to fix it, but nothing is working out.

The context here is that the job itself is being started in a different thread than all the taps were created.

—
Reply to this email directly or view it on GitHub.

Chris K Wensel
chris@concurrentinc.com
http://concurrentinc.com

johnynek · 2014-07-31T14:12:09Z

Well, setting the context classloader explicitly to the one that created the Configuration in the job works for 2.9.3, but 2.10.4 errors out with this:

/home/travis/build.sh: line 41: 1473 Killed ./sbt -Dlog4j.configuration=file://$TRAVIS_BUILD_DIR/project/travis-log4j.properties ++$TRAVIS_SCALA_VERSION assembly

I tried restarting it twice, and the log ends with that both times.

johnynek · 2014-07-31T21:32:47Z

I think this works now, but Travis just can't download the jars.

Passes for me and Ian.

I appreciate Travis, but I'd guess the false positive rate is greater than 20%. This dramatically undermines faith in the test failures (and we often ignore real failures because of it).

johnynek · 2014-08-02T20:05:52Z

scalding-core/src/main/scala/com/twitter/scalding/Execution.scala

@@ -136,7 +219,9 @@ object Execution {
    def runStats(conf: Config, mode: Mode)(implicit cec: ConcurrentExecutionContext) = {
      for {
        (flowDef, fn) <- Future(result(conf, mode))


there is no reason for these first two to be in Future threads.

…mporary file size

…r/scalding into back_to_the_future_201407

Add the remaining odds and ends to Execution[T]

Add the remaining odds and ends to Execution[T]

844c293

Try to explicitly set the classloader for MemoryTap

f39b470

Explicitly set the classpath on the JobConf

805cf44

johnynek added 3 commits July 30, 2014 18:03

Try setting cascading appJar

4f97510

Don't execute Executions concurrently

577a6fb

Try setting the classloader from the jobConf

e81a76e

Revert hack in TupleMemoryInputFormat

8982386

johnynek added 3 commits July 31, 2014 07:35

See if test reflection is causing the issue

f21f724

Fork in test

dc2ab71

Fix FileSourceTest to work with fork

1695366

ianoc and others added 10 commits July 31, 2014 15:11

use https for sonatype

080a9aa

Fix paths in repl-test

c8a1550

turn sudo off

75e69ba

File too long bug back with sudo off

c1443c1

Smaller xmx

bfdc6cd

WS change to poke travis

52ac650

Move the travis setting into an env var

14ca05e

Merge error

ed77c73

Try get around travis overriding our sbt settings

d444468

Fix typo

a761f9f

ianoc and others added 4 commits July 31, 2014 19:26

asdf

f295fb5

Just don't use threads in the test job

0de760b

Fix repl tests for non-fork sbt mode

bf07572

Merge with develop

d1f3d6e

johnynek reviewed Aug 2, 2014
View reviewed changes

johnynek and others added 3 commits August 2, 2014 18:56

Fix path and review comments

e50f390

make the reducer estimator test less sensitive to minor changes in te…

ab50470

…mporary file size

Merge branch 'bholt/estimator-test-wider-margin' of github.com:twitte…

3b3520b

…r/scalding into back_to_the_future_201407

johnynek mentioned this pull request Aug 4, 2014

Make estimator test less sensitive to changes in size of intermediates #997

Closed

ianoc added a commit that referenced this pull request Aug 4, 2014

Merge pull request #985 from twitter/back_to_the_future_201407

2d8aa06

Add the remaining odds and ends to Execution[T]

ianoc merged commit 2d8aa06 into develop Aug 4, 2014

ianoc deleted the back_to_the_future_201407 branch August 4, 2014 23:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add the remaining odds and ends to Execution[T] #985

Add the remaining odds and ends to Execution[T] #985

johnynek commented Jul 30, 2014

johnynek commented Jul 31, 2014

johnynek commented Jul 31, 2014

ianoc commented Jul 31, 2014

johnynek commented Jul 31, 2014

johnynek commented Jul 31, 2014

cwensel commented Jul 31, 2014

johnynek commented Jul 31, 2014

cwensel commented Jul 31, 2014

johnynek commented Jul 31, 2014

johnynek commented Jul 31, 2014

johnynek Aug 2, 2014

Add the remaining odds and ends to Execution[T] #985

Add the remaining odds and ends to Execution[T] #985

Conversation

johnynek commented Jul 30, 2014

johnynek commented Jul 31, 2014

johnynek commented Jul 31, 2014

ianoc commented Jul 31, 2014

johnynek commented Jul 31, 2014

johnynek commented Jul 31, 2014

cwensel commented Jul 31, 2014

johnynek commented Jul 31, 2014

cwensel commented Jul 31, 2014

johnynek commented Jul 31, 2014

johnynek commented Jul 31, 2014

johnynek Aug 2, 2014

Choose a reason for hiding this comment