Add withConfig api to allow running an execution with a transformed config #1489
Conversation
I think the cache key needs to include the config if we do this. Also, if we go this route, I would say we change …
I like this idea. It will help in many cases where we have to go wide on a source using splits map-side but want to write larger files to HDFS at the end of a job. Currently there is not much choice aside from using .shard
@@ -209,6 +209,9 @@ object Execution {
  override def join[T, U](t: Execution[T], u: Execution[U]): Execution[(T, U)] = t.zip(u)
}

def withConfig[T](ex: => Execution[T])(c: Config => Config): Execution[T] =
If this is lazy, the arg in the constructor needs to be too; otherwise this should not be lazy.
Looks great! (1 minor issue)
Killed the lazy; I don't think it's useful here.
Good to go now, @johnynek?
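The laziness point raised in review can be illustrated with a minimal, self-contained sketch (the names `deferred`, `forced`, and `Box` are hypothetical and not part of Scalding): a by-name (`=> A`) method parameter only defers evaluation if the place it flows into also stores it unevaluated. If the constructor (or a local `val`) is strict, the by-name argument is forced immediately and the laziness buys nothing.

```scala
// Hypothetical sketch, not Scalding code: shows when a by-name arg
// actually stays deferred.
object LazinessSketch {
  // Box stores a computation as an unevaluated thunk.
  final case class Box[A](thunk: () => A)

  var evaluated = false

  // By-name arg captured in a thunk: nothing runs until thunk() is called.
  def deferred[A](a: => A): Box[A] = Box(() => a)

  // By-name arg assigned to a strict val: it is forced right here,
  // despite the => in the signature.
  def forced[A](a: => A): Box[A] = { val v = a; Box(() => v) }

  def main(args: Array[String]): Unit = {
    evaluated = false
    val d = deferred { evaluated = true; 42 }
    println(evaluated)  // false: still deferred
    println(d.thunk())  // 42; only now is the block evaluated

    evaluated = false
    forced { evaluated = true; 42 }
    println(evaluated)  // true: forced immediately
  }
}
```

This is why "killed the lazy" is the right call: once the argument is stored strictly, keeping `=> Execution[T]` in the signature only obscures when evaluation happens.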
Add withConfig api to allow running an execution with a transformed config
Adds the ability to transform Configs (and hence args) for sub-sections of Execution flows. This can be useful for overriding Hadoop- or source-level options in subsections, giving the user more control over things like split sizes, memory used in mappers/reducers, combining small files, etc.
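The intended semantics can be sketched with a deliberately simplified model (this is not Scalding's real implementation; `Config` is modeled as a plain `Map` and `Execution[T]` as a function from config to result): `withConfig(ex)(c)` runs `ex` under the transformed config `c(outer)`, while sibling executions in the same flow still see the untouched outer config.

```scala
// Hypothetical, simplified model of the PR's withConfig combinator.
// Names mirror the PR, but Execution/Config here are toy stand-ins.
object WithConfigSketch {
  type Config = Map[String, String]

  final case class Execution[T](run: Config => T) {
    def map[U](f: T => U): Execution[U] = Execution(c => f(run(c)))
    def zip[U](that: Execution[U]): Execution[(T, U)] =
      Execution(c => (run(c), that.run(c)))
  }

  // An Execution that just observes the config it runs under.
  def getConfig: Execution[Config] = Execution(identity)

  // The combinator: ex sees c(outer); everything else sees outer unchanged.
  def withConfig[T](ex: Execution[T])(c: Config => Config): Execution[T] =
    Execution(outer => ex.run(c(outer)))

  def main(args: Array[String]): Unit = {
    val key = "mapreduce.input.split.size" // illustrative option name
    val readSplit: Execution[String] =
      getConfig.map(_.getOrElse(key, "default"))

    // Override the option only inside this sub-section of the flow.
    val tuned = withConfig(readSplit)(_ + (key -> "1g"))

    // The sibling readSplit still runs with the outer (empty) config.
    println(tuned.zip(readSplit).run(Map.empty)) // (1g,default)
  }
}
```

This also motivates the earlier review note about cache keys: two runs of the same logical Execution under different transformed configs can produce different results, so the transformed config must participate in the cache key.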