Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge from twitter/scalding #1

Merged
merged 899 commits into from
Jul 31, 2014
Merged

Merge from twitter/scalding #1

merged 899 commits into from
Jul 31, 2014

Conversation

rubanm
Copy link
Owner

@rubanm rubanm commented Jul 31, 2014

No description provided.

johnynek and others added 30 commits June 22, 2014 17:46
Run the test target, so the tests are reformatted
Some minor improvements to typed joining code
Add Hadoop platform test to enable unit testing for Hadoop semantics
Allow tests for typed delimited by fixing swallowed bug
Integrate the repl completely into scald.rb. Fixup scald-rb for better hdfs-local mode now with our provides
johnynek and others added 26 commits July 25, 2014 15:28
…aits

Do not inherit from FileSource in LzoTraits
Generalize handling of merged TypedPipes
Make an Execution[T] type, which is a monad, which makes composing Jobs easy.
JsonLine previously did not have the transformations enabled
during the test phase (transformInTest set to false).

Because of this the only way to use the JsonLine was to create
a sub-class and override the transformInTest method with a ‘true’ def
value: as JsonLine can be effectively used with JobTest only
with transformation enabled, sub-classing and overriding was 
quite common.

When using JsonLine without conversation enabled you get a 
Cascading mapping exception:
“could not select fields: ... from: [{2}:'offset', 'line’]”

Default setting for transformInTest is kept to false for backward
compatibility with 0.9.x and 0.10.x.
…addition)

Conflicts:
	scalding-core/src/main/scala/com/twitter/scalding/Execution.scala
Read and writable partitioned sources
JsonLine: enable read transformation in test to get correct fields in sourceTap
Make map/flatMap lazy on IterablePipe to address OOM
Adds support for Counters inside the Execution Monad.
Improve TypedSimilarity algorithm and update test.
Basic reducer estimator support
rubanm added a commit that referenced this pull request Jul 31, 2014
Merge from twitter/scalding
@rubanm rubanm merged commit 1c8a5bb into rubanm:develop Jul 31, 2014
rubanm pushed a commit that referenced this pull request Feb 4, 2015
rubanm pushed a commit that referenced this pull request Jul 22, 2015
Fix the BDD tests with TypedText migration
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.