Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hydrate both sides of sampledCounts in skewJoinWithSmaller #1278

Merged
merged 2 commits into from
May 8, 2015

Conversation

jnievelt
Copy link
Contributor

@jnievelt jnievelt commented May 5, 2015

Resolves #1277

The test fixes here are sufficient to make two tests fail, both of which are fixed with this change.

jnievelt added 2 commits May 5, 2015 13:32
Augment sampledCounts in JoinAlgorithms#skewJoinWithSmaller to avoid dropping items when keys get filtered by sampling
Accumulate the results instead of just computing the append.  Also make input1 smaller to more easily expose sampling issues.
@johnynek
Copy link
Collaborator

johnynek commented May 5, 2015

nice work tracking this down. It is so sad this has been causing subtle errors for years

johnynek added a commit that referenced this pull request May 8, 2015
Hydrate both sides of sampledCounts in skewJoinWithSmaller
@johnynek johnynek merged commit ebb4968 into develop May 8, 2015
@johnynek johnynek deleted the jnievelt/1277 branch May 8, 2015 21:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

skewJoinWithSmaller can drop entries
2 participants