Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Aggregate partition mode #6938

Closed
wants to merge 26 commits into from
Closed

Aggregate partition mode #6938

wants to merge 26 commits into from

Conversation

Dandandan
Copy link
Contributor

Which issue does this PR close?

Closes #.

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

alamb and others added 25 commits July 10, 2023 12:46
Co-authored-by: Mustafa Akur <106137913+mustafasrepo@users.noreply.github.com>
Co-authored-by: Mustafa Akur <106137913+mustafasrepo@users.noreply.github.com>
Co-authored-by: Mustafa Akur <106137913+mustafasrepo@users.noreply.github.com>
…ter.rs

Co-authored-by: Mustafa Akur <106137913+mustafasrepo@users.noreply.github.com>
…ter.rs

Co-authored-by: Mustafa Akur <106137913+mustafasrepo@users.noreply.github.com>
@github-actions github-actions bot added physical-expr Physical Expressions core Core DataFusion crate labels Jul 12, 2023
@alamb
Copy link
Contributor

alamb commented Jul 12, 2023

Not sure if this is related to #6937 but I figured I would point it out

@Dandandan
Copy link
Contributor Author

Dandandan commented Jul 12, 2023

@alamb
this is just some WIP on top of #6904 to implement #6892

For this change I'm seeing some positive and some negative changes (timing wise).
Memory wise will likely be better as there is one fewer aggregate (so doesn't need to keep the higher-cardinality states in memory of the partial aggregate), but it seems the RepartitionExec at this point can be slower than the partial aggregate + RepartitionExec 🤔 I think we might have to transfer some of the learnings from the hash aggregate to hash-repartitioning.

@Dandandan
Copy link
Contributor Author

Cleaning to keep list of PRs clean

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
core Core DataFusion crate physical-expr Physical Expressions
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants