Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

perf: DH-18300: Improve DataIndex performance. #6585

Merged
merged 24 commits into from
Jan 24, 2025

Conversation

cpwright
Copy link
Contributor

DataIndex, particularly when used for where() filters had missing parallelization opportunities; and would read more data than strictly necessary to satisfy the filter.

@cpwright cpwright self-assigned this Jan 20, 2025
@cpwright cpwright changed the title DH-18300: Improve DataIndex performance. perf: DH-18300: Improve DataIndex performance. Jan 20, 2025
@cpwright cpwright force-pushed the nightly/cpw/DH-18300-main branch 3 times, most recently from e92b0ee to b468cb1 Compare January 20, 2025 22:12
@cpwright cpwright added the ReleaseNotesNeeded Release notes are needed label Jan 20, 2025
DataIndex, particularly when used for where() filters had missing
parallelization opportunities; and would read more data than strictly
necessary to satisfy the filter.

Statistics have been added to various operations, the existing Value class
was not thread safe.  The internal state has been updated to use volatiles
and AtomicLongFieldUpdaters.

The following Configuration properties have been added:
- AbstractColumnSource.usePartialDataIndex
- AbstractColumnSource.useParallelIndexBuild
- QueryTable.useDataIndexForAggregation
- MergedDataIndex.useParallelLazyFetch
@cpwright cpwright force-pushed the nightly/cpw/DH-18300-main branch from f86e402 to a15b998 Compare January 20, 2025 22:42
@cpwright cpwright marked this pull request as ready for review January 21, 2025 15:55
Copy link
Member

@rcaudy rcaudy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ready to approve if performance is good

cpwright and others added 5 commits January 23, 2025 18:27
…dex.java

Co-authored-by: Ryan Caudy <rcaudy@gmail.com>
…tions.java

Co-authored-by: Ryan Caudy <rcaudy@gmail.com>
…tions.java

Co-authored-by: Ryan Caudy <rcaudy@gmail.com>
Copy link
Member

@rcaudy rcaudy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

.

@cpwright cpwright merged commit 5f62c5b into deephaven:main Jan 24, 2025
17 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Jan 24, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants