Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix query benchmarks overflow #6806

Merged
merged 1 commit into from
Jul 8, 2024
Merged

Fix query benchmarks overflow #6806

merged 1 commit into from
Jul 8, 2024

Conversation

teh-cmc
Copy link
Member

@teh-cmc teh-cmc commented Jul 8, 2024

These benchmarks used to be DataRow-based, i.e. the rows didn't share any memory.

Now that these are Chunk-based, the memory for all strings is shared, and therefore cannot be more than 4gig total.

Checklist

  • I have read and agree to Contributor Guide and the Code of Conduct
  • I've included a screenshot or gif (if applicable)
  • I have tested the web demo (if applicable):
  • The PR title and labels are set such as to maximize their usefulness for the next release's CHANGELOG
  • If applicable, add a new check to the release checklist!

To run all checks from main, comment on the PR with @rerun-bot full-check.

@teh-cmc teh-cmc added 🔨 testing testing and benchmarks ⛴ release Related to shipping or publishing exclude from changelog PRs with this won't show up in CHANGELOG.md labels Jul 8, 2024
@teh-cmc teh-cmc merged commit 59b8f9f into main Jul 8, 2024
26 of 31 checks passed
@teh-cmc teh-cmc deleted the cmc/fix_benches2 branch July 8, 2024 11:28
@@ -190,7 +190,7 @@ pub fn build_some_strings(len: usize) -> Vec<Text> {

(0..len)
.map(|_| {
let ilen: usize = rng.gen_range(0..10000);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So we will see a 100x improvement in benchmark numbers now?

Copy link
Member Author

@teh-cmc teh-cmc Jul 8, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🤷 It was already not comparable with the old one anyway (and this version never ran successfully).

teh-cmc added a commit that referenced this pull request Jul 8, 2024
These benchmarks used to be `DataRow`-based, i.e. the rows didn't share
any memory.

Now that these are `Chunk`-based, the memory for all strings is shared,
and therefore cannot be more than 4gig total.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
exclude from changelog PRs with this won't show up in CHANGELOG.md ⛴ release Related to shipping or publishing 🔨 testing testing and benchmarks
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants