Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Customer-facing Benchmarks #281

Open
stanbrub opened this issue Apr 5, 2024 · 0 comments
Open

Customer-facing Benchmarks #281

stanbrub opened this issue Apr 5, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@stanbrub
Copy link
Collaborator

stanbrub commented Apr 5, 2024

The current benchmarks run nightly are developer-facing single-operation benchmarks. This is good for detecting and narrowing down regression by operation. What we are missing is more real-world benchmarks that test scenarios that users may be actively doing.

The current where benchmarks, for example, run in isolation. All CPU threads are available when those benchmark tests are run. They are extremely fast. But what happens if a where operation must compete with other chained operations or even other where operations running simultaneously from different tables?

Higher level benchmark possibilities:

  • User Scenario: Kafka to Rolling Group to UDFs or built-ins on the vectors, to joins, sorts, etc
  • Wide Boundary Test: A large tree of chained tables, shallow and wide
  • Deep Boundary Test: A large tree of chained tables, narrow and deep
  • Multi Source Test: Run many chains of operations from different sources
  • Cardinality: Run the same cardinality with widely different numbers of keys or vice versa
@stanbrub stanbrub added the enhancement New feature or request label Apr 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant