Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wave Metrics #10679

Closed
wants to merge 1 commit into from
Closed

Wave Metrics #10679

wants to merge 1 commit into from

Conversation

oerling
Copy link
Contributor

@oerling oerling commented Aug 7, 2024

  • Make multithreaded memcpy for staging transfers for GPU table scan.
  • Make variants of bit unpacking in GpuDecoder-inl.cuh. Make selective decoding templatized as opposed to runtime switching.
  • Add pieces to GpuDecoderTest, like comparing calling via launchDecode or (multi-function blocks) or decodeGlobal (single function thread blocks).
  • Add a metric for driver thread waiting for first continuable stream.
  • Check approx correctness of Wave runtimeStats.
  • Refactor QueryBenchmarkBase.* from TpchBenchmark. Logic to do sweeps across parameter combinations.
  • Add persistent file format to Wave mock format.
  • Add benchmark for scan, filter, filter expr, projection. aggregation combinations with Wave and Dwrf.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 7, 2024
Copy link

netlify bot commented Aug 7, 2024

Deploy Preview for meta-velox canceled.

Name Link
🔨 Latest commit e04c34d
🔍 Latest deploy log https://app.netlify.com/sites/meta-velox/deploys/66c22ba45d40040008d2a6a7

@facebook-github-bot
Copy link
Contributor

@oerling has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@oerling has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D60880466

oerling pushed a commit to oerling/velox-1 that referenced this pull request Aug 7, 2024
Summary:
- Make multithreaded memcpy for staging transfers for GPU table scan.
- Make variants of bit unpacking in GpuDecoder-inl.cuh. Make selective decoding templatized as opposed to runtime switching.
- Add pieces to GpuDecoderTest, like comparing calling via launchDecode or (multi-function blocks) or decodeGlobal (single function thread blocks).
- Add a metric for driver thread waiting for first continuable stream.
- Check approx correctness of Wave runtimeStats.
- Refactor QueryBenchmarkBase.* from TpchBenchmark. Logic to do sweeps across parameter combinations.
- Add persistent file format to Wave mock format.
- Add benchmark for scan, filter, filter expr, projection. aggregation combinations with Wave and Dwrf.

Pull Request resolved: facebookincubator#10679

Differential Revision: D60880466
@oerling oerling force-pushed the wavemtr-pr branch 2 times, most recently from 1a2f4a5 to 4a20f55 Compare August 8, 2024 06:23
@facebook-github-bot
Copy link
Contributor

@oerling has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D60880466

oerling pushed a commit to oerling/velox-1 that referenced this pull request Aug 8, 2024
Summary:
- Make multithreaded memcpy for staging transfers for GPU table scan.
- Make variants of bit unpacking in GpuDecoder-inl.cuh. Make selective decoding templatized as opposed to runtime switching.
- Add pieces to GpuDecoderTest, like comparing calling via launchDecode or (multi-function blocks) or decodeGlobal (single function thread blocks).
- Add a metric for driver thread waiting for first continuable stream.
- Check approx correctness of Wave runtimeStats.
- Refactor QueryBenchmarkBase.* from TpchBenchmark. Logic to do sweeps across parameter combinations.
- Add persistent file format to Wave mock format.
- Add benchmark for scan, filter, filter expr, projection. aggregation combinations with Wave and Dwrf.

Pull Request resolved: facebookincubator#10679

Differential Revision: D60880466

Pulled By: oerling
@facebook-github-bot
Copy link
Contributor

@oerling has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot
Copy link
Contributor

@oerling has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

Summary:
- Make multithreaded memcpy for staging transfers for GPU table scan.
- Make variants of bit unpacking in GpuDecoder-inl.cuh. Make selective decoding templatized as opposed to runtime switching.
- Add pieces to GpuDecoderTest, like comparing calling via launchDecode or (multi-function blocks) or decodeGlobal (single function thread blocks).
- Add a metric for driver thread waiting for first continuable stream.
- Check approx correctness of Wave runtimeStats.
- Refactor QueryBenchmarkBase.* from TpchBenchmark. Logic to do sweeps across parameter combinations.
- Add persistent file format to Wave mock format.
- Add benchmark for scan, filter, filter expr, projection. aggregation combinations with Wave and Dwrf.

Pull Request resolved: facebookincubator#10679

Differential Revision: D60880466

Pulled By: oerling
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D60880466

@facebook-github-bot
Copy link
Contributor

@oerling merged this pull request in be52988.

Copy link

Conbench analyzed the 1 benchmark run on commit be529880.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details.

Comment on lines -53 to -55
GTest::gtest
GTest::gtest_main
GTest::gmock
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please use the properly name spaced targets.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have fixed it in #10732

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants