Add `num_iterations` axis to the multi-threaded Parquet benchmarks #17231

vuule · 2024-11-01T00:16:34Z

Description

Added an axis that controls the number of times each thread reads its input. Running with a higher number of iterations should better show how work from different threads pipelines.
The new axis, "num_iterations", is added to all multi-threaded Parquet reader benchmarks.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

vuule · 2024-11-01T16:53:06Z

cpp/benchmarks/io/parquet/parquet_reader_multithread.cpp

@@ -109,12 +111,15 @@ void BM_parquet_multithreaded_read_common(nvbench::state& state,

  nvtxRangePushA(("(read) " + label).c_str());
  state.exec(nvbench::exec_tag::sync | nvbench::exec_tag::timer,
-             [&](nvbench::launch& launch, auto& timer) {
+             [&, num_files = num_files](nvbench::launch& launch, auto& timer) {


Unrelated: we used to capture a structured binding variable in lambdas, which is not supported in C++17.

Thanks, I wasn't aware of that!

vuule · 2024-11-01T16:57:29Z

CC @GregoryKimball

pmattione-nvidia · 2024-11-01T17:35:49Z

cpp/benchmarks/io/parquet/parquet_reader_multithread.cpp

@@ -267,6 +278,7 @@ NVBENCH_BENCH(BM_parquet_multithreaded_read_mixed)
  .add_int64_axis("cardinality", {1000})
  .add_int64_axis("total_data_size", {512 * 1024 * 1024, 1024 * 1024 * 1024})
  .add_int64_axis("num_threads", {1, 2, 4, 8})
+  .add_int64_axis("num_iterations", {1})


What values do we want to use here? Are the results interesting in comparing 1 to e.g. 8?

I'm not seeing very interesting results on my system (checked 1, 2, 4 and 8), but that might be because of low H2D/D2H transfer rates on it.
I added the axis so we can easily change the number of iterations when we're looking into pipelining. So, for now, I think we should stick to a single value to keep the number of benchmarks from growing.
I'm fine with defaulting to a larger value. Thoughts? CC @GregoryKimball

vuule · 2024-11-02T02:50:50Z

/merge

add axis

82d09c2

vuule added tests Unit testing for project cuIO cuIO issue improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Nov 1, 2024

vuule self-assigned this Nov 1, 2024

github-actions bot added the libcudf Affects libcudf (C++/CUDA) code. label Nov 1, 2024

vuule requested a review from mhaseeb123 November 1, 2024 16:47

vuule commented Nov 1, 2024

View reviewed changes

vuule marked this pull request as ready for review November 1, 2024 16:56

vuule requested a review from a team as a code owner November 1, 2024 16:56

vuule requested a review from pmattione-nvidia November 1, 2024 16:56

mhaseeb123 approved these changes Nov 1, 2024

View reviewed changes

pmattione-nvidia reviewed Nov 1, 2024

View reviewed changes

pmattione-nvidia approved these changes Nov 1, 2024

View reviewed changes

rapids-bot bot merged commit 3d07509 into rapidsai:branch-24.12 Nov 2, 2024
132 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `num_iterations` axis to the multi-threaded Parquet benchmarks #17231

Add `num_iterations` axis to the multi-threaded Parquet benchmarks #17231

vuule commented Nov 1, 2024 •

edited

Loading

vuule Nov 1, 2024

mhaseeb123 Nov 1, 2024

vuule commented Nov 1, 2024

pmattione-nvidia Nov 1, 2024

vuule Nov 1, 2024

vuule commented Nov 2, 2024

Add num_iterations axis to the multi-threaded Parquet benchmarks #17231

Add num_iterations axis to the multi-threaded Parquet benchmarks #17231

Conversation

vuule commented Nov 1, 2024 • edited Loading

Description

Checklist

vuule Nov 1, 2024

Choose a reason for hiding this comment

mhaseeb123 Nov 1, 2024

Choose a reason for hiding this comment

vuule commented Nov 1, 2024

pmattione-nvidia Nov 1, 2024

Choose a reason for hiding this comment

vuule Nov 1, 2024

Choose a reason for hiding this comment

vuule commented Nov 2, 2024

Add `num_iterations` axis to the multi-threaded Parquet benchmarks #17231

Add `num_iterations` axis to the multi-threaded Parquet benchmarks #17231

vuule commented Nov 1, 2024 •

edited

Loading