Perf. regression of `min` over `packed_simd` #222

jorgecarleitao · 2022-01-13T18:18:11Z

There is a 25% perf regression when converting packed_simd to core_simd.

The implementations here the same mutatis mutantis, but their performances are not:

core_simd_min 2^20 f32     [286.86 us 289.22 us 292.03 us]
packed_simd_min 2^20 f32   [230.50 us 234.12 us 238.86 us]
nonsimd_min 2^20 f32       [245.75 us 249.19 us 254.00 us]
naive_min 2^20 f32         [2.8560 ms 2.8721 ms 2.8885 ms]

In particular, it seems more efficient to write the code without std::simd (nonsimd_min) than with it (core_simd_min).

with target-cpu=native:

core_simd_min 2^20 f32     [376.98 us 378.40 us 379.72 us]
packed_simd_min 2^20 f32   [181.77 us 182.95 us 185.05 us]
nonsimd_min 2^20 f32       [185.89 us 186.35 us 186.83 us]
naive_min 2^20 f32         [2.0208 ms 2.0274 ms 2.0341 ms]

which is an even larger difference

Context

We are considering migrating from packed_simd to std::simd and observed this regression on our benchmarks. jorgecarleitao/arrow2#747 for details.

The text was updated successfully, but these errors were encountered:

bjorn3 · 2022-01-13T18:32:09Z

This is probably #193.

jorgecarleitao · 2022-01-13T20:06:38Z

The conclusion on zulip is that we are not yet using intrinsics for min

jorgecarleitao mentioned this issue Jan 13, 2022

Migrated to portable simd jorgecarleitao/arrow2#747

Merged

calebzulawski mentioned this issue Jan 13, 2022

Use intrinsic for min/max #224

Merged

calebzulawski closed this as completed in #224 Jan 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Perf. regression of `min` over `packed_simd` #222

Perf. regression of `min` over `packed_simd` #222

jorgecarleitao commented Jan 13, 2022 •

edited

Loading

bjorn3 commented Jan 13, 2022

jorgecarleitao commented Jan 13, 2022

Perf. regression of min over packed_simd #222

Perf. regression of min over packed_simd #222

Comments

jorgecarleitao commented Jan 13, 2022 • edited Loading

Context

bjorn3 commented Jan 13, 2022

jorgecarleitao commented Jan 13, 2022

Perf. regression of `min` over `packed_simd` #222

Perf. regression of `min` over `packed_simd` #222

jorgecarleitao commented Jan 13, 2022 •

edited

Loading