feat: Add explicit simd implementation for one pass svd in large action spaces. #4261

zwd-ms · 2022-11-01T22:04:58Z

Add an explicit SIMD implementation using AVX-512 for the one pass svd part in large actions. For now, it only works with quadratic interactions.

By default, VW builds with VW_BUILD_LAS_WITH_SIMD=ON so it includes the SIMD binaries. The compile flag can be turned off, for example when some CI runs on unsupported architectures (or when someone building on really old platforms), but all tests should build and pass on a recent CPU.

To enable vectorization at runtime, pass in --explicit_simd on vw command line. This flag is advisory: it only works if the program detects that the platform actually supports the needed instructions.

Next steps include adding alternative implementations using other extension sets (e.g. AVX2)~~, and adding automatic switching between scalar and simd code paths using CPUID info at runtime, instead of using the command-line flag~~.

vowpalwabbit/core/src/reductions/cb/details/large_action/compute_dot_prod_simd.h

vowpalwabbit/core/src/reductions/cb/cb_explore_adf_large_action_space.cc

olgavrou

A separate CI job should be added that turns on the compile time flag and runs the related tests (like we had for LAS while it was behind a compile time flag). It can run only for linux architectures

zwd-ms · 2022-11-14T15:50:35Z

A separate CI job should be added that turns on the compile time flag and runs the related tests (like we had for LAS while it was behind a compile time flag). It can run only for linux architectures

Good idea. I'm adding a CI in a subsequent PR that handles the runtime detection of supported instructions. Before that's done, the tests in CI won't always pass due to "illegal instructions" when the CI runs on unsupported machines.
Update: after adding the cpu feature detection, the compile flag is set to ON by default, so we can use the existing CI for tests now.

zwd-ms force-pushed the las_svd_simd branch from 5311864 to 358b33b Compare November 10, 2022 02:25

zwd-ms added 14 commits November 10, 2022 19:58

Add avx512 implementation.

8607725

Test with vpopcnt.

a5af88a

Add single-thread benchmarks.

26e9f8f

Refactor compute16.

3d5077e

Test linux ci build.

449d153

Use old loadu intrinsic.

314f471

Test windows build.

0283482

Fix windows build.

118050a

Set Eigen alignment.

de2f01d

Fix macos build.

e22d784

Set use_simd false.

9118114

Set file flag only.

faee5b5

Refactor ao_triplet.

55e5739

Compile linux only.

ddc4e5d

zwd-ms force-pushed the las_svd_simd branch from 358b33b to 55e5739 Compare November 11, 2022 19:51

zwd-ms added 2 commits November 11, 2022 16:19

Add compile flag.

0931542

Hide simd option behind compile flag.

319433e

zwd-ms changed the title ~~[wip] Las svd simd~~ [feat] Add explicit simd implementation for one pass svd in large action spaces. Nov 11, 2022

zwd-ms changed the title ~~[feat] Add explicit simd implementation for one pass svd in large action spaces.~~ feat: Add explicit simd implementation for one pass svd in large action spaces. Nov 11, 2022

zwd-ms marked this pull request as ready for review November 11, 2022 22:27

zwd-ms requested a review from olgavrou November 14, 2022 14:05

olgavrou reviewed Nov 14, 2022

View reviewed changes

vowpalwabbit/core/src/reductions/cb/details/large_action/compute_dot_prod_simd.h Outdated Show resolved Hide resolved

olgavrou reviewed Nov 14, 2022

View reviewed changes

vowpalwabbit/core/src/reductions/cb/cb_explore_adf_large_action_space.cc Show resolved Hide resolved

olgavrou reviewed Nov 14, 2022

View reviewed changes

Remove alternative vpopcnt.

0022f72

zwd-ms added 3 commits November 14, 2022 11:46

Test linux CI.

7cf0b6d

Reset compile flag.

8353373

Detect cpu features.

fe6d6b5

zwd-ms requested a review from lokitoth November 16, 2022 14:53

Merge branch 'master' into las_svd_simd

6425d65

zwd-ms requested a review from jackgerrits November 28, 2022 14:09

zwd-ms added 2 commits November 30, 2022 17:52

Merge branch 'master' into las_svd_simd

f470107

Merge branch 'master' into las_svd_simd

e4b8e2b

olgavrou approved these changes Dec 1, 2022

View reviewed changes

zwd-ms merged commit 3ee9665 into VowpalWabbit:master Dec 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add explicit simd implementation for one pass svd in large action spaces. #4261

feat: Add explicit simd implementation for one pass svd in large action spaces. #4261

zwd-ms commented Nov 1, 2022 •

edited

Loading

olgavrou left a comment •

edited

Loading

zwd-ms commented Nov 14, 2022 •

edited

Loading

feat: Add explicit simd implementation for one pass svd in large action spaces. #4261

feat: Add explicit simd implementation for one pass svd in large action spaces. #4261

Conversation

zwd-ms commented Nov 1, 2022 • edited Loading

olgavrou left a comment • edited Loading

Choose a reason for hiding this comment

zwd-ms commented Nov 14, 2022 • edited Loading

zwd-ms commented Nov 1, 2022 •

edited

Loading

olgavrou left a comment •

edited

Loading

zwd-ms commented Nov 14, 2022 •

edited

Loading