Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[AVX10.2]Optimize codegen for x86 using the new minmax instructions when possible. #109081

Open
DeepakRajendrakumaran opened this issue Oct 21, 2024 · 2 comments
Labels
area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI avx10 Related to the AVX10 architecture
Milestone

Comments

@DeepakRajendrakumaran
Copy link
Contributor

AVX10.2 introduces a set of MINMAX instructions. These will allow us to further optimize the x86 codegen for vector min/max calculations. The current codegen for these instructions look as following : https://www.godbolt.org/z/Kjf7Y1svn

The newly added Avx10.2 instructions are shown below

Image

@dotnet-issue-labeler dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Oct 21, 2024
@dotnet-policy-service dotnet-policy-service bot added the untriaged New issue has not been triaged by the area owner label Oct 21, 2024
Copy link
Contributor

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

@BruceForstall BruceForstall added this to the 10.0.0 milestone Oct 21, 2024
@BruceForstall BruceForstall added avx10 Related to the AVX10 architecture and removed untriaged New issue has not been triaged by the area owner labels Oct 21, 2024
@PavelCibulka
Copy link

We can also add scalar variant of this instructions:
VMINMAXSD
VMINMAXSH
VMINMAXSS

Image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI avx10 Related to the AVX10 architecture
Projects
None yet
Development

No branches or pull requests

3 participants