Use x64 SSE instructions for 128-bit shifts (and other operations?) #2689

Open
cfallin opened this issue Feb 26, 2021 · 0 comments
Labels
cranelift:area:x64 Issues related to x64 codegen

Comments

cfallin (Member) commented Feb 26, 2021

As noted by @abrown in #2682, our 128-bit shift sequences on x86-64 could make use of PSLLDQ and PSRLDQ to do the 128-bit operation in one go, rather than an open-coded combination of 64-bit shifts with conditional moves, etc. It's likely that this would be faster even with moves to the XMM register file. It's possible that there are better SSE alternatives for some of our other operations as well.
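For reference, here is a minimal sketch of what the open-coded approach computes, written as plain Rust rather than actual Cranelift lowering code. The helper names `shl128_halves` and `pslldq_model` are hypothetical, and the shift amount is assumed to be taken modulo 128. One caveat worth keeping in mind: PSLLDQ and PSRLDQ shift the whole XMM register by an immediate number of bytes, so on their own they would only cover constant shift amounts that are multiples of 8; other amounts would presumably need extra instructions to handle the remaining bits.

```rust
/// Open-coded 128-bit left shift on a value split into two 64-bit halves.
/// Hypothetical helper for illustration only; not Cranelift's lowering.
fn shl128_halves(lo: u64, hi: u64, amt: u32) -> (u64, u64) {
    // Assumption: the shift amount wraps modulo 128.
    let amt = amt & 127;
    if amt == 0 {
        (lo, hi)
    } else if amt < 64 {
        // Bits shifted out of the low half carry into the high half.
        (lo << amt, (hi << amt) | (lo >> (64 - amt)))
    } else {
        // The low half moves entirely into the high half.
        (0, lo << (amt - 64))
    }
}

/// Software model of PSLLDQ: shift a 128-bit value left by `bytes` bytes.
/// Byte counts above 15 produce zero, matching the instruction's behavior.
fn pslldq_model(x: u128, bytes: u32) -> u128 {
    if bytes > 15 { 0 } else { x << (8 * bytes) }
}

fn main() {
    let x: u128 = 0x0123_4567_89ab_cdef_fedc_ba98_7654_3210;

    // The two-half sequence agrees with a native u128 shift.
    for amt in [0u32, 1, 8, 63, 64, 65, 127] {
        let (lo, hi) = shl128_halves(x as u64, (x >> 64) as u64, amt);
        let got = ((hi as u128) << 64) | (lo as u128);
        assert_eq!(got, x << amt);
    }

    // A byte-granular shift only matches bit shifts that are multiples of 8.
    assert_eq!(pslldq_model(x, 3), x << 24);
    println!("ok");
}
```

The branches in `shl128_halves` correspond to the conditional moves mentioned above; a real lowering would select between the two cases without branching.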

cfallin added the cranelift:area:x64 (Issues related to x64 codegen) label Feb 26, 2021