You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As noted by @abrown in #2682, our 128-bit shift sequences on x86-64 could make use of PSLLDQ and PSRLDQ to do the 128-bit operation in one go, rather than an open-coded combination of 64-bit shifts with conditional moves, etc. It's likely that this would be faster even with moves to the XMM register file. It's possible that there are better SSE alternatives for some of our other operations as well.
The text was updated successfully, but these errors were encountered:
As noted by @abrown in #2682, our 128-bit shift sequences on x86-64 could make use of
PSLLDQ
andPSRLDQ
to do the 128-bit operation in one go, rather than an open-coded combination of 64-bit shifts with conditional moves, etc. It's likely that this would be faster even with moves to the XMM register file. It's possible that there are better SSE alternatives for some of our other operations as well.The text was updated successfully, but these errors were encountered: