Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use NEON intrinsics in software renderer #16753

Merged
merged 5 commits into from
Jan 8, 2023

Conversation

unknownbrackets
Copy link
Collaborator

While clang/gcc seem to do a better job than MSVC on vectorizing, there were still a lot of cases (especially Vec3s) where it wasn't.

This basically changes all the places we used SSE for software rendering to use NEON. Although I checked several places, I didn't carefully verify each case especially around Vec3s. I also didn't validate carefully that other code around these were prodded to use vectors more, as happens with SSE in MSVC.

In my testing, I was only seeing at most 10% improvement from this in FPS, although power management made it a bit hard to tell.

-[Unknown]

@hrydgard hrydgard merged commit 25e41ae into hrydgard:master Jan 8, 2023
@unknownbrackets unknownbrackets deleted the softgpu-neon branch January 8, 2023 12:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants