Rework #1
Hi @typohnebild
Thanks! And maybe a multithreaded version will be added too. Would you mind updating the README and charts when all changes are ready?
This is a great job you are doing! Thanks a lot! Of course. We will rerun the benchmarks when everything is done and update the README. Just let us know when you are finished.
done
The default sweep algorithm changes to The
Thanks a lot! We are going to merge this, then start a new run, and afterwards we will update the README!
Changelog:
- Rework `sweep_slice` to access memory only once, like the Python code. I didn't run a benchmark, but I expect it to be a few times faster now.
- Add `fastmath` to the kernel-like API and `pragma(inline, false)` for some API. The latter is required to make the former work well.
- Replace `/` with `*` where possible (it is fine for this kind of numeric code).
- Use `mir.*` alternatives if possible.
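
The changelog item about replacing `/` with `*` can be illustrated with a minimal sketch. It is written in plain Python rather than taken from the PR, and the function names `scale_div` and `scale_mul` and the grid spacing `h` are hypothetical. The idea is to compute the reciprocal once outside the loop and multiply inside it. The two results are not guaranteed to be bit-identical, but they agree to within floating-point rounding, which is presumably why the change is described as fine for this kind of numeric code.

```python
import math


def scale_div(values, h):
    """Divide every value by h*h (one division per element)."""
    return [v / (h * h) for v in values]


def scale_mul(values, h):
    """Same scaling, but with a single division hoisted out of the loop."""
    inv_h2 = 1.0 / (h * h)  # reciprocal computed once
    return [v * inv_h2 for v in values]


if __name__ == "__main__":
    data = [1.0, 3.0, 7.5, -2.25]
    a = scale_div(data, 0.1)
    b = scale_mul(data, 0.1)
    # The results may differ in the last bit, so compare with a tolerance.
    print(all(math.isclose(x, y, rel_tol=1e-12) for x, y in zip(a, b)))
```

Whether the multiplication is actually faster depends on the compiler and hardware; in compiled code the gain usually comes from division having a higher latency than multiplication.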