Added Machine-Prime, simplified nearest_prime #84

JASory · 2024-11-29T00:00:08Z

Added the low-memory variant of Machine-Prime (which uses a modified BPSW test) as an optional dependency. The Criterion benchmark shows that it is approximately 16x faster than the existing version. Machine-Prime's other variants are faster still but require more memory, and this is already a memory-intensive library.
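For readers unfamiliar with optional dependencies, here is a minimal sketch of how a primality test could be swapped at compile time behind a Cargo feature. The feature name `fast_test` and the trial-division fallback are hypothetical and are not taken from this PR. The call `machine_prime::is_prime` is assumed to be the entry point of the optional crate.

```rust
/// Fallback used when the optional feature is off: plain trial division.
/// This is a stand-in for illustration, not the crate's existing test.
pub fn is_prime_fallback(n: u64) -> bool {
    if n < 2 {
        return false;
    }
    if n % 2 == 0 {
        return n == 2;
    }
    let mut d = 3u64;
    // `d <= n / d` is equivalent to `d * d <= n` but cannot overflow.
    while d <= n / d {
        if n % d == 0 {
            return false;
        }
        d += 2;
    }
    true
}

/// Compiled only when the hypothetical `fast_test` feature is enabled.
/// Assumes the optional dependency exposes `is_prime(u64) -> bool`.
#[cfg(feature = "fast_test")]
pub fn is_prime(n: u64) -> bool {
    machine_prime::is_prime(n)
}

/// Compiled when the feature is off, so callers see the same signature.
#[cfg(not(feature = "fast_test"))]
pub fn is_prime(n: u64) -> bool {
    is_prime_fallback(n)
}
```

Callers use `is_prime` the same way in both configurations. Only the build flags decide which body is compiled in.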

Machine-Prime is also no-std and usable in const functions.

Simplified the next_prime and previous_prime functions.
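As a rough illustration of what a simple neighbouring-prime search can look like, here is a sketch written as `const fn`s. It is not the code from this PR. The `is_prime` helper is a trial-division stand-in, and the strict "greater than" / "less than" semantics are an assumption.

```rust
/// Stand-in primality test (trial division). Any `const fn(u64) -> bool`
/// could be substituted here.
pub const fn is_prime(n: u64) -> bool {
    if n < 2 {
        return false;
    }
    if n % 2 == 0 {
        return n == 2;
    }
    let mut d = 3;
    // `d <= n / d` avoids overflow of `d * d`.
    while d <= n / d {
        if n % d == 0 {
            return false;
        }
        d += 2;
    }
    true
}

/// Smallest prime strictly greater than `n` (assumed semantics),
/// or `None` if the search would overflow `u64`.
pub const fn next_prime(mut n: u64) -> Option<u64> {
    loop {
        n = match n.checked_add(1) {
            Some(m) => m,
            None => return None,
        };
        if is_prime(n) {
            return Some(n);
        }
    }
}

/// Largest prime strictly less than `n` (assumed semantics),
/// or `None` if there is none.
pub const fn previous_prime(mut n: u64) -> Option<u64> {
    while n > 2 {
        n -= 1;
        if is_prime(n) {
            return Some(n);
        }
    }
    None
}
```

Because everything is a `const fn`, a result can also be computed at compile time, for example `const P: Option<u64> = next_prime(100);`.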

JSorngard · 2024-11-29T11:41:33Z

Very impressive!

The is_prime function gets a ~440% increase in throughput on my machine when the feature is on.

The plot of the execution time distributions is quite funny as well: red is with the feature off and blue is with it on. Quite a bit faster.

JASory · 2024-11-29T12:40:05Z

The difference in the benchmarks is interesting. I consistently get a 16x speedup, but that seems to be because I wrote and optimised it on lower-end hardware. I wonder if the trial division is somehow getting vectorised on your machine. Either way, the worst case is equivalent to only 2.5 Fermat tests, so it will win pretty much regardless of what compiler optimisations are done.

On an i5-5300U processor:

Time: 12.021 ms -> 0.666 ms

Throughput: 0.831 M/s -> 15 M/s
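For context on the "Fermat tests" used as a cost unit above, here is a minimal sketch of a single base-a Fermat test, which is essentially one modular exponentiation. This is an illustration only, not Machine-Prime's implementation. The function names are hypothetical, and the `u128` widening is one simple way to avoid overflow.

```rust
/// Computes `base^exp mod modulus` by square-and-multiply.
/// Intermediate products are widened to `u128` so they cannot overflow.
/// `modulus` must be nonzero.
pub fn mod_pow(base: u64, mut exp: u64, modulus: u64) -> u64 {
    if modulus == 1 {
        return 0;
    }
    let m = modulus as u128;
    let mut result: u128 = 1;
    let mut b = base as u128 % m;
    while exp > 0 {
        if exp & 1 == 1 {
            result = result * b % m;
        }
        b = b * b % m;
        exp >>= 1;
    }
    result as u64
}

/// Base-`a` Fermat test: passes when a^(n-1) ≡ 1 (mod n).
/// Every prime `n` that does not divide `a` passes, but so do some
/// composites (pseudoprimes), which is why a single test is not a proof.
pub fn fermat_test(a: u64, n: u64) -> bool {
    n > 1 && mod_pow(a, n - 1, n) == 1
}
```

For example, 341 = 11 * 31 passes the base-2 test despite being composite, but fails base 3. Stronger combined tests such as BPSW exist to close that gap.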

JSorngard · 2024-11-29T12:50:17Z

Interesting :o
Looking at the assembly on godbolt.org does not reveal any use of vector registers as far as I can tell, but modern processors are such massively complex beasts that there might be a technique I am not aware of that speeds it up a lot on my CPU.

My CPU is a 5800X3D, but I don't think the extra cache should help in this situation, so it must be something else.

Regardless, I appreciate your excellent contribution!

JSorngard reviewed Nov 29, 2024

Cargo.toml (resolved)

JSorngard reviewed Nov 29, 2024

src/search.rs (resolved)

JSorngard changed the base branch from main to machine_prime_integration November 29, 2024 12:30

JSorngard merged commit 53a50a7 into JSorngard:machine_prime_integration Nov 29, 2024
4 of 6 checks passed

JSorngard mentioned this pull request Nov 29, 2024

Machine prime integration #86

Merged

JSorngard added a commit that referenced this pull request Nov 29, 2024

34d401e: Undo typed stride as that introduces a runtime branch. Use suggestion by @JASory in #84

JSorngard added a commit that referenced this pull request Nov 29, 2024

9b6d80d: Undo typed stride as that introduces a runtime branch. Use suggestion by @JASory in #84

JSorngard added a commit that referenced this pull request Nov 29, 2024

6fc4e9f: Undo typed stride as that introduces a runtime branch. Use suggestion by @JASory in #84 (#87)
* Undo typed stride as that introduces a runtime branch. Use suggestion by @JASory in #84
* Add debug assert
