Add Median of Medians fallback to introselect #107522

Sp00ph · 2023-01-31T16:55:22Z

This PR is a follow up to #106997. It adds a Fast Deterministic Selection implementation as a fallback to the introselect algorithm used by select_nth_unstable. This allows it to guarantee O(n) worst case running time, while maintaining good performance in all cases.

This would fix #102451, which was opened because the select_nth_unstable docs falsely claimed that it had O(n) worst case performance, even though it was actually quadratic in the worst case. #106997 improved the worst case complexity to O(n log n) by using heapsort as a fallback, and this PR further improves it to O(n) (this would also make #106933 unnecessary).
It also improves the actual runtime if the fallback gets called: Using a pathological input of size 1 << 19 (see the playground link in #102451), calculating the median is roughly 3x faster using fast deterministic selection as a fallback than it is using heapsort.

The downside to this is less code reuse between the sorting and selection algorithms, but I don't think it's that bad. The additional algorithms are ~250 LOC with no unsafe blocks (I tried using unsafe to avoid bounds checks but it didn't noticeably improve the performance).
I also let it fuzz for a while against the current select_nth_unstable implementation to ensure correctness, and it seems to still fulfill all the necessary postconditions.

cc @scottmcm who reviewed #106997

rustbot · 2023-01-31T16:55:31Z

r? @cuviper

(rustbot has picked a reviewer for you, use r? to override)

rustbot · 2023-01-31T16:55:34Z

Hey! It looks like you've submitted a new PR for the library teams!

If this PR contains changes to any rust-lang/rust public library APIs then please comment with @rustbot label +T-libs-api -T-libs to tag it appropriately. If this PR contains changes to any unstable APIs please edit the PR description to add a link to the relevant API Change Proposal or create one if you haven't already. If you're unsure where your change falls no worries, just leave it as is and the reviewer will take a look and make a decision to forward on if necessary.

Examples of T-libs-api changes:

Stabilizing library features
Introducing insta-stable changes such as new implementations of existing stable traits on existing stable types
Introducing new or changing existing unstable library APIs (excluding permanently unstable features / features without a tracking issue)
Changing public documentation in ways that create new stability guarantees
Changing observable runtime behavior of library APIs

Sp00ph · 2023-01-31T17:10:27Z

Forgot to mention, on random input the fast deterministic select is a bit slower than the pdq based quickselect, so I didn't make select_nth_unstable always call the new algorithm and instead just used it it the introselect fallback.

scottmcm · 2023-01-31T20:28:21Z

Ahh, my apologies, I read cc as r? 🤦

Putting it back,
r? @cuviper

Sp00ph · 2023-02-01T22:22:21Z

I tried also implementing the ExpandPartition algorithm from the paper that takes into account which region of the slice we already know to be partitioned, but it doesn't really lead to noticeable speedups. I think it's better to just use the already existing partition (like I did in this PR), as that already seems pretty optimized.

bors · 2023-02-13T07:49:57Z

☔ The latest upstream changes (presumably #107191) made this pull request unmergeable. Please resolve the merge conflicts.

Sp00ph · 2023-02-20T14:45:22Z

Hmm, now that #106933 was merged, the API guarantees have been loosened to O(n log n) worst case runtime. I'm guessing we'd need an ACP to get it back down to O(n) worst case (and it's unclear to me if we'd even want to guarantee O(n)). Should I add note in the /// ## Current implementation section that states that the current algorithm (from this PR) always runs in linear time?

orlp · 2023-03-05T17:16:15Z

So first and foremost, the PR in its current form does not make the behavior linear, because the limit is set to log(n), so in the worst case we're still O(n log n).

Secondly, I think having the full Fast Deterministic Selection algorithm as a fallback algorithm does not make much sense. Either it should be run from start to finish, or we should reduce it to just the median_of_ninthers approach for its linearity. The fallback algorithm only really gets triggered by an incredibly unlucky input pattern or malicious input, so it does not make sense to have a lot of code baked into the executable to run our fallback approach at maximum speed.

Amanieu · 2023-03-05T16:47:20Z

library/core/src/slice/select.rs

+
+/// helper function used to find the index of the min/max element
+/// using e.g. `slice.iter().enumerate().min_by(from_is_less(&mut is_less)).unwrap()`
+fn from_is_less<T>(


It seems like the code could be made more readable by just having min_index and max_index helper functions that return an Option<usize>.

It seems like the code could be made more readable by just having min_index and max_index helper functions that return an Option<usize>.

I don't think this has been addressed yet.

Yeah I kinda forgot about it, should be fixed now though.

Sp00ph · 2023-03-05T17:34:20Z

So first and foremost, the PR in its current form does not make the behavior linear, because the limit is set to log(n), so in the worst case we're still O(n log n).

I had naively thought that the same limiting strategy used for introsort would also work for introselect, but yeah makes sense that it doesn't. Does this mean that to guarantee linear runtime we need a constant limit on the iterations?

Secondly, I think having the full Fast Deterministic Selection algorithm as a fallback algorithm does not make much sense. Either it should be run from start to finish, or we should reduce it to just the median_of_ninthers approach for its linearity. The fallback algorithm only really gets triggered by an incredibly unlucky input pattern or malicious input, so it does not make sense to have a lot of code baked into the executable to run our fallback approach at maximum speed.

In my tests, just using fast deterministic selection unconditionally was a bit slower than the current median-of-3 quickselect on random inputs, so I didn't want to do that. I also tried to just use median of medians first, but that was slower than using heapsort as a fallback. I haven't tried just using median of ninthers yet, I'll have to see if that still gets similar speedups to the full fast deterministic select.

Sp00ph · 2023-03-05T17:40:36Z

I suppose it could do something like let limit = cmp::min(ilog2(len), SOME_UPPER_BOUND) which would still mean O(1) iterations. Would probably have to figure out a good value for that cutoff point though.

orlp · 2023-03-05T18:17:01Z

I suppose it could do something like let limit = cmp::min(ilog2(len), SOME_UPPER_BOUND) which would still mean O(1) iterations. Would probably have to figure out a good value for that cutoff point though.

I think you should do what I did in pdqsort originally: track whether the partition was 'good' or 'bad' by some threshold (I used n / 8 = 12.5% quantile for the pivot position as the cutoff), and then and only then increment the counter. This way you can put the counter relatively low without affecting performance for normal code while quickly switching to the fallback for malicious/unlucky inputs.

workingjubilee · 2023-03-06T03:59:59Z

In my tests, just using fast deterministic selection unconditionally was a bit slower than the current median-of-3 quickselect on random inputs, so I didn't want to do that.

Performance on perfectly random inputs is slightly overrated. What about nonrandom inputs, like tokenized data from Wikipedia or such?

Sp00ph · 2023-03-06T16:12:52Z

I currently don't have access to a powerful machine, but I can try to set up a somewhat comprehensive benchmark suite once I do to compare heapsort fallback vs median-of-ninthers fallback vs just fast deterministic select.

cuviper · 2023-03-23T20:43:41Z

Sorry, I haven't had time to dig into this - let me roll a new reviewer...

r? libs

Sp00ph · 2023-03-23T20:45:16Z

I still need to change some stuff about the PR so

@rustbot author

Dylan-DPC · 2023-05-19T09:13:11Z

@Sp00ph any updates on this?

Sp00ph · 2023-05-21T21:30:30Z

Not yet, I'll see if I can get around to it some time soon.

Sp00ph · 2023-05-24T00:46:49Z

I wrote a little benchmark suite testing the current selection implementation, quickselect with median of medians as fallback and pure fast deterministic select on various slice lengths and input shapes and quickselect with median of medians fallback was consistently the fastest, so I changed it to that now. If anyone is interested I can upload the benchmark itself too. The algorithm should now also actually be O(n) worst case since I made the quickselect iteration limit constant now.

Sp00ph · 2023-05-24T01:47:25Z

@rustbot ready

Amanieu · 2023-05-25T19:52:06Z

@bors r+

bors · 2023-05-25T19:52:08Z

📌 Commit fd5fa01 has been approved by Amanieu

It is now in the queue for this repository.

Sp00ph · 2023-05-25T21:18:46Z

I forgot to ask: As of #106933, select_nth_unstable is documented to be O(n log n) in the worst case (it was falsely documented as O(n) before). I'm assuming that reverting the time complexity to O(n) would count as an API change and require an ACP? Should I maybe still say in the /// # Current implementation section of select_nth_unstable that it currently is in fact always O(n)?

Amanieu · 2023-05-25T21:49:10Z

This can be submitted as a separate PR which we will run an FCP on, since this is a new stable guarantee. No ACP is required.

…mpiler-errors Rollup of 7 pull requests Successful merges: - rust-lang#107522 (Add Median of Medians fallback to introselect) - rust-lang#111152 (update `pulldown-cmark` to `0.9.3`) - rust-lang#111757 (Consider lint check attributes on match arms) - rust-lang#111831 (Always capture slice when pattern requires checking the length) - rust-lang#111929 (Don't print newlines in APITs) - rust-lang#111945 (Migrate GUI colors test to original CSS color format) - rust-lang#111950 (Remove ExpnKind::Inlined.) r? `@ghost` `@rustbot` modify labels: rollup

Update runtime guarantee for `select_nth_unstable` rust-lang#106933 changed the runtime guarantee for `select_nth_unstable` from O(n) to O(n log n), since the old guarantee wasn't actually met by the implementation at the time. Now with rust-lang#107522, `select_nth_unstable` should be truly linear in runtime, so we can revert its runtime guarantee to O(n). Since rust-lang#106933 was considered a bug fix, this will probably need an FCP because it counts as a new API guarantee. r? `@Amanieu`

rustbot assigned cuviper Jan 31, 2023

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Jan 31, 2023

This comment was marked as outdated.

Sign in to view

rustbot assigned Mark-Simulacrum and unassigned cuviper Jan 31, 2023

rustbot assigned cuviper and unassigned Mark-Simulacrum Jan 31, 2023

Sp00ph force-pushed the introselect branch from 60216ef to 1ab4aad Compare February 5, 2023 14:32

Sp00ph force-pushed the introselect branch 2 times, most recently from 089165c to 9dfe219 Compare February 13, 2023 20:41

Amanieu reviewed Mar 5, 2023

View reviewed changes

rustbot assigned Mark-Simulacrum and unassigned cuviper Mar 23, 2023

rustbot removed the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Mar 23, 2023

rustbot added the S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. label Mar 23, 2023

Add Median of Medians fallback to introselect

3d11b65

rust-cloud-vms bot force-pushed the introselect branch from 9dfe219 to 3d11b65 Compare May 24, 2023 00:41

Sp00ph changed the title ~~Add Fast Deterministic Selection fallback to introselect~~ Add Median of Medians fallback to introselect May 24, 2023

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels May 24, 2023

Amanieu assigned Amanieu and unassigned Mark-Simulacrum May 24, 2023

Use helper functions for min/max_idx

fd5fa01

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 25, 2023

compiler-errors mentioned this pull request May 25, 2023

Rollup of 7 pull requests #111960

Merged

bors merged commit fb45513 into rust-lang:master May 26, 2023

rustbot added this to the 1.71.0 milestone May 26, 2023

Sp00ph deleted the introselect branch May 26, 2023 01:13

Sp00ph mentioned this pull request May 26, 2023

Update runtime guarantee for select_nth_unstable #111974

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Median of Medians fallback to introselect #107522

Add Median of Medians fallback to introselect #107522

Sp00ph commented Jan 31, 2023

rustbot commented Jan 31, 2023

rustbot commented Jan 31, 2023

Sp00ph commented Jan 31, 2023

This comment was marked as outdated.

scottmcm commented Jan 31, 2023

Sp00ph commented Feb 1, 2023

bors commented Feb 13, 2023

Sp00ph commented Feb 20, 2023

orlp commented Mar 5, 2023

Amanieu Mar 5, 2023

Amanieu May 24, 2023

Sp00ph May 24, 2023

Sp00ph commented Mar 5, 2023

Sp00ph commented Mar 5, 2023

orlp commented Mar 5, 2023

workingjubilee commented Mar 6, 2023 •

edited

Loading

Sp00ph commented Mar 6, 2023

cuviper commented Mar 23, 2023

Sp00ph commented Mar 23, 2023

Dylan-DPC commented May 19, 2023

Sp00ph commented May 21, 2023

Sp00ph commented May 24, 2023

Sp00ph commented May 24, 2023

Amanieu commented May 25, 2023

bors commented May 25, 2023

Sp00ph commented May 25, 2023

Amanieu commented May 25, 2023

Add Median of Medians fallback to introselect #107522

Add Median of Medians fallback to introselect #107522

Conversation

Sp00ph commented Jan 31, 2023

rustbot commented Jan 31, 2023

rustbot commented Jan 31, 2023

Sp00ph commented Jan 31, 2023

This comment was marked as outdated.

scottmcm commented Jan 31, 2023

Sp00ph commented Feb 1, 2023

bors commented Feb 13, 2023

Sp00ph commented Feb 20, 2023

orlp commented Mar 5, 2023

Amanieu Mar 5, 2023

Choose a reason for hiding this comment

Amanieu May 24, 2023

Choose a reason for hiding this comment

Sp00ph May 24, 2023

Choose a reason for hiding this comment

Sp00ph commented Mar 5, 2023

Sp00ph commented Mar 5, 2023

orlp commented Mar 5, 2023

workingjubilee commented Mar 6, 2023 • edited Loading

Sp00ph commented Mar 6, 2023

cuviper commented Mar 23, 2023

Sp00ph commented Mar 23, 2023

Dylan-DPC commented May 19, 2023

Sp00ph commented May 21, 2023

Sp00ph commented May 24, 2023

Sp00ph commented May 24, 2023

Amanieu commented May 25, 2023

bors commented May 25, 2023

Sp00ph commented May 25, 2023

Amanieu commented May 25, 2023

workingjubilee commented Mar 6, 2023 •

edited

Loading