Use a ad hoc trie to avoid slow cases #6283

Eh2406 · 2018-11-08T04:34:50Z

This adds a test for the pure case described here. When added, this test will time out in debug (~20 sec release) with an N of 3.

Looking with a profiler, it is spending most of its time in Vec.contains. Keeping a deduplicated list with contains is O(n^2), so let's change things to be Ord so we can use a btree to deduplicate.

Now the profiler claimed that we are spending our time searching that Vec. So time to work on that "vaporware", the simplest thing I could think of that lets us check for is_active in better then O(n) is nested hashmaps. Each hashmap has keys of a shared Id, and values of hashmaps representing all the conflicts that use that Id. When searching if the key is not active then we need not look at any of its descendants.

There are lots of ways to make this better but even this much gets the test to pass with N of 3. So maybe we can justify the improvements with N of 4? No, eavan in debug N of 4 hits the 50k limit, so the that is as far as we need to go on the conflict_cache.

rust-highfive · 2018-11-08T04:35:01Z

r? @alexcrichton

(rust_highfive has picked a reviewer for you, use r? to override)

Eh2406 · 2018-11-08T15:22:55Z

I forgot that I used NLL to test this. I am not seeing how to get it to work without NLL. Thoughts?

Eh2406 · 2018-11-08T15:46:10Z

Figured out how to remove the reliance on NLL!

alexcrichton

Ok I think I roughly undestand what's going on here. I don't have a perfect map in my head for how it's functionally equivalent to the before, but you may have some more comments to add as well!

src/cargo/core/resolver/conflict_cache.rs

Eh2406 · 2018-11-09T16:41:01Z

Instead of responding in thread, I pushed a commit that added comments to address each of your excellent questions.

On a more general note this is not a heavily optimized and carefully chosen algorithm, it is much closer to the first thing I could think of. I spent some time looking through the data structures I had heard of to see if there is an existing literature, but did not find any good search terms. You menchan "set of sets", do you have links about data structures for that? If anyone has references I am happy to look into them!

alexcrichton

Looking good to me! Realizing now that this is a trie, perhaps it could be renamed to ConflictStoreTrie?

alexcrichton · 2018-11-09T17:20:30Z

src/cargo/core/resolver/conflict_cache.rs

+                    // will be conflicting.
+                    if cx.is_active(pid) {
+                        if let Some(o) = store.find_conflicting(cx, filter) {
+                            debug_assert!(cx.is_conflicting(None, o));


Would this debug assert bring back the N^2 behavior? In any case it's probably fine to move this debug assert above to if filter(&c), right?

oops, yes moved it. It should not be too expensive when it is in the correct place.

alexcrichton · 2018-11-09T17:21:05Z

src/cargo/core/resolver/conflict_cache.rs


 use super::types::ConflictReason;
 use core::resolver::Context;
 use core::{Dependency, PackageId};

+/// This is a data structure for storing a large number of Sets designed to
+/// efficiently see if any of the stored Sets are a subset of a search Set.
+enum ConflictStore {


Thinking more about this, I actually think this is basically a trie

src/cargo/core/resolver/conflict_cache.rs

Eh2406 · 2018-11-09T18:25:20Z

I again followed up with a commit.

alexcrichton · 2018-11-09T18:45:19Z

@bors: r+

💯

bors · 2018-11-09T18:45:20Z

📌 Commit 178ee0b has been approved by alexcrichton

bors · 2018-11-09T20:54:50Z

⌛ Testing commit 178ee0b with merge 1292a6d...

Use a ad hoc trie to avoid slow cases This adds a test for the pure case described [here](#6258 (comment)). When added, this test will time out in debug (~20 sec release) with an `N` of 3. Looking with a profiler, it is spending most of its time in [`Vec.contains`](https://github.com/rust-lang/cargo/blob/d08fd375db0014f097ee3fbdc1c595b632669f6a/src/cargo/core/resolver/conflict_cache.rs#L84). Keeping a deduplicated list with contains is `O(n^2)`, so let's change things to be `Ord` so we can use a btree to deduplicate. Now the profiler claimed that we are spending our time [searching that `Vec`](https://github.com/rust-lang/cargo/blob/d08fd375db0014f097ee3fbdc1c595b632669f6a/src/cargo/core/resolver/conflict_cache.rs#L66). So time to work on that "vaporware", the simplest thing I could think of that lets us check for is_active in better then `O(n)` is nested hashmaps. Each hashmap has keys of a shared Id, and values of hashmaps representing all the conflicts that use that Id. When searching if the key is not active then we need not look at any of its descendants. There are lots of ways to make this better but even this much gets the test to pass with `N` of 3. So maybe we can justify the improvements with `N` of 4? No, eavan in debug `N` of 4 hits the [50k limit](https://github.com/rust-lang/cargo/blob/d08fd375db0014f097ee3fbdc1c595b632669f6a/src/cargo/core/resolver/types.rs#L55), so the that is as far as we need to go on the conflict_cache.

bors · 2018-11-09T22:17:31Z

☀️ Test successful - status-appveyor, status-travis
Approved by: alexcrichton
Pushing 1292a6d to master...

Eh2406 · 2018-11-30T22:29:10Z

I was seeing some non-determinism on the run time of some tests, and trying to figure out where it was coming from. I think it is from here. ConflictStoreTrie.find_conflicting returns the first thing it finds. That is ok, as long as the order is deterministic, but I think it is in the iter order of a HashMap which is not.

I do not have proof that this can lead to different lock files, but strongly suspect it.

Eh2406 · 2018-12-01T16:37:56Z

On discord @alexcrichton clarified that the resolver does not have to be deterministic. In most use cases the index will change between any two invocations anyway.

@alexcrichton

ConflictStoreTrie: Faster filtered search This is an optimization that I was thinking of doing in #6283. I did not then as this is optimizing a not particularly hot path. I wish to do it now as it is stuck in my head, and I need that head space for more important things. This also "accidentally" fixes the [indeterminacy](#6283 (comment)) introduced in #6283, by replacing the `HashMap.iter().find()` like code with `BTreeMap.iter().find()` like code. This is not strictly needed, as @alexcrichton pointed out (In most use cases the index will change between any two invocations anyway), but does make it much easier to deal with (fuzz) test cases.

@alexcrichton

Resolver: A dep is equivalent to one of the things it can resolve to. This is a series of small changes to the resolver, each one on its own is not worth the cherne, but somehow all together we can add a new optimization rule. The result is that the test in #6283 is no longer exponencial (still a large polynomial, cubick?) and so N can be bumped from 3 to 20. This also means that we pass with all the cases reported in #6258. Resolution is NP-Hard, so we are moving the slow cases around. To reduce the chance that we will be flooded by new bad cases I run the 4 proptests overnight, and they did not find a new exponencial case. I would recommend reviewing this commit by commit. As each change is pretty simple on its own, but the mixed diff is harder to follow. This is submitted as one big PR as that is @alexcrichton's preference. A special thanks to @nex3, our conversation was important in convincing me that several of these changes would be needed even in an eventual PubGrub based system. And, the question "why would PubGrub not have a problem with #6283" was wat crystallized this optimization opportunity in my mind.

Eh2406 added 4 commits November 7, 2018 22:03

add a test that defets iter.rev

04e1564

a more pure version of the test

1e90c0a

remove the O(n) insertion, but loos the hack for lookup

e1ff48b

remove the O(n) lookup

4c8ab5e

rust-highfive assigned alexcrichton Nov 8, 2018

remove the use of NLL

53b535f

Eh2406 added A-dependency-resolution Area: dependency resolution and the resolver A-testing-cargo-itself Area: cargo's tests labels Nov 8, 2018

alexcrichton reviewed Nov 9, 2018

View reviewed changes

add comments and pick better names

d070f06

alexcrichton reviewed Nov 9, 2018

View reviewed changes

address suggestions

178ee0b

Eh2406 force-pushed the slow-case branch from 59d4068 to 178ee0b Compare November 9, 2018 18:24

Eh2406 changed the title ~~Use a ad hoc btree to avoid slow cases~~ Use a ad hoc trie to avoid slow cases Nov 9, 2018

bors merged commit 178ee0b into rust-lang:master Nov 9, 2018

Eh2406 mentioned this pull request Nov 12, 2018

Tracking issue for proptest resolve errors. #6258

Open

Eh2406 deleted the slow-case branch November 12, 2018 18:59

Eh2406 mentioned this pull request Nov 13, 2018

Tracking issue for RFC 1977: public & private dependencies, as it relates to the resolver #6129

Open

Eh2406 mentioned this pull request Dec 1, 2018

ConflictStoreTrie: Faster filtered search #6366

Merged

Eh2406 mentioned this pull request Jan 24, 2019

0.32.0 passes_validation hangs on armel and ppc64el (or perhaps randomly) #6490

Closed

Eh2406 mentioned this pull request Mar 2, 2019

PubGrub documentation questions dart-lang/pub#2089

Closed

Eh2406 mentioned this pull request Mar 23, 2019

Resolver: A dep is equivalent to one of the things it can resolve to. #6776

Merged

Eh2406 mentioned this pull request May 8, 2019

fast path for incompatible deps #6919

Closed

ehuss added this to the 1.32.0 milestone Feb 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a ad hoc trie to avoid slow cases #6283

Use a ad hoc trie to avoid slow cases #6283

Eh2406 commented Nov 8, 2018

rust-highfive commented Nov 8, 2018

Eh2406 commented Nov 8, 2018

Eh2406 commented Nov 8, 2018

alexcrichton left a comment

Eh2406 commented Nov 9, 2018

alexcrichton left a comment

alexcrichton Nov 9, 2018

Eh2406 Nov 9, 2018

alexcrichton Nov 9, 2018

Eh2406 commented Nov 9, 2018

alexcrichton commented Nov 9, 2018

bors commented Nov 9, 2018

bors commented Nov 9, 2018

bors commented Nov 9, 2018

Eh2406 commented Nov 30, 2018

Eh2406 commented Dec 1, 2018

Use a ad hoc trie to avoid slow cases #6283

Use a ad hoc trie to avoid slow cases #6283

Conversation

Eh2406 commented Nov 8, 2018

rust-highfive commented Nov 8, 2018

Eh2406 commented Nov 8, 2018

Eh2406 commented Nov 8, 2018

alexcrichton left a comment

Choose a reason for hiding this comment

Eh2406 commented Nov 9, 2018

alexcrichton left a comment

Choose a reason for hiding this comment

alexcrichton Nov 9, 2018

Choose a reason for hiding this comment

Eh2406 Nov 9, 2018

Choose a reason for hiding this comment

alexcrichton Nov 9, 2018

Choose a reason for hiding this comment

Eh2406 commented Nov 9, 2018

alexcrichton commented Nov 9, 2018

bors commented Nov 9, 2018

bors commented Nov 9, 2018

bors commented Nov 9, 2018

Eh2406 commented Nov 30, 2018

Eh2406 commented Dec 1, 2018