`-Z trait-solver=next`: Deduplicate region constraints in query responses #111172

ndrewxie · 2023-05-04T03:28:42Z

An attempt at fixing #109764 .

This is a MVP version, so I'm opening a PR primarily so others can review and critique it (is this correct protocol? or should a work-in-progress like this be handled differently?). I'm sure there's many points where my implementation is incorrect, but right now it does compile all the snippets provided in the issue (which I put into 1 test file called dedup-vars), and doesn't break any other code (at least, nothing that's detected by any tests :P)

Right now, I'm deduplicating the output of canonicalizing the Response - is this correct?

The idea is as follows:

Identify which variables can be deduplicated (i.e. merged if the constraints that involve them are isomorphic). For this version, I assumed that any variable that isn't in the root universe is dedupable, as canonicalization should compress all universes nameable from the caller to the root universe - correct?
For each constraint (region constraints and member constraints), extract all the variables that are present, in order. This logic is handled by constraint-walker.rs. The question is, am I extracting the variables from the constraints correctly (or am I missing some)?
For each constraint, we then create a "prototypical" version of the constraint, which is formed by replacing all variables with a dummy var. Constraints are considered structurally equal if their prototypical version is the same, i.e. they differ only in variables
The constraint variables, as well as sets of structurally equal constraints is then fed into a dedup solver, which looks for isomorphisms between sets of constraints. The solver returns a set of constraints that can be removed, as well as a set of variables that can be removed
The constraints and variables returned by the solver are then deleted. My question here is, does the order of constraints (as they're stored in .outlives and .member_constraints) matter? Right now, I'm not making any effort to preserve this order, but if it's needed I suppose I could do so, it'll just be less clean
Variable universes are "compressed" - this is given in the function compress_variables in dedup_solver/mod.rs. As variables are removed, they leave "holes" of unused universes, so I pack the universes in the remaining variables to fill this hole (minimizing the maximum universe in the vars). Is the idea of this sound, or is there something more subtle to universes that I'm missing?

Also just a quick question, what exactly is opaque_types? I couldn't find much documentation about it, and reading the code didn't give any significant clues... Should this deduplication handle that in some way?

rustbot · 2023-05-04T03:28:50Z

r? @b-naber

(rustbot has picked a reviewer for you, use r? to override)

rustbot · 2023-05-04T03:28:51Z

⚠️ Warning ⚠️

These commits modify submodules.

rustbot · 2023-05-04T03:28:54Z

Some changes occurred to the core trait solver

cc @rust-lang/initiative-trait-system-refactor

Some changes occurred in src/tools/cargo

cc @ehuss

ndrewxie · 2023-05-04T03:33:57Z

Aaaah my bad, it appears I screwed something up and staged some temp file changes to src/tools/cargo - if the rust bot pinged you about that, you can safely disregard this PR - I'll fix it once I figure out how to.

Edit: It appears that the current standings are git: 1, me: 0. I've tried doing git rm -r --cached src/tools/cargo, but it just broke my x.py (as in, none of the ./x.py commands worked afterwards). Anybody knows how I can untrack a folder?

ndrewxie · 2023-05-04T03:40:15Z

also r? @lcnr

lcnr

haven't looked at the algorithm yet, but some initial thoughts about to slightly simplify this impl.

you commited changes to cargo in 4bc7dd1b8275b129c89657852c446abe2338f471, see https://rustc-dev-guide.rust-lang.org/git.html#i-changed-a-submodule-by-accident for elp on how to solve this.

Is your algorithm something which can be viewed as:

map rust constraints to a graph
solve a known problem on graphs (or multiple problems in sequence)
map the result back to rust constraints

iiuc DedupSolver::dedup calls a bunch of subroutines and at least some of them may be equivalent to something on ordinary graphs. If so, it would be good to 1. use or mention the terminology used for graphs. generally, DedupSolver::dedup seems like the core of your approach, so this might be a good place to explain what's going on there. Right now the meaning of "category" and "rule" isn't immediately clear to me.

compiler/rustc_trait_selection/src/solve/eval_ctxt/canonical/dedup_solver/solver.rs

compiler/rustc_trait_selection/src/solve/eval_ctxt/canonical.rs

compiler/rustc_trait_selection/src/solve/eval_ctxt/canonical/dedup_solver/mod.rs

ndrewxie · 2023-05-04T20:05:55Z

Yeah, the deduplication is essentially just one big graph algorithm :) The constraints are vertices, and if two constraints can potentially be merged, they have an edge between them, and the variable mapping that would merge them is the edge label.

I suppose a "category" can best be described as an isolated clique in the graph, i.e. a clique that has no edges joining it to any other clique.

"Rule" is just my internal representation of a rust constraint, I just used it during development cuz "rule" is shorter, but I'll go ahead and replace it with constraint now

I'd like to make a few minor changes to the dedup solver, so right now I didn't add any additional comments documenting that (so it probably can't be reviewed yet) - once I settle those, I'll put additional comments and explanations for review.

compiler-errors · 2023-05-08T21:55:00Z

@ndrewxie please use a rebase instead of a merge if you want to update your branch -- we have no "no merge commit" policy in the repo

ndrewxie · 2023-05-08T22:12:50Z

@ndrewxie please use a rebase instead of a merge if you want to update your branch -- we have no "no merge commit" policy in the repo

Alright, gocha. Do you have any pointers for how I can undo my mistake? (sorry, I'm not too good with git...)
Thanks!

compiler-errors · 2023-05-08T23:39:38Z

I think you can just do git pull --rebase $UPSTREAM master where $UPSTREAM is whatever origin name you've set for the rust-lang/rust repo, not your fork..

ndrewxie · 2023-05-09T03:40:23Z

Alright, I undid my erroneous merge. When I try doing git pull --rebase I'm getting a lot of merge conflicts, however. What's the best way to deal with this?

Also sorry I haven't had too much time to work on this recently, been wrapping up another PR and have a bunch of mandatory testing. In a couple more days, I should have more time to clean up my implementation and document it a bit more for review.

Also just one question regarding lcnr's comment about deduplicating in compute_external_query_constraints instead of deduplicating the canonicalized Response - I understand why that's desirable, but without the variable universe information which is provided in canonicalization, how can I decide whether a variable (referenced in a region constraint) is nameable from the caller or not? Cuz correct me if I'm wrong, but 2 variables can only be merged if 1. the rules they appear in are isomorphic (i.e. equal up to a re-arranging of variables) and 2. at least one of them is not nameable from the caller (otherwise we might remove an inference variable that the caller "knows about")

lcnr · 2023-05-09T12:35:49Z

the information about universes is also available before we have the canonical response, because how could we otherwise create the response? 😁

you can look at how canonicalization gets the universe for existential variables, iirc we use infcx.probe_var or sth

ndrewxie · 2023-05-09T20:08:42Z

the information about universes is also available before we have the canonical response, because how could we otherwise create the response? 😁

you can look at how canonicalization gets the universe for existential variables, iirc we use infcx.probe_var or sth

Haha I guess you're right, thanks for the help! I guess the new motto going forward is, "if you can't find something, just ctrl + f under InfCtxt" 😄

EDIT: Just now I realized that my whole constraint_walker stuff could be done with a TypeFolder, which would be far cleaner and be less error-prone. I'm currently re-writing that component.

lcnr

the way you currently lower region constraints to a set of variables feels wrong to me 🤔 didn't yet go over the extern solver in detail, but if you have &'var (): 'static and &'static (): 'var they should result in the same input for the solver afaict.

This is currently somewhat difficult for me to review and understand as there are a few things which feel a bit weird '^^ would you be up to take some time to chat about this in sync? either via text on zulip or via zoom?

Something else which would be really valuable would be some example lowerings from rust syntax to the solver constraints. You already did some manual lowering for the tests and having some small examples of "input X lowers to these constraints" would be very helpful.

compiler/rustc_trait_selection/src/solve/eval_ctxt/canonical/dedup_solver/constraint_walker.rs

ndrewxie · 2023-05-13T18:22:18Z

if you have &'var (): 'static and &'static (): 'var they should result in the same input for the solver afaict.

Just to confirm, this is because if var outlives static, then var must be static, so therefore static outlives var too, right? I guess this makes sense, but how could we apply it to more complicated types like &'var1 (&'var2, &'var3, &'var4)? Would we say that only &'static (): 'var1, or would this be true for var2, var3 and var4?

would you be up to take some time to chat about this in sync? either via text on zulip or via zoom?

Sure! Zulip would be more convenient for me, but both work. Github tells me you're in Germany, so on weekends any time after 15:00 CEST (9:00 AM EDT for me) should be fine. On weekdays, I'm basically limited to after 4:00 PM EDT, so after 22:00 CEST should work.

But yeah the solver in its present state isn't very clear - I'll rework it a bit today and add the comments you asked for. The solver is certainly related to graph theory, but my understanding of that is admittedly limited, and I can't really find a neat way to map the majority of it onto pre-existing graph theory problems...

lcnr · 2023-05-15T10:08:12Z

Just to confirm, this is because if var outlives static, then var must be static, so therefore static outlives var too, right?

woops, I meant that we should not consider them to be equal but iirc your solver currently generates the same constraints for them. What are the generated constraints for &'var (): 'static and &'static (): 'var rn?

ndrewxie · 2023-05-15T19:32:54Z

woops, I meant that we should not consider them to be equal but iirc your solver currently generates the same constraints for them.

Well, yes and no - the solver does extract the same variables for both (if we say 'var is index 0, then both will be represented as [0]), however, the solver will not attempt to merge the constraints. The reason for this lies in the constraint cliques - the idea is that we partition constraints into groups ("cliques") based on a variety of properties, and we only try merging constraints if they belong to the same clique. In this case, those two constraints will not belong to the same clique because of the placement of the &'static.

These cliques get refined later in the solving process, but when we first scan the constraints and extract variables, we create "rough" cliques by the following procedure:

Take an input constraint and walk over it (TypeFolder) and extract variables as a Vec
A copy of each constraint is then made, with all the variables replaced with a dummy placeholder (done through TypeFolder). For example, &'var (): 'static would become &<SOME DUMMY PLACEHOLDER>: 'static, and &'static (): var would become &'static (): <SOME DUMMY PLACEHOLDER>. I think I called them "erased constraints" or something in my code. Essentially, this process erases the information of the actual var values, however, everything that isn't a var value (e.g. 'static) is still kept. Moreover, the order and location of all these terms is still retained. This allows us to answer the question of, "are these constraints equal if we ignore variables"?
We then build a map, mapping an erased constraint to a set of constraints that produce that erased constraint. These sets of constraints now form the constraint cliques.

In the case you provided, two cliques would be made:

&'static (): <SOME DUMMY PLACEHOLDER>, which corresponds to &'static (): var
&<SOME DUMMY PLACEHOLDER>: 'static, which corresponds to &'var (): 'static
This clique information is passed into the solver, so the solver will not attempt to merge these two constraints.

My apologies if this was poorly explained in the comments (or is poorly explained now 😅). I'll put in better documentation for this part later today.

bors · 2023-05-25T12:08:53Z

☔ The latest upstream changes (presumably #111473) made this pull request unmergeable. Please resolve the merge conflicts.

…/tools/cargo (hopefully)

…canonicalization, reworked constraint_walker to use TypeFolder/TypeFoldable

…w more comments

…wer constraints over older

…wer constraints over older (2)

wesleywiser · 2023-07-06T14:46:15Z

Hi @lcnr, just letting you know that this PR has been updated. I think it's ready for review whenever you have a chance. Thanks! 🙂

lcnr · 2023-08-11T10:03:01Z

closing this for now as I hope the pattern fixed here does not occur in actual code so it adds complexity to the solver. I am still thankful to @ndrewxie for working on this and chatting about it on zulip, as this provided me with a far better understanding of what's going on here and also uncovered a bigger issue of our overflow handling.

rustbot assigned b-naber May 4, 2023

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. WG-trait-system-refactor The Rustc Trait System Refactor Initiative (-Znext-solver) labels May 4, 2023

This comment has been minimized.

Sign in to view

rustbot assigned lcnr and unassigned b-naber May 4, 2023

lcnr reviewed May 4, 2023

View reviewed changes

ndrewxie force-pushed the dedup branch from 6a0a4be to bf9b336 Compare May 9, 2023 03:34

lcnr reviewed May 13, 2023

View reviewed changes

compiler/rustc_trait_selection/src/solve/eval_ctxt/canonical/dedup_solver/constraint_walker.rs Outdated Show resolved Hide resolved

ndrewxie added 4 commits May 28, 2023 16:40

Initial attempt at deduplicating region constraints

89d955f

Added a few comments

074b9b0

Added a few comments (3)

96f35a7

Fixed Rc/Arc issue and tidy errors

dce80d4

ndrewxie added 10 commits May 28, 2023 16:42

Fiddling around with git... Fixed Rc/Arc issue with Lrc, unstaged src…

3ac13f5

…/tools/cargo (hopefully)

Removed debug printlns (whoops)

a4e4973

fixup! Added a few comments

6de1233

Moved deduplication to compute_external_query_constraints instead of …

3a3e40c

…canonicalization, reworked constraint_walker to use TypeFolder/TypeFoldable

Added more comments + slight cleanup to code

4e10821

Removed some ignore warning markers added during debugging

3d28fda

Minor simplifications+optimizations for core dedup solver, added a fe…

73dbf78

…w more comments

Removed some (likely premature) optimizations to improve code clarity

021310a

Removed unsound notion of unremovable_vars, prioritized removal of ne…

7c55e56

…wer constraints over older

Removed unsound notion of unremovable_vars, prioritized removal of ne…

61d08a6

…wer constraints over older (2)

ndrewxie force-pushed the dedup branch from 1495199 to 61d08a6 Compare May 29, 2023 01:01

ndrewxie added 2 commits May 28, 2023 23:20

Hopefully simplified solver impl a bit

711986b

Hopefully simplified solver impl a bit (2)

67fbe17

lcnr closed this Aug 11, 2023

apiraino removed the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Oct 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`-Z trait-solver=next`: Deduplicate region constraints in query responses #111172

`-Z trait-solver=next`: Deduplicate region constraints in query responses #111172

ndrewxie commented May 4, 2023 •

edited

Loading

rustbot commented May 4, 2023

rustbot commented May 4, 2023

rustbot commented May 4, 2023

ndrewxie commented May 4, 2023 •

edited

Loading

This comment has been minimized.

ndrewxie commented May 4, 2023

lcnr left a comment

ndrewxie commented May 4, 2023 •

edited

Loading

compiler-errors commented May 8, 2023

ndrewxie commented May 8, 2023

compiler-errors commented May 8, 2023 •

edited

Loading

ndrewxie commented May 9, 2023 •

edited

Loading

lcnr commented May 9, 2023 •

edited

Loading

ndrewxie commented May 9, 2023 •

edited

Loading

lcnr left a comment •

edited

Loading

ndrewxie commented May 13, 2023 •

edited

Loading

lcnr commented May 15, 2023

ndrewxie commented May 15, 2023 •

edited

Loading

bors commented May 25, 2023

wesleywiser commented Jul 6, 2023

lcnr commented Aug 11, 2023

-Z trait-solver=next: Deduplicate region constraints in query responses #111172

-Z trait-solver=next: Deduplicate region constraints in query responses #111172

Conversation

ndrewxie commented May 4, 2023 • edited Loading

rustbot commented May 4, 2023

rustbot commented May 4, 2023

rustbot commented May 4, 2023

ndrewxie commented May 4, 2023 • edited Loading

This comment has been minimized.

ndrewxie commented May 4, 2023

lcnr left a comment

Choose a reason for hiding this comment

ndrewxie commented May 4, 2023 • edited Loading

compiler-errors commented May 8, 2023

ndrewxie commented May 8, 2023

compiler-errors commented May 8, 2023 • edited Loading

ndrewxie commented May 9, 2023 • edited Loading

lcnr commented May 9, 2023 • edited Loading

ndrewxie commented May 9, 2023 • edited Loading

lcnr left a comment • edited Loading

Choose a reason for hiding this comment

ndrewxie commented May 13, 2023 • edited Loading

lcnr commented May 15, 2023

ndrewxie commented May 15, 2023 • edited Loading

bors commented May 25, 2023

wesleywiser commented Jul 6, 2023

lcnr commented Aug 11, 2023

`-Z trait-solver=next`: Deduplicate region constraints in query responses #111172

`-Z trait-solver=next`: Deduplicate region constraints in query responses #111172

ndrewxie commented May 4, 2023 •

edited

Loading

ndrewxie commented May 4, 2023 •

edited

Loading

ndrewxie commented May 4, 2023 •

edited

Loading

compiler-errors commented May 8, 2023 •

edited

Loading

ndrewxie commented May 9, 2023 •

edited

Loading

lcnr commented May 9, 2023 •

edited

Loading

ndrewxie commented May 9, 2023 •

edited

Loading

lcnr left a comment •

edited

Loading

ndrewxie commented May 13, 2023 •

edited

Loading

ndrewxie commented May 15, 2023 •

edited

Loading