forego caching cycles leads to a severe perf regression #60846

ipetkov · 2019-05-15T07:40:14Z

#60444 has introduced a severe perf regression when building/testing the conch-runtime crate.

Previously a test run would take ~5 mins, and with the latest nightly (rustc 1.36.0-nightly (372be4f36 2019-05-14)) it now takes ~82(!!) mins.

$ git clone https://github.com/ipetkov/conch-runtime.git
$ cd conch-runtime
$ cargo test --lib
   # snip
    Finished dev [unoptimized + debuginfo] target(s) in 4m 16s
$ cargo clean
$ cargo +nightly test --lib
   # snip
    Finished dev [unoptimized + debuginfo] target(s) in 82m 54s

Crate info

The crate offers the functionality to execute shell programs. Each piece of the grammar is represented as a node which can hold generic sub-nodes. The reasoning for this is so that the crate consumer could customize their AST with different/custom nodes, while reusing existing implementations.

The shell grammar is deeply recursive. Commands vary in complexity (compound commands such as case or for, or simple commands like echo foo), but each is ultimately made up of a list of shell words (literals, interpolations, etc.). Because each word can contain a command substitution, the AST type is recursive (a Command<W> has a Word<C> type, which gives us Command<Word<Command<...>>>).

There are two "top-level" type definitions which tie the entire AST together concretely; they are basically TopLevelCommand(Command<TopLevelWord>) and TopLevelWord(Word<TopLevelCommand>).
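To make the shape of that recursion concrete, here is a minimal sketch. It is not the crate's actual code: the variants, `lit`, `sample`, `cmd_depth`, `word_depth` and `assert_send` are all hypothetical names made up for illustration. It only shows how the two generic node types get closed into a cycle by the two top-level newtypes, and that asking whether `TopLevelCommand: Send` forces the compiler to walk around that cycle.

```rust
// Hypothetical, heavily simplified stand-ins for the crate's AST nodes.
enum Command<W> {
    /// A simple command such as `echo foo`: just a list of words.
    Simple(Vec<W>),
    /// A compound command, modelled here as a plain group of sub-commands.
    Group(Vec<Command<W>>),
}

enum Word<C> {
    /// A literal word such as `foo`.
    Literal(String),
    /// A command substitution such as `$(echo foo)`.
    Subst(Vec<C>),
}

// The two "top-level" newtypes close the recursion:
// TopLevelCommand -> Command<TopLevelWord> -> Word<TopLevelCommand> -> ...
struct TopLevelCommand(Command<TopLevelWord>);
struct TopLevelWord(Word<TopLevelCommand>);

/// Compiles only if `T: Send`. Proving that for `TopLevelCommand` requires
/// `TopLevelWord: Send`, which in turn requires `TopLevelCommand: Send` again.
fn assert_send<T: Send>() {}

fn lit(s: &str) -> TopLevelWord {
    TopLevelWord(Word::Literal(s.to_string()))
}

/// Deepest nesting of command substitutions inside a command.
fn cmd_depth(cmd: &Command<TopLevelWord>) -> usize {
    match cmd {
        Command::Simple(words) => words.iter().map(word_depth).max().unwrap_or(0),
        Command::Group(cmds) => cmds.iter().map(cmd_depth).max().unwrap_or(0),
    }
}

fn word_depth(word: &TopLevelWord) -> usize {
    match &word.0 {
        Word::Literal(_) => 0,
        Word::Subst(cmds) => 1 + cmds.iter().map(|c| cmd_depth(&c.0)).max().unwrap_or(0),
    }
}

/// Roughly `{ echo $(echo foo); }`.
fn sample() -> TopLevelCommand {
    let inner = TopLevelCommand(Command::Simple(vec![lit("echo"), lit("foo")]));
    let outer = Command::Simple(vec![lit("echo"), TopLevelWord(Word::Subst(vec![inner]))]);
    TopLevelCommand(Command::Group(vec![outer]))
}

fn main() {
    assert_send::<TopLevelCommand>();
    println!("substitution depth: {}", cmd_depth(&sample().0));
}
```

The real crate has many more node types and trait bounds layered on top of this pattern, so the number of cyclic auto trait obligations is presumably far larger than in this toy.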

The crate also heavily uses generics and trait bounds (perhaps overly so). However, there's hopefully some low-hanging fruit that can reduce the 16x slowdown in performance.

cc @nikomatsakis @pnkfelix


pnkfelix · 2019-05-16T10:38:49Z

a 16x compile-time regression does sound pretty bad indeed.

pnkfelix · 2019-05-16T11:35:45Z

triage: P-high. Leaving nomination tag, as I would like to discuss strategies for addressing this at the meeting, if possible.

cramertj · 2019-05-20T17:10:40Z

@matklad also thinks this might have caused this failure in rust-analyzer: rust-lang/rust-analyzer#1283

pnkfelix · 2019-05-23T14:34:16Z

"discussed" at T-compiler meeting. Assigning to self to investigate. Removing nomination tag.

nikomatsakis · 2019-05-30T14:20:49Z

Well, it was known that this could cause problems in performance. I don't know that there is a simple fix. (I suspect the errors in rust-analyzer are legit, as well)

nikomatsakis · 2019-05-30T14:21:30Z

But I was contemplating starting on a more complete re-write of the trait solver (kind of an intermediate step towards switching to chalk). I think that might be what is ultimately needed. (Note that chalk actually has a variant of this same bug...)

nikomatsakis · 2019-06-10T18:18:40Z

I have a potential fix for this.

UPDATE: But I may have just realized a flaw in the caching scheme I was planning on.

nikomatsakis · 2019-06-11T23:11:47Z

OK, #61754 is up, though still doing final tests. 🤞 When I ran it locally, it seemed to resolve the perf slowdown.

Based on rust-lang#61754 (comment), I am adding `bootstrap` to the cfg-preconditions for the two manual `unsafe impl`s of `Send` and `Sync` for `TokenTree`.

@pnkfelix

create a "provisional cache" to restore performance in the case of cycles

Introduce a "provisional cache" that caches the results of auto trait resolutions but keeps them from entering the *main* cache until everything is ready. This turned out a bit more complex than I hoped, but I don't see another short term fix -- happy to take suggestions! In the meantime, it's very clear we need to rework the trait solver.

This resolves the extreme performance slowdown experienced in #60846 -- I plan to add a perf.rust-lang.org regression test to track this.

Caveat: I've not run `x.py test` in full yet.

r? @pnkfelix
cc @arielb1

Fixes #60846
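For readers who want a feel for the idea, below is a toy model of a provisional cache. It is a sketch under loud assumptions and not rustc's implementation: `Solver`, `layered_cycle`, the integer "obligations" and the dependency graph are all invented for illustration. A node holds if all of its dependencies hold, and hitting a node that is already on the stack is treated as success, loosely mirroring how cycles among auto trait obligations are accepted. Results computed inside a cycle go into `provisional` and are moved into the main `cache` only once the head of the cycle has been fully evaluated; if the head fails, they are thrown away.

```rust
use std::collections::HashMap;

/// Toy solver over a dependency graph (hypothetical; not rustc's data structures).
struct Solver {
    deps: Vec<Vec<usize>>,
    fails: Vec<bool>,                           // nodes that fail on their own
    use_provisional: bool,                      // false = "forego caching in cycles"
    cache: HashMap<usize, bool>,                // final results only
    provisional: HashMap<usize, (bool, usize)>, // result + shallowest stack depth relied on
    stack: Vec<usize>,
    evaluations: usize,
}

impl Solver {
    fn new(deps: Vec<Vec<usize>>, use_provisional: bool) -> Self {
        let fails = vec![false; deps.len()];
        Solver { deps, fails, use_provisional, cache: HashMap::new(),
                 provisional: HashMap::new(), stack: Vec::new(), evaluations: 0 }
    }

    /// Returns (result, shallowest stack depth the result depends on);
    /// the depth is `usize::MAX` when the result is final.
    fn eval(&mut self, n: usize) -> (bool, usize) {
        if let Some(&ok) = self.cache.get(&n) {
            return (ok, usize::MAX);
        }
        if let Some(&hit) = self.provisional.get(&n) {
            return hit;
        }
        if let Some(pos) = self.stack.iter().position(|&m| m == n) {
            return (true, pos); // cycle: provisionally assume success
        }
        self.evaluations += 1;
        let depth = self.stack.len();
        self.stack.push(n);
        let mut ok = !self.fails[n];
        let mut reached = usize::MAX;
        for dep in self.deps[n].clone() {
            let (dep_ok, dep_reached) = self.eval(dep);
            ok &= dep_ok;
            reached = reached.min(dep_reached);
        }
        self.stack.pop();
        if reached < depth {
            // Result relies on a node still on the stack: keep it out of the main cache.
            if self.use_provisional {
                self.provisional.insert(n, (ok, reached));
            }
            return (ok, reached);
        }
        // `n` is final. Entries that relied on it are promoted if it holds, else dropped.
        self.cache.insert(n, ok);
        let mut settled = Vec::new();
        self.provisional.retain(|m, entry| {
            if entry.1 >= depth {
                settled.push((*m, entry.0));
                false
            } else {
                true
            }
        });
        if ok {
            self.cache.extend(settled);
        }
        (ok, usize::MAX)
    }
}

/// Root 0 depends on layer 1; each layer has two nodes that both depend on both
/// nodes of the next layer; the last layer points back at the root.
fn layered_cycle(layers: usize) -> Vec<Vec<usize>> {
    let mut deps = vec![vec![1, 2]];
    for i in 1..=layers {
        let next = if i == layers { vec![0] } else { vec![2 * i + 1, 2 * i + 2] };
        deps.push(next.clone());
        deps.push(next);
    }
    deps
}

fn main() {
    for use_provisional in [false, true] {
        let mut s = Solver::new(layered_cycle(10), use_provisional);
        let (ok, _) = s.eval(0);
        println!("provisional: {}, holds: {}, evaluations: {}", use_provisional, ok, s.evaluations);
    }
}
```

On this 10-layer graph the run without the provisional cache performs 2047 node evaluations, while the run with it performs 21, one per node. The blow-up in the first case comes from every node inside the cycle being re-evaluated each time it is reached, which is the kind of behaviour that could plausibly explain a slowdown like the one reported here, though the real solver is considerably more involved.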

@pnkfelix

Centril added the I-compiletime and T-compiler labels May 15, 2019

jonas-schievink added the I-nominated and regression-from-stable-to-nightly labels May 15, 2019

pnkfelix added the A-traits label May 15, 2019

pnkfelix added the P-high label May 16, 2019

pnkfelix self-assigned this May 23, 2019

pnkfelix removed the I-nominated label May 23, 2019

jonas-schievink added the regression-from-stable-to-beta label and removed the regression-from-stable-to-nightly label May 23, 2019

jonas-schievink mentioned this issue May 25, 2019

infinite loop when compiling syntax at stage1 ? #61162

Closed

ipetkov added a commit to ipetkov/conch-runtime that referenced this issue Jun 3, 2019

ci: add beta workaround due to rust-lang/rust#60846

34461aa

ipetkov mentioned this issue Jun 3, 2019

regression: overflow evaluating, Send/Sync? #61472

Closed

nikomatsakis mentioned this issue Jun 11, 2019

create a "provisional cache" to restore performance in the case of cycles #61754

Merged

bors closed this as completed in #61754 Jun 16, 2019

pnkfelix mentioned this issue Jun 19, 2019

Forgone caching in cycles caused much overflow in trait solving #61960

Open

estebank mentioned this issue Jul 16, 2019

ICE: Rust spins when referencing associated types in where clause #62430

Closed
