
Massive compiler perf regression #31157

Closed
Gankra opened this issue Jan 24, 2016 · 27 comments
Assignees
Labels
I-compiletime Issue: Problems and improvements with respect to compile times. P-medium Medium priority T-compiler Relevant to the compiler team, which will review and decide on the PR/issue.

Comments

@Gankra
Contributor

Gankra commented Jan 24, 2016

First reported on Reddit here. See that post for more details and examples.

This crate has gone from a build time of 17 seconds with the compiler consuming 168MB of RAM at peak to 320 seconds and 1670MB. Apparently this regression occurred somewhere between Jan 15 and Jan 18, and is present in the current beta. The author hypothesizes that it may have to do with complex usage of traits.

@Gankra Gankra added I-slow Issue: Problems and improvements with respect to performance of generated code. A-compiler labels Jan 24, 2016
@dirk
Contributor

dirk commented Jan 24, 2016

Quoting a comment from Reddit (see here) so it doesn't get lost:

Paging /u/arielby and /u/nikomatsakis, they always know what's going on in the type checker

EDIT: Does the Jan 18 nightly contain the obligation forest (#30533), by any chance?

@aturon
Member

aturon commented Jan 24, 2016

cc @nikomatsakis @arielb1

@White-Oak
Contributor

+1 on this (for reference, output of `cargo rustc -- -Z time-passes`). As for complex trait usage (example, once again, here): it has always been slow, but at an acceptable rate. The author of the library even mentions this in the README.md:

Be aware, due to an ongoing issue with rustc, the compile time of your code will exponentially increase with the complexity of your parsers. In practice I've found things get bad after about 10 combinations or so. You can get around this by boxing a parser...

The issue he mentions is #22204. The current issue may be considered a duplicate of that one, though, again, the problem was never of this magnitude.

The author suggests boxing as a workaround:

This "flattens" the type signature of the parser into a trait object, which will improve compile time at the cost of runtime performance due to dynamic dispatch.

But, once again, it no longer helps much.
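The boxing workaround quoted above can be seen in a minimal sketch (the combinator names here are illustrative, not the actual `peruse` API): every composition layer such as `Map` adds a level of generic nesting to the parser's type, and boxing erases that nesting behind a single trait-object type.

```rust
// Minimal combinator sketch; `Parser`, `Map`, and `Digit` are illustrative
// stand-ins, not the real library types.
trait Parser {
    type Input;
    type Output;
    fn parse(&self, input: Self::Input) -> Self::Output;
}

// A concrete combinator whose type grows with each composition step:
// composing n times yields Map<Map<...Map<Digit, _>...>, _>.
struct Map<P, F>(P, F);

impl<P, F, T> Parser for Map<P, F>
where
    P: Parser,
    F: Fn(P::Output) -> T,
{
    type Input = P::Input;
    type Output = T;
    fn parse(&self, input: Self::Input) -> T {
        (self.1)(self.0.parse(input))
    }
}

struct Digit;
impl Parser for Digit {
    type Input = char;
    type Output = u32;
    fn parse(&self, c: char) -> u32 {
        c.to_digit(10).unwrap_or(0)
    }
}

fn main() {
    // Unboxed: the full nested type is spelled out in the signature
    // the compiler has to work with.
    let nested = Map(Map(Digit, |n: u32| n + 1), |n: u32| n * 2);

    // Boxed: the nesting is erased behind one trait-object type, trading
    // a virtual call at runtime for a much smaller type for rustc to track.
    let flat: Box<dyn Parser<Input = char, Output = u32>> = Box::new(nested);
    println!("{}", flat.parse('4')); // (4 + 1) * 2 = 10
}
```

The point of this issue is that even this flattened form stopped being enough to keep compile times acceptable after the regression.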

@White-Oak
Contributor

Also, this may be considered a regression from stable to beta:
Code compiles rather quickly on stable 1.6 but slowly on beta 1.7. I went further and found the exact nightlies whose compilation times differ.

@sfackler sfackler added I-nominated T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Jan 24, 2016
@gereeter
Contributor

Note that the obligation forest PR was merged on January 16, so it did get merged in the window in which the regression happened.

@nikomatsakis
Contributor

Seems pretty likely to have to do with the obligation forest and also with the change in caching behavior that accompanied it. I can investigate some.

@MagaTailor

Is the syntex_syntax issue related? It was always bad, but now it's definitely worse.

@nikomatsakis
Contributor

I can certainly confirm that this is traits-related; at least, the crate in question spends about 85% of its compilation time in middle::traits. I'm looking more in depth now.

@nikomatsakis
Contributor

@petevine unlikely to be related to that, but I don't know

@nikomatsakis
Contributor

Seems pretty clear that the caching of sized results is no longer kicking in like it once did. This probably affects the script crate from servo too.

@nikomatsakis
Contributor

e.g., we repeatedly prove that T: Sized for the following types T. Here, the first number is the number of times we have proved it in my current logs. Note that compilation did not yet complete. :)

  79297 <*const alloc::rc::RcBox<fn() -> Box<peruse::parsers::Parser<I=[grammar_lexer::Token], O=grammar::Expr>> {parser::program::expression}>
  81467 <core::marker::PhantomData<alloc::rc::RcBox<fn() -> Box<peruse::parsers::Parser<I=[grammar_lexer::Token], O=grammar::Expr>> {parser::program::expression}>>
  81467 <core::nonzero::NonZero<*const alloc::rc::RcBox<fn() -> Box<peruse::parsers::Parser<I=[grammar_lexer::Token], O=grammar::Expr>> {parser::program::expression}>>
  82807 <core::ptr::Shared<alloc::rc::RcBox<fn() -> Box<peruse::parsers::Parser<I=[grammar_lexer::Token], O=grammar::Expr>> {parser::program::expression}>>
  82984 <peruse::parsers::RecursiveParser<[grammar_lexer::Token], grammar::Expr, fn() -> Box<peruse::parsers::Parser<I=[grammar_lexer::Token], O=grammar::Expr>> {parser::program::expression}>
  82999 <alloc::rc::Rc<fn() -> Box<peruse::parsers::Parser<I=[grammar_lexer::Token], O=grammar::Expr>> {parser::program::expression}>
 132273 <fn() -> Box<peruse::parsers::Parser<I=[grammar_lexer::Token], O=grammar::Expr>> {parser::program::expression}
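Every type in that log wraps the same function-item type. A minimal reproduction of the shape (illustrative names, not the original crate) shows where such a type comes from: a recursive grammar is tied together with an `Rc`'d factory function, so the factory's unique zero-sized fn-item type ends up nested inside `Rc`, `RcBox`, `NonZero`, `PhantomData`, and so on, with each layer needing its own `T: Sized` proof.

```rust
use std::rc::Rc;

// Illustrative stand-ins for the parser trait and a leaf parser.
trait Parser {
    type O;
    fn run(&self) -> Self::O;
}

struct Lit(i64);
impl Parser for Lit {
    type O = i64;
    fn run(&self) -> i64 { self.0 }
}

// The factory function for a recursive rule. Its fn-item type,
// `fn() -> Box<dyn Parser<O = i64>> {expression}`, is unique to this
// function, just like `{parser::program::expression}` in the log above.
fn expression() -> Box<dyn Parser<O = i64>> {
    Box::new(Lit(42))
}

fn main() {
    // Wrapping the factory in Rc buries its fn-item type under all of
    // Rc's internal layers, each of which generates Sized obligations.
    let factory: Rc<fn() -> Box<dyn Parser<O = i64>>> = Rc::new(expression);
    println!("{}", (*factory)().run()); // 42
}
```

Without caching, the same `T: Sized` proof is redone for each occurrence of each layer, which matches the tens of thousands of repetitions in the log.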

@nikomatsakis
Contributor

triage: P-high

@rust-highfive rust-highfive added P-high High priority and removed I-nominated labels Jan 28, 2016
bors added a commit that referenced this issue Feb 5, 2016
…he, r=aturon

Have the `ObligationForest` keep some per-tree state (of type `T`) and have it give out a mutable reference for use when processing obligations. In this case, it will be a hashmap. This obviously affects the work that @soltanmm has been doing on snapshotting. I partly want to toss this out there for discussion.

Fixes #31157. (The test in question goes to approx. 30s instead of 5 minutes for me.)
cc #30977.
cc @aturon @arielb1 @soltanmm

r? @aturon who reviewed original `ObligationForest`
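The mechanism described in that commit message, threading per-tree state of type `T` through obligation processing, can be sketched generically (a hypothetical simplification, not rustc's actual `ObligationForest` API):

```rust
use std::collections::HashMap;

// Hypothetical simplification: each tree of obligations carries its own
// state `T` (here used as a cache), handed out mutably while that
// tree's obligations are processed.
struct ObligationTree<O, T> {
    pending: Vec<O>,
    cache: T,
}

struct Forest<O, T> {
    trees: Vec<ObligationTree<O, T>>,
}

impl<O, T> Forest<O, T> {
    fn process_obligations<F>(&mut self, mut op: F)
    where
        F: FnMut(&O, &mut T) -> bool, // true = proved, drop the obligation
    {
        for tree in &mut self.trees {
            let cache = &mut tree.cache;
            tree.pending.retain(|ob| !op(ob, cache));
        }
    }
}

fn main() {
    // Memoize "is this type Sized?" per tree, so the same query is not
    // re-proved tens of thousands of times as in the logs above.
    let mut forest = Forest {
        trees: vec![ObligationTree {
            pending: vec!["Rc<T>: Sized", "Rc<T>: Sized", "fn(): Sized"],
            cache: HashMap::<&str, bool>::new(),
        }],
    };
    let mut proofs_done = 0;
    forest.process_obligations(|ob, cache| {
        *cache.entry(*ob).or_insert_with(|| {
            proofs_done += 1; // expensive trait selection happens only once
            true
        })
    });
    println!("distinct proofs: {}", proofs_done); // 2, not 3
}
```

Scoping the cache to a tree (rather than sharing one global cache) is what interacts with the snapshotting work mentioned in the message: per-tree state can be discarded along with its tree.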
@White-Oak
Contributor

So, since this was fixed: is the fix going to be in the next nightly?

@nikomatsakis
Contributor

Ought to be, yes.


@White-Oak
Contributor

@nikomatsakis excuse me for bothering again.

  1. What about beta? Should this fix be backported to it? As stated in the first comment, this regression is in beta as well.
  2. Well, for me the time went from 40 secs (2016-01-15) to 56 secs (current nightly: 2016-02-07), which is almost a 30% regression, but I guess that's the best that could be done.

@nikomatsakis
Contributor

@White-Oak

What about beta? Should this fix be backported to it? As stated in the first comment, this regression is in beta as well.

Good point. I've nominated the PR for backporting.

Well, for me the time went from 40 secs (2016-01-15) to 56 secs (current nightly: 2016-02-07), which is almost a 30% regression, but I guess that's the best that could be done.

It may be possible to improve further.

@Marwes
Contributor

Marwes commented Feb 17, 2016

Is there any work being done towards improving this further? combine still suffers badly from this change, to the point that Travis times out when building it (see https://travis-ci.org/Marwes/combine).

@White-Oak
Contributor

@Marwes it has not been backported to beta yet.

@bluss
Member

bluss commented Feb 17, 2016

That Travis link shows a regression in cargo build time from 96 seconds on stable to 374 seconds on nightly. Can we keep an issue open for that, or is there no way to recover the compilation time?

Edit: Ok, that's lopsided since nightly runs benchmarking and things. I guess those timings are not usable.

@Marwes
Contributor

Marwes commented Feb 17, 2016

Here is a link to a build from before the regression; there is still a significant slowdown. And even on nightly it fails occasionally: https://travis-ci.org/Marwes/combine/jobs/109742755.

@nikomatsakis nikomatsakis reopened this Feb 18, 2016
@nikomatsakis
Contributor

I'm willing to re-open the issue on the basis that there is still ground to get back, even if the worst part is solved.

@nikomatsakis
Contributor

triage: P-medium

@rust-highfive rust-highfive added P-medium Medium priority and removed P-high High priority labels Feb 18, 2016
@nikomatsakis
Contributor

However, I'm lowering the priority to medium.

@sanxiyn sanxiyn added I-compiletime Issue: Problems and improvements with respect to compile times. and removed I-slow Issue: Problems and improvements with respect to performance of generated code. labels Mar 23, 2016
@brson
Contributor

brson commented Oct 20, 2016

Needs to be retested, and a test case added to perf.rlo. cc @nrc @Mark-Simulacrum.

@Mark-Simulacrum
Member

I'll add a test case to perf.rlo later, but currently `time -v` yields:

$ /usr/bin/time -v cargo build
   Compiling regex-syntax v0.2.2
   Compiling libc v0.2.5
   Compiling memchr v0.1.7
   Compiling aho-corasick v0.4.0
   Compiling regex v0.1.48
   Compiling peruse v0.3.0 (https://github.com/DanSimon/peruse.git#dbfc0054)
   Compiling parser v0.1.0 (file:///home/mark/Edit/rust-compilation-time-ram-regression)
    Finished debug [unoptimized + debuginfo] target(s) in 9.63 secs
    Command being timed: "cargo build"
    User time (seconds): 10.44
    System time (seconds): 0.31
    Percent of CPU this job got: 108%
    Elapsed (wall clock) time (h:mm:ss or m:ss): 0:09.91
    Average shared text size (kbytes): 0
    Average unshared data size (kbytes): 0
    Average stack size (kbytes): 0
    Average total size (kbytes): 0
    Maximum resident set size (kbytes): 150980
    Average resident set size (kbytes): 0
    Major (requiring I/O) page faults: 0
    Minor (reclaiming a frame) page faults: 122458
    Voluntary context switches: 101
    Involuntary context switches: 42
    Swaps: 0
    File system inputs: 0
    File system outputs: 66112
    Socket messages sent: 0
    Socket messages received: 0
    Signals delivered: 0
    Page size (bytes): 4096
    Exit status: 0

So I think this issue can be considered fixed. Can someone clarify exactly what part of the compiler was being stressed? I can add this crate to perf.rlo no problem, but perhaps something better can be designed?

@White-Oak
Contributor

White-Oak commented Oct 21, 2016

I reported this originally, and now I see the same numbers as @Mark-Simulacrum. I've tested it with several nightlies over the year, and it kept getting better and better (from the original 30 secs to 20, 15, 12, and finally 10 as of today). Feels great. Thanks to the Rust team for the effort put into compile-speed optimizations!

This exact issue with the example crate may be closed as solved; however, the original drop in performance was caused by the introduction of the obligation forest, as explained above by @nikomatsakis.
The last I heard, it was reverted on beta in #31851. I hope someone can clarify further, though I'm afraid the trail may be lost.

Edit: Actually, the crate itself takes only 5 seconds to build, which is awesome.

@Mark-Simulacrum
Member

Closing in favor of rust-lang-deprecated/rustc-perf-collector#2 which tracks adding a test for this.

kennytm added a commit to kennytm/rust that referenced this issue Aug 10, 2018
Consider changing assert! to debug_assert! when it calls visit_with

The perf run from rust-lang#52956 revealed that there were 3 benchmarks that benefited most from changing `assert!`s to `debug_assert!`s:

- issue rust-lang#46449: avg -4.7% for -check
- deeply-nested (AKA rust-lang#38528): avg -3.4% for -check
- regression rust-lang#31157: avg -3.2% for -check

I analyzed their fixing PRs and decided to look for potentially heavy assertions in the files they modified. I noticed that all of the non-trivial ones contained indirect calls to `visit_with()`.

It might be a good idea to consider changing `assert!` to `debug_assert!` in those places in order to get the performance wins shown by the benchmarks.
GuillaumeGomez added a commit to GuillaumeGomez/rust that referenced this issue Aug 12, 2018
@pnkfelix pnkfelix mentioned this issue Nov 2, 2021