Backend refactorings and perf improvements #6867

alexcrichton · 2019-04-22T22:29:20Z

This PR is extracted from #6864 and then also additionally adds some commits related to performance optimizations that I noticed while profiling #6864. Each commit is in theory standalone and should pass all the tests, as well as being descriptive about what it's doing.

I... don't think this needed any more! Doing some digging looks like this was originally added in 79768eb. That was so early it didn't even use a PR and was almost 5 years ago. Since then we've had a huge number of changes to the backend and `Unit` nowadays does all the deduplication we need, so no need to store a `Vec` here and we can just have a mapping of `Key` to `Job` and that's it.

This looks like it was a bug ever present from the original implementation of a GNU jobserver in rust-lang#4110, but we currently unconditionally request a token is allocated for any job we pull off our job queue. Rather we only need to request tokens for everything but the first job because we already have an implicit token for that job.

Take a `&Context` argument instead of a few component arguments, avoids passing around some extraneous information.

This isn't actually necessary with a bit of refactoring, and in general it makes management of `JobQueue` simpler since there's one less type to worry about. The one main usage of `Key` vs `Unit` was that `Key` was `Send` to be placed in `Message`, but this was replaced by just assigning each `Unit` an ever-increasing integer, and then `Finished` contains the integer rather than the `Key` itself which we can map once we get back to the main thread.

The code lying around in `dep_targets` is pretty old at this point, but given the current iteraiton there's no need to re-sort on each call to `dep_targets`, only initially during construction!

We only need to parse this information once, so calculate it when a `TargetInfo` is created and then just reuse that from then on out.

This commit starts to intern `Unit` structures for a few reasons: * This primarily makes equality and hashing much faster. We have tons of hash lookups with units, and they were showing up quite high in profiles. It turns out `Unit` hashes a *ton* of data, and most of it is always redundant. To handle this they're all only hashed once now and hashing/equality are just pointer checks. * The size of `Unit` is now drastically reduced to just one pointer, so movement of units throughout the backend should be much more efficient.

rust-highfive · 2019-04-22T22:29:27Z

r? @ehuss

(rust_highfive has picked a reviewer for you, use r? to override)

alexcrichton · 2019-04-22T22:30:09Z

I should mention that the overall performance wins of this were pretty modest, I was seeing ~10% with an average of 180-190ms for a null build of Cargo itself, whereas afterwards it's 170-180ms but the variances are quite high.

This commit moves it from a linear query which does a lot of hashing of `PathBuf` to an `O(1)` query that only hashes `PackageId`. This improves Servo's null build performance by 50ms or so, causing a number of functions to disappear from profiles.

This commit moves a linear scan which happens once-per-each-dependency to an O(1) lookup which happens only once for each package. This removes another 30ms or so from a null build in Servo.

alexcrichton · 2019-04-22T23:45:24Z

I should also note that the lion's share of a possible improvement to Cargo null build time will likely be #6866 as that's >50% of a null build from my measurements. While we can't get it down to zero probably we can make it way faster in theory.

ehuss

Nice! I've been concerned about how big Unit is, and I'm a little surprised it doesn't help more, but I guess computers are fast at copying things.

r=me with the comment.

ehuss · 2019-04-23T17:27:52Z

src/cargo/core/compiler/unit.rs

+    fn intern_inner(&'a self, item: &UnitInner<'a>) -> &'a UnitInner<'a> {
+        let mut me = self.state.borrow_mut();
+        if let Some(item) = me.cache.get(item) {
+            return unsafe { &*(&**item as *const UnitInner<'a>) };


I think I understand what this is doing, but can you add a comment explaining it, along with why it is safe?

Is this correct: Starting with &Box<UnitInner> it pulls UnitInner out of the box, casts to a pointer to force the borrow checker to forget about the borrow, then converts the pointer back to &UnitInner?

Correct yeah, and I'll leave some comments, an excellent point!

alexcrichton · 2019-04-23T21:58:39Z

@bors: r=ehuss

bors · 2019-04-23T21:58:40Z

📌 Commit 32269f4 has been approved by ehuss

bors · 2019-04-23T21:58:47Z

⌛ Testing commit 32269f4 with merge b13ed92aaada88b7053fdcc0d871251ff3b34a55...

bors · 2019-04-23T22:19:39Z

💔 Test failed - checks-travis

ehuss · 2019-04-23T22:26:06Z

Hmm, something looks wrong with rustup, it's failing for me elsewhere.

alexcrichton · 2019-04-25T15:26:49Z

@bors: retry

bors · 2019-04-25T15:52:13Z

⌛ Testing commit 32269f4 with merge 6562942...

Backend refactorings and perf improvements This PR is extracted from #6864 and then also additionally adds some commits related to performance optimizations that I noticed while profiling #6864. Each commit is in theory standalone and should pass all the tests, as well as being descriptive about what it's doing.

bors · 2019-04-25T16:29:16Z

☀️ Test successful - checks-travis, status-appveyor
Approved by: ehuss
Pushing 6562942 to master...

Add explicit lifetimes to Executor::init. Since the unit interner was added in #6867, I am getting a strange borrow check error compiling RLS with the latest Cargo (that is, linking rls with the new cargo). Adding explicit lifetimes to the `Executor::init` function seems to fix the problem. I don't 100% understand the error, because none of the relevant function or type signatures changed in that PR. The only thing that gives me a clue is [this change](https://github.com/rust-lang/cargo/pull/6867/files#diff-0bb62cb712c0811a17d7a726e068bf65L112) to `BuildPlan::add` which does something similar, but that is not directly called by RLS. The error looks like this: ``` error[E0623]: lifetime mismatch --> rls/src/build/cargo_plan.rs:149:24 | 126 | unit: &Unit<'_>, | -------- these two types are declared with different lifetimes... 127 | cx: &Context<'_, '_>, | --------------- ... 149 | let units = cx.dep_targets(unit); | ^^^^^^^^^^^ ...but data from `unit` flows into `cx` here ``` I generally don't like making changes if I don't understand them, but I'm a little stumped here how this is happening even though everything is defined with the same (`'a`) lifetime. This unblocks updating RLS to the latest Cargo.

alexcrichton added 7 commits April 22, 2019 13:47

Refactor JobQueue::run slightly

21cae01

Take a `&Context` argument instead of a few component arguments, avoids passing around some extraneous information.

Sort units once, not each call to dep_targets

7246028

The code lying around in `dep_targets` is pretty old at this point, but given the current iteraiton there's no need to re-sort on each call to `dep_targets`, only initially during construction!

Avoid reparsing env_args continuously

bd8253f

We only need to parse this information once, so calculate it when a `TargetInfo` is created and then just reuse that from then on out.

rust-highfive assigned ehuss Apr 22, 2019

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Apr 22, 2019

alexcrichton added 2 commits April 22, 2019 16:01

Hoist a workspace membership check out of a loop

d274fba

This commit moves a linear scan which happens once-per-each-dependency to an O(1) lookup which happens only once for each package. This removes another 30ms or so from a null build in Servo.

ehuss reviewed Apr 23, 2019

View reviewed changes

Document unsafety in UnitInterner

32269f4

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Apr 23, 2019

bors added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels Apr 23, 2019

alexcrichton mentioned this pull request Apr 24, 2019

Implement the Cargo half of pipelined compilation #6864

Closed

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Apr 25, 2019

bors merged commit 32269f4 into rust-lang:master Apr 25, 2019

This was referenced Apr 25, 2019

Make cargo preduce dependency binaries to a shared directory for reuse #6437

Closed

Bugfix/crt static in config #6736

Closed

ehuss mentioned this pull request Apr 30, 2019

Add explicit lifetimes to Executor::init. #6893

Merged

alexcrichton deleted the improvements branch May 6, 2019 18:12

This was referenced May 24, 2019

Make resolution with correct lock file faster #5321

Closed

Crate graph resolution on parity-wasm with a lockfile is noticeably slow #5817

Closed

ehuss added this to the 1.36.0 milestone Feb 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backend refactorings and perf improvements #6867

Backend refactorings and perf improvements #6867

alexcrichton commented Apr 22, 2019

rust-highfive commented Apr 22, 2019

alexcrichton commented Apr 22, 2019

alexcrichton commented Apr 22, 2019

ehuss left a comment

ehuss Apr 23, 2019

alexcrichton Apr 23, 2019

alexcrichton commented Apr 23, 2019

bors commented Apr 23, 2019

bors commented Apr 23, 2019

bors commented Apr 23, 2019

ehuss commented Apr 23, 2019

alexcrichton commented Apr 25, 2019

bors commented Apr 25, 2019

bors commented Apr 25, 2019

Backend refactorings and perf improvements #6867

Backend refactorings and perf improvements #6867

Conversation

alexcrichton commented Apr 22, 2019

rust-highfive commented Apr 22, 2019

alexcrichton commented Apr 22, 2019

alexcrichton commented Apr 22, 2019

ehuss left a comment

Choose a reason for hiding this comment

ehuss Apr 23, 2019

Choose a reason for hiding this comment

alexcrichton Apr 23, 2019

Choose a reason for hiding this comment

alexcrichton commented Apr 23, 2019

bors commented Apr 23, 2019

bors commented Apr 23, 2019

bors commented Apr 23, 2019

ehuss commented Apr 23, 2019

alexcrichton commented Apr 25, 2019

bors commented Apr 25, 2019

bors commented Apr 25, 2019