-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Build script rerun-if changes break rebuild detection. #7362
Comments
…ichton Fix build script sanitizer check. rust-lang#64166 changed the way the sanitizer build scripts work. However, they were changed so that they switch between new-style to old-style cargo fingerprints. This trips up on rust-lang/cargo#6779. It also causes rustbuild to panic. If you build stage1 std (with sanitizers off), and then enable sanitizers, it panics. (This is because the build scripts don't declare that they need to re-run.) This PR will trip rust-lang/cargo#6779 again, unfortunately. I've been having way too many unexplained rebuilds in rust-lang/rust recently, but at least I'll know why this time. This doesn't fix all problems with the build scripts, but arguably they should be fixed in cargo. For example, the build scripts change which rerun-if statements they declare between runs which triggers rust-lang/cargo#7362. The test for this is: 1. Turn off sanitizers (which is the default) 2. `./x.py build --stage=1 src/libstd` 3. `./x.py build --stage=1 src/libstd` again should be a null build. 4. Enable sanitizers. 5. `./x.py build --stage=1 src/libstd` should rebuild with sanitizers enabled. 6. `./x.py build --stage=1 src/libstd` again should be a null build. This actually rebuilds due to rust-lang/cargo#7362 because the rerun-if directives changed between step 3 and 5. A 3rd attempt should be a null build.
Ok I think I understand what's happening here as well, and everything you mentioned seems spot on. One thing I was a bit confused at is where the memoized hashes that need to be invalidated come from, but this happens during an incremental build where we compare a bunch of hashes, so before the build starts we've calculated a bunch of hashes, including the parents of the one being invalidated. Then when it's invalidated everything stops working because the parent memoized hashes aren't invalidated when the child is. Also to clarify where debug cargo fails, it trips this assertion which asserts the memoized hash is indeed matching the hash of the actual struct. As to how to fix this, that's a good question. This is the only instance of invalidating a hash and I would prefer that it doesn't have too large of an impact on Cargo. That being said the fact that the hash changes at runtime is pretty fundamental to how build scripts work, so it should probably have some impact on design. A quick-and-dirty solution would look like: diff --git a/src/cargo/core/compiler/fingerprint.rs b/src/cargo/core/compiler/fingerprint.rs
index 27392ac7c..ff7320abe 100644
--- a/src/cargo/core/compiler/fingerprint.rs
+++ b/src/cargo/core/compiler/fingerprint.rs
@@ -301,7 +301,6 @@ pub fn prepare_target<'a, 'cfg>(
// hobble along if it happens to return `Some`.
if let Some(new_local) = (gen_local)(&deps, None)? {
*fingerprint.local.lock().unwrap() = new_local;
- *fingerprint.memoized_hash.lock().unwrap() = None;
}
write_fingerprint(&loc, &fingerprint)
@@ -613,6 +612,10 @@ impl Fingerprint {
ret
}
+ pub fn clear_memoized(&self) {
+ *self.memoized_hash.lock().unwrap() = None;
+ }
+
/// Compares this fingerprint with an old version which was previously
/// serialized to filesystem.
///
diff --git a/src/cargo/core/compiler/job_queue.rs b/src/cargo/core/compiler/job_queue.rs
index cea9e4e23..f7637eba1 100644
--- a/src/cargo/core/compiler/job_queue.rs
+++ b/src/cargo/core/compiler/job_queue.rs
@@ -248,6 +248,10 @@ impl<'a, 'cfg> JobQueue<'a, 'cfg> {
.take()
.map(move |srv| srv.start(move |msg| drop(tx.send(Message::FixDiagnostic(msg)))));
+ for unit in cx.fingerprints.values() {
+ unit.clear_memoized();
+ }
+
// Use `crossbeam` to create a scope in which we can execute scoped
// threads. Note that this isn't currently required by Cargo but it was
// historically required. This is left in for now in case we need the which is to basically just zero out all memoizations just before we start doing real work. That may be an alright long-term solution too, but I'd want to at least attempt to take a look at the perf impact of that. |
I thought about doing the same thing, too. I was wondering if there might be a more clever solution (I was investigating if the I did some tests with your patch, and I don't see any performance difference. Do you want to open a PR with it? |
Sure yeah, I'll run that tomorrow and do some more poking at it myself as well. |
Clear out memoized hashes before building crates Build script updates during execution can change the memoized hash of a `Fingerprint`, and while previously we cleared out a single build script's memoized hash we forgot to clear out everything that depended on it as well. This commit pessimistically clears out all `Fingerprint` memoized hashes just before building to ensure that during the build everything has the most up-to-date view of the world, and when build scripts change fingerprints everything that depends on them won't have run yet. Closes #7362
If a build script changes what values it prints for "rerun-if" directives between runs, then the rebuild detection doesn't work correctly. An example of what happens:
A real-world example is the llvm and sanitizer build scripts in rust-lang/rust (like this). I suspect this is a relatively common idiom.
There is also a problem, when running a debug build of cargo, it causes cargo to panic and hangs. In non-debug builds, it hits this unhelpful line because it has become confused.
The problem is this line. When the
local
value is updated, the cached hash is invalidated. However, the invalidation needs to propagate to all parent units as well. Otherwise they write out their fingerprint with thedeps
value using the old hash.The panics in debug mode are due to the json validations. The round-trip fails because the cached hash values are incorrect. It hangs because of #6433.
I've been trying to think of some way to work around this, but haven't come up with any good ideas. @alexcrichton do you have any ideas how to invalidate the memoized hashes? Or maybe a completely different approach?
Note that this is similar in spirit to #6779, but I think the solution may be different.
Here is a sample test. Similar problems occur with
rerun-if-changed
, but env vars are a little easier to use. Each of the "FIXME" lines fails with current cargo.The text was updated successfully, but these errors were encountered: