release-19.2: storage: write to AbortSpan less, clean up AbortSpan more #45553

nvanbenschoten · 2020-02-29T22:23:20Z

Backport 7/7 commits from #42765.

/cc @cockroachdb/release

This PR introduces two improvements to our handling of the AbortSpan. It also introduces the first set of comprehensive testing around AbortSpan entry state transitions, which was sorely missing.

This comes after a few different customer issues that at least at first glance appeared to be AbortSpan leaks. There's still more to do to resolve those, mostly by improving GC, but anything we can do here on the frontend to reduce the number of AbortSpan entries that need to be GCed in the first place helps.

Clean up span on non-poisoning, aborting EndTransaction request

Fixes #29128.

Before this change, an EndTransaction request sent to rollback a transaction record would not remove any AbortSpan entries, even if its own Poison flag was set to false. This allowed AbortSpan entries to leak. This commit fixes this behavior by removing the AbortSpan entry in this case.

There were concerns raised in #29128 about this being safe. It turns out that we already do this on every Range except the transaction record's Range during normal intent resolution, so this isn't introducing any new concerns.

Only write AbortSpan entries if intents are removed

This reduces the frequency of AbortSpan entries that can be abandoned even without a transaction coordinator failure. Specifically, it protects against the case where intent resolution races with a transaction coordinator cleaning up its own transaction record and intents. This can happen for both aborted and committed transactions.

In the first case, a pusher might find a transaction's intent and then find its record to be aborted after that transaction had cleanly rolled back its own intents. Even though the transaction's coordinator had already cleaned up and potentially "unpoisoned" AbortSpans, the pusher would happily re-introduce AbortSpan records when it goes to resolve the intents that were already cleaned up. These AbortSpan entries would be fully abandoned and have to wait out the GC.

Similarly, in the second case, the transaction might have committed. Here, the pushee might hit an intent and the txn coordinator might clean up and auto-GC its txn record before the pushee arrives at the txn record. Once the pushee gets there, it would mistake the txn for aborted (by design) and proceed to write an AbortSpan record where the intent it had once observed had been (not by design).

We can tell both of these cases by simply recognizing whether intent resolution actually succeeds. If intent resolution doesn't find an intent, then we might be in either case. That's fine, because we only need to ever poison the abort span if we actually remove an intent that could confuse a zombie transaction.

…st.go This mirrors `cmd_truncate_log.go`.

Up until this point, we had no testing around this. For instance, not a single test would catch it if we removed the deletion path from SetAbortSpan. Release note: None

…ction request Fixes cockroachdb#29128. Before this change, an EndTransaction request sent to rollback a transaction record would not remove any AbortSpan entries, even if its own Poison flag was set to false. This allowed AbortSpan entries to leak. This commit fixes this behavior by removing the AbortSpan entry in this case. There were concerns raised in cockroachdb#29128 about this being safe. It turns out that we already do this on every Range except the transaction record's Range, so this isn't introducing any new concerns. Release note (bug fix): AbortSpan records are now cleaned up more aggresively when doing so is known to be safe.

This reduces the frequency of AbortSpan entries that can be abandoned even without a transaction coordinator failure. Specifically, it protects against the case where intent resolution races with a transaction coordinator cleaning up its own transaction record and intents. This can happen for both aborted and committed transactions. In the first case, a pusher might find a transaction's intent and then find its record to be aborted after that transaction had cleanly rolled back its own intents. Even though the transaction's coordinator had already cleaned up and potentially "unpoisoned" AbortSpans, the pusher would happily re-introduce AbortSpan records when it goes to resolve the intents that were already cleaned up. These AbortSpan entries would be fully abandoned and have to wait out the GC. Similarly, in the second case, the transaction might have committed. Here, the pushee might hit an intent and the txn coordinator might clean up and auto-GC its txn record before the pushee arrives at the txn record. Once the pushee gets there, it would mistake the txn for aborted (by design) and proceed to write an AbortSpan record where the intent it had once observed had been (not by design). We can tell both of these cases by simply recognizing whether intent resolution actually succeeds. If intent resolution doesn't find an intent, then we might be in either case. That's fine, because we only need to ever poison the abort span if we actually remove an intent that could confuse a zombie transaction. Release note: None

Return an error if not. Release note: None

cockroach-teamcity · 2020-02-29T22:23:32Z

This change is

tbg · 2020-03-02T08:12:28Z

LGTM

Purely a rename. Release note: None

nvanbenschoten added 5 commits February 29, 2020 16:49

storage/batcheval: rename truncate_log_test.go to cmd_truncate_log_te…

123f39c

…st.go This mirrors `cmd_truncate_log.go`.

storage/batcheval: test requests that modify the AbortSpan

1485f7a

Up until this point, we had no testing around this. For instance, not a single test would catch it if we removed the deletion path from SetAbortSpan. Release note: None

storage: assert that Poison=false for EndTxn(Commit=true) reqs

ab67a06

Return an error if not. Release note: None

nvanbenschoten requested a review from tbg February 29, 2020 22:23

nvanbenschoten requested a review from a team as a code owner February 29, 2020 22:23

nvanbenschoten mentioned this pull request Feb 29, 2020

storage: write to AbortSpan less, clean up AbortSpan more #42765

Merged

tbg approved these changes Mar 2, 2020

View reviewed changes

storage/batcheval: rename SetAbortSpan to UpdateAbortSpan

8ae6eac

Purely a rename. Release note: None

nvanbenschoten force-pushed the backport19.2-42765 branch from de7eeec to 8ae6eac Compare March 2, 2020 21:41

nvanbenschoten merged commit a7db18b into cockroachdb:release-19.2 Mar 2, 2020

nvanbenschoten deleted the backport19.2-42765 branch March 2, 2020 22:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

release-19.2: storage: write to AbortSpan less, clean up AbortSpan more #45553

release-19.2: storage: write to AbortSpan less, clean up AbortSpan more #45553

nvanbenschoten commented Feb 29, 2020

cockroach-teamcity commented Feb 29, 2020

tbg commented Mar 2, 2020

release-19.2: storage: write to AbortSpan less, clean up AbortSpan more #45553

release-19.2: storage: write to AbortSpan less, clean up AbortSpan more #45553

Conversation

nvanbenschoten commented Feb 29, 2020

Clean up span on non-poisoning, aborting EndTransaction request

Only write AbortSpan entries if intents are removed

cockroach-teamcity commented Feb 29, 2020

tbg commented Mar 2, 2020