
[CRDB-2743] sql: create builtin generator crdb_internal.probe_range #79546

Merged: 1 commit, Apr 25, 2022

Conversation

@Santamaura Santamaura commented Apr 6, 2022

Previously, it was difficult to diagnose kv layer health when
an incident occurred. This patch introduces the new virtual table
crdb_internal.probe_range, which uses the kvprober to probe
each range and determine whether the range can be reached.

resolves #61695

Release note: None

[Screenshot attached: Screen Shot 2022-04-20 at 1.15.17 PM]
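
For reference, the merged builtin is invoked like so (named crdb_internal.probe_ranges in the final commit; signature as exercised later in this thread):

```sql
SELECT range_id, error, end_to_end_latency_ms
FROM crdb_internal.probe_ranges(INTERVAL '1s', 'write');
```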

@cockroach-teamcity (Member)

This change is Reviewable

@Santamaura (Contributor Author)

One thing I am unsure of: in the case of a timeout, will the Prober.writePlanner update the next step or not? If it doesn't, and we iterate through ranges we've already visited, we may end up in an infinite loop.

@Santamaura Santamaura requested review from tbg, zachlite and koorosh April 7, 2022 15:00
@Santamaura Santamaura force-pushed the manual-prober branch 4 times, most recently from b7a896d to 4da1cbc on April 7, 2022 19:08
@tbg tbg requested a review from joshimhoff April 11, 2022 08:25
@tbg (Member) left a comment

Thanks Alex! I left some conceptual comments that I think will simplify things a bit, and will also address your question (by removing it).

Reviewed 5 of 5 files at r1, all commit messages.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @joshimhoff, @koorosh, @Santamaura, and @zachlite)


pkg/sql/sem/builtins/generator_builtins.go, line 478 at r1 (raw file):

		makeGeneratorOverload(
			tree.ArgTypes{
				{Name: "timeout", Typ: types.Int},

shouldn't this be something like types.IntervalDurationType or types.Interval?


pkg/sql/sem/builtins/generator_builtins.go, line 1749 at r1 (raw file):

// rangeKeyIterator is a ValueGenerator that iterates over all
// SQL keys in a target range.
// REFER TO HERE

?


pkg/sql/sem/builtins/generator_builtins.go, line 2436 at r1 (raw file):

	if err != nil {
		return nil, pgerror.Newf(
			pgcode.InFailedSQLTransaction,

Is this really the right code?


pkg/sql/sem/builtins/generator_builtins.go, line 2465 at r1 (raw file):

		prober:          *kvProber,
		ranges:          make(map[int64]int),
		remainingRanges: int64(numRanges),

This doesn't seem right. The range count changes out from under you through splits and merges, so any code that stores it like this is likely fishy.


pkg/sql/sem/builtins/generator_builtins.go, line 2481 at r1 (raw file):

// Next implements the tree.ValueGenerator interface.
func (p *probeRangeGenerator) Next(ctx context.Context) (bool, error) {
	if p.remainingRanges == 0 {

Yeah this isn't going to be correct. If a range splits while the probe is underway, you'll end up stopping short. I'm not sure what will happen if a range merges but it doesn't really matter, we need to fix this.

I think the root problem is that this code wants to use the planner "as is". Instead, we should refactor the planner to make the interesting parts of it more reusable, and then reuse only those parts. We need only two things here: iterate through all range descriptors (i.e. discover all of the ranges in the system, with a consistent view) and then probe each of them.

Thinking this through a bit, there actually isn't a good reason to use the planner's ability to discover ranges, as it's too complex and prober-specific for what we need. The code we want is really this:

```go
ranges, err := kvclient.ScanMetaKVs(ctx, p.txn, roachpb.Span{
	Key:    keys.MinKey,
	EndKey: keys.MaxKey,
})
```

This isn't streaming, which isn't ideal, but it's also what crdb_internal.ranges_no_leases uses, so when we fix that we'll automatically fix it here too. It's fine to use for now.

I'm also noticing that the internal_no_leases generator is much more compact and doesn't seem to need to define a new type, so maybe you can adopt this pattern here and forfeit a bunch of boilerplate? I might be missing something.

As for the probing itself, we "just" need to export proberOpsImpl:

```go
// proberOpsImpl is used to probe the kv layer.
type proberOpsImpl struct {
}

// We attempt to commit a txn that reads some data at the key.
func (p *proberOpsImpl) Read(key interface{}) func(context.Context, *kv.Txn) error {
	return func(ctx context.Context, txn *kv.Txn) error {
		_, err := txn.Get(ctx, key)
		return err
	}
}

// We attempt to commit a txn that puts some data at the key then deletes
// it. The test of the write code paths is good: We get a raft command that
// goes thru consensus and is written to the pebble log. Importantly, no
// *live* data is left at the key, which simplifies the kvprober, as then
// there is no need to clean up data at the key post range split / merge.
// Note that MVCC tombstones may be left by the probe, but this is okay, as
// GC will clean it up.
func (p *proberOpsImpl) Write(key interface{}) func(context.Context, *kv.Txn) error {
	return func(ctx context.Context, txn *kv.Txn) error {
		if err := txn.Put(ctx, key, putValue); err != nil {
			return err
		}
		return txn.Del(ctx, key)
	}
}
```

because that's what we want to invoke for the probe.

So in this PR, I would expect:

  • get list of descs (just like ranges_no_leases)
  • for each desc, invoke kvProbe{}.Write (or read depending on options), with much of the garnish that you have here already, and produce a row
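
Sketched out, that might look something like the following (a rough sketch only: the field names `rangeKVs`, `isWrite`, `timeout`, `db`, and `curr`, the `probeResult` type, and the probe-key choice are hypothetical, not the PR's actual code):

```go
// Start takes a consistent snapshot of all range descriptors, just like
// crdb_internal.ranges_no_leases does.
func (p *probeRangeGenerator) Start(ctx context.Context, txn *kv.Txn) error {
	kvs, err := kvclient.ScanMetaKVs(ctx, txn, roachpb.Span{
		Key:    keys.MinKey,
		EndKey: keys.MaxKey,
	})
	if err != nil {
		return err
	}
	p.rangeKVs = kvs
	return nil
}

// Next pops one descriptor per call, probes it, and stages a result row
// for Values(). A probe failure becomes the row's error column rather
// than failing the whole query.
func (p *probeRangeGenerator) Next(ctx context.Context) (bool, error) {
	if len(p.rangeKVs) == 0 {
		return false, nil
	}
	var desc roachpb.RangeDescriptor
	if err := p.rangeKVs[0].ValueProto(&desc); err != nil {
		return false, err
	}
	p.rangeKVs = p.rangeKVs[1:]
	op := p.ops.Read
	if p.isWrite {
		op = p.ops.Write
	}
	key := desc.StartKey.AsRawKey() // hypothetical probe-key choice
	err := contextutil.RunWithTimeout(ctx, "probe", p.timeout, func(ctx context.Context) error {
		return p.db.Txn(ctx, op(key))
	})
	p.curr = probeResult{rangeID: desc.RangeID, err: err}
	return true, nil
}
```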

@Santamaura (Contributor Author) left a comment

The comments did help, thanks 👍

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @joshimhoff, @koorosh, @tbg, and @zachlite)


pkg/sql/sem/builtins/generator_builtins.go, line 478 at r1 (raw file):

Previously, tbg (Tobias Grieger) wrote…

shouldn't this be something like types.IntervalDurationType or types.Interval?

Yeah makes more sense to be interval, changed.


pkg/sql/sem/builtins/generator_builtins.go, line 1749 at r1 (raw file):

Previously, tbg (Tobias Grieger) wrote…

?

A comment from when I was figuring out how builtin generators worked, removed.


pkg/sql/sem/builtins/generator_builtins.go, line 2436 at r1 (raw file):

Previously, tbg (Tobias Grieger) wrote…

Is this really the right code?

The error code is irrelevant now, removed.


pkg/sql/sem/builtins/generator_builtins.go, line 2465 at r1 (raw file):

Previously, tbg (Tobias Grieger) wrote…

This doesn't seem right. The range count changes out from under you through splits and merges, so any code that stores it like this is likely fishy.

Right that makes sense.


pkg/sql/sem/builtins/generator_builtins.go, line 2481 at r1 (raw file):

I think the root problem is that this code wants to use the planner "as is". Instead, we should refactor the planner to make the interesting parts of it more reusable, and then reuse only those parts. We need only two things here: iterate through all range descriptors (i.e. discover all of the ranges in the system, with a consistent view) and then probe each of them.

Yeah, I was iffy about using the current planner, but I wasn't sure whether we should get the ranges the way crdb_internal.ranges_no_leases does. I've refactored the generator to fetch ranges that way, and it now just uses Read or Write.

I'm also noticing that the internal_no_leases generator is much more compact and doesn't seem to need to define a new type, so maybe you can adopt this pattern here and forfeit a bunch of boilerplate? I might be missing something.

When I was looking over those, there didn't seem to be a way of passing arguments into generators created in that way; if it turns out they can take arguments, I can move it over.

@tbg (Member) commented Apr 14, 2022

Thanks! I pushed a commit with some fixes and a bunch of TODO(alex)es which are basically my review comments. Please integrate the code changes into your commit (if they make sense to you), no need to preserve my authorship.

@tbg (Member) left a comment

Reviewed 5 of 5 files at r2, 2 of 2 files at r3.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @joshimhoff, @koorosh, @tbg, and @zachlite)

@tbg tbg self-requested a review April 14, 2022 10:47
@tbg (Member) left a comment

Reviewed 1 of 1 files at r4, all commit messages.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @joshimhoff, @koorosh, and @zachlite)

@Santamaura Santamaura force-pushed the manual-prober branch 4 times, most recently from 2c70627 to 0ea1b2e on April 20, 2022 18:12
@Santamaura (Contributor Author)

I think I covered everything from the TODOs; let me know if anything is outstanding.

@Santamaura Santamaura marked this pull request as ready for review April 20, 2022 19:42
@Santamaura Santamaura requested a review from a team as a code owner April 20, 2022 19:42
@Santamaura Santamaura requested a review from a team April 20, 2022 19:42
@tbg (Member) left a comment

Reviewed 7 of 7 files at r5, all commit messages.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @joshimhoff, @koorosh, @Santamaura, and @zachlite)


pkg/sql/sem/builtins/generator_probe_ranges.go, line 74 at r5 (raw file):

				A write probe will effectively probe reads as well.
			`,
			// TODO(alex): add unit tests of this via a LogicTest. Make sure that the test doesn't verify anything that changes

Here's a TODO still. You have this test, though the "trace matches the string ..." part doesn't seem to be in it.


pkg/sql/sem/builtins/generator_probe_ranges.go, line 91 at r5 (raw file):

		EnumData: &types.EnumMetadata{
			LogicalRepresentations: enumMembers,
			PhysicalRepresentations: [][]byte{

I have no idea what's going on here, you may want to get a SQL person's review as well at least for these generator parts.


pkg/sql/sem/builtins/generator_probe_ranges.go, line 194 at r5 (raw file):

	}()

	ops := &kvprober.ProberOps{}

nit: you can make this a field on probeRangeGenerator. ProberOps is a struct{} so this line technically doesn't allocate but it saves future readers a double take.


pkg/sql/logictest/testdata/logic_test/generator_probe_ranges, line 17 at r5 (raw file):


query T
SELECT substring('$trace_str', 33, 34)

What's this doing? We want to check for evidence of a trace event that comes out of KV, which I'm sure this isn't doing.

@Santamaura (Contributor Author) left a comment

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (waiting on @joshimhoff, @koorosh, @tbg, and @zachlite)


pkg/sql/sem/builtins/generator_probe_ranges.go, line 74 at r5 (raw file):

Previously, tbg (Tobias Grieger) wrote…

Here's a TODO still. You have this test, though the "trace matches the string ..." part doesn't seem to be in it.

😄 I didn't realize we wanted to actually match 'proposing command'. Fixed.


pkg/sql/sem/builtins/generator_probe_ranges.go, line 91 at r5 (raw file):

Previously, tbg (Tobias Grieger) wrote…

I have no idea what's going on here, you may want to get a SQL person's review as well at least for these generator parts.

Good idea; if nobody reviews it in a while, I'll reach out to someone on SQL.


pkg/sql/sem/builtins/generator_probe_ranges.go, line 194 at r5 (raw file):

Previously, tbg (Tobias Grieger) wrote…

nit: you can make this a field on probeRangeGenerator. ProberOps is a struct{} so this line technically doesn't allocate but it saves future readers a double take.

Done.


pkg/sql/logictest/testdata/logic_test/generator_probe_ranges, line 17 at r5 (raw file):

Previously, tbg (Tobias Grieger) wrote…

What's this doing? We want to check for evidence of a trace event that comes out of KV which I am sure this isn't doing.

Changed so that we confirm the verbose trace includes 'proposing command'.
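
For reference, a minimal sketch of the check (hypothetical: this assumes the generator exposes a verbose_trace column; the real file is pkg/sql/logictest/testdata/logic_test/generator_probe_ranges):

```
query B
SELECT count(*) > 0
FROM crdb_internal.probe_ranges(INTERVAL '1s', 'write')
WHERE verbose_trace LIKE '%proposing command%'
----
true
```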

@tbg (Member) left a comment

:lgtm: thank you! Still think this is worth a glance from SQL but this may even happen after the merge IMO.

Reviewed 2 of 2 files at r6, all commit messages.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on @joshimhoff, @koorosh, @Santamaura, and @zachlite)


pkg/sql/sem/builtins/generator_probe_ranges.go, line 74 at r5 (raw file):

Previously, Santamaura (Alex Santamaura) wrote…

😄 I didn't realize we wanted to actually match 'proposing command'. Fixed.

Wanna remove the TODO?


pkg/sql/sem/builtins/generator_probe_ranges.go, line 125 at r6 (raw file):

	// The below are updated during calls to Next() throughout the lifecycle of
	// probeRangeGenerator.
	ops    *kvprober.ProberOps

I was thinking `ops kvprober.ProberOps`, and then in makeProbeRangeGenerator you can omit the field since the zero value is already good to go.
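
I.e., roughly (a sketch of the suggestion; other fields elided):

```go
type probeRangeGenerator struct {
	// ... other fields ...

	// kvprober.ProberOps is an empty struct, so its zero value is ready to
	// use and makeProbeRangeGenerator need not initialize this field.
	ops kvprober.ProberOps
}
```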


pkg/sql/logictest/testdata/logic_test/generator_probe_ranges, line 14 at r6 (raw file):


# test that the trace has a string matching `proposing command` to verify trace events
# from the kvserver write path are received

Nit:

# Test that .... are received.

Previously, it was difficult to diagnose kv layer health
when an incident occurred. This patch introduces a new virtual
table crdb_internal.probe_ranges which utilizes the kvprober
to probe each range to determine whether the range can be
reached.

resolves cockroachdb#61695

Release note: None
@Santamaura (Contributor Author) left a comment

Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 1 stale) (waiting on @joshimhoff, @koorosh, @tbg, and @zachlite)


pkg/sql/sem/builtins/generator_probe_ranges.go, line 74 at r5 (raw file):

Previously, tbg (Tobias Grieger) wrote…

Wanna remove the TODO?

Oops forgot. Removed.


pkg/sql/sem/builtins/generator_probe_ranges.go, line 125 at r6 (raw file):

Previously, tbg (Tobias Grieger) wrote…

I was thinking ops kvprober.ProberOps and then in makeProbeRangeGenerator you can omit the field since the zero value is already good to go.

👍 done.


pkg/sql/logictest/testdata/logic_test/generator_probe_ranges, line 14 at r6 (raw file):

Previously, tbg (Tobias Grieger) wrote…

Nit:

# Test that .... are received.

Done.

@Santamaura (Contributor Author)

bors r+

@craig (bot) commented Apr 25, 2022

Build succeeded:

@craig craig bot merged commit 6bbb282 into cockroachdb:master Apr 25, 2022
@Santamaura Santamaura deleted the manual-prober branch April 25, 2022 18:03
@joshimhoff (Collaborator)

I wonder if this tool would be available during a full-cluster SQL outage, such as the recent one involving a down system.lease. Maybe I'll run a test today!

@joshimhoff (Collaborator)

It worked fine:

root@:26257/?> select range_id, error, end_to_end_latency_ms from crdb_internal.probe_ranges(INTERVAL '1s', 'write');
  range_id | error | end_to_end_latency_ms
-----------+-------+------------------------
         1 |       |                    21
         2 |       |                    37
         3 |       |                    36
         4 |       |                    36
         5 |       |                    37
         6 |       |                    35
         7 | boom  |         9223372036854
         8 |       |                    34
         9 |       |                    36
        10 |       |                    37
        11 |       |                    35
        12 |       |                    36
        13 |       |                    35
        14 |       |                    36
        15 |       |                    18
        16 |       |                    36
        17 |       |                    37
        18 |       |                    36
        19 |       |                    35
        20 |       |                    37
        21 |       |                    35
        22 |       |                    35
        23 |       |                    37
        24 |       |                    35
        25 |       |                    36
        26 |       |                    36
        27 |       |                    36
        28 |       |                    38
        29 |       |                    35
        30 |       |                    35
        31 |       |                    36
        32 |       |                    35
        33 |       |                    35
        34 |       |                    36
        35 |       |                    35
        36 |       |                    35
        37 |       |                    37
        38 |       |                    35
        39 |       |                    35
        40 |       |                    36
        41 |       |                    35
        42 |       |                    36
        43 |       |                    35
        54 |       |                    36
        64 |       |                    36
(45 rows)

Here is the change I made locally to test:

~/g/s/g/c/cockroach [master] $ git diff
diff --git a/pkg/kv/kvserver/replica_send.go b/pkg/kv/kvserver/replica_send.go
index 3295886e91..78ccc46b7f 100644
--- a/pkg/kv/kvserver/replica_send.go
+++ b/pkg/kv/kvserver/replica_send.go
@@ -126,6 +126,10 @@ func (r *Replica) sendWithoutRangeID(
 ) (_ *roachpb.BatchResponse, rErr *roachpb.Error) {
        var br *roachpb.BatchResponse
 
+       if ba.RangeID == 7 {
+               return nil, roachpb.NewError(errors.New("boom"))
+       }
+
        if r.leaseholderStats != nil && ba.Header.GatewayNodeID != 0 {
                r.leaseholderStats.RecordCount(r.getBatchRequestQPS(ctx, ba), ba.Header.GatewayNodeID)
        }

I'm going to keep exploring this Q, but I'll hold off on posting anything, unless I learn something interesting.

@joshimhoff (Collaborator)

Maybe I'll write an integration test that makes all system ranges that we don't expect to be hard deps of crdb_internal.probe_range unavailable and sees what happens. Not sure if we should merge it, but it'll be fun at least.

@joshimhoff (Collaborator) commented May 6, 2022

With all ranges with range IDs >= 2 unavailable, connecting to an insecure five-node cluster and querying crdb_internal.probe_ranges is possible.

root@:26257/defaultdb> select range_id, error from crdb_internal.probe_ranges(INTERVAL '1s', 'write');
  range_id | error
-----------+--------
         1 |
         2 | boom
         3 | boom
         4 | boom
         5 | boom
         6 | boom
         7 | boom
         8 | boom
         9 | boom
        10 | boom
        11 | boom
        12 | boom
        13 | boom
        14 | boom
        15 | boom
        16 | boom
        17 | boom
        18 | boom
        19 | boom
        20 | boom
        21 | boom
        22 | boom
        23 | boom
        24 | boom
        25 | boom
        26 | boom
        27 | boom
        28 | boom
        29 | boom
        30 | boom
        31 | boom
        32 | boom
        33 | boom
        34 | boom
        35 | boom
        36 | boom
        37 | boom
        38 | boom
        39 | boom
        40 | boom
        41 | boom
        42 | boom
        43 | boom
        44 | boom
(44 rows)


Time: 68ms total (execution 68ms / network 0ms)

OTOH, if the range with range ID = 1 is also down, we get an error. This makes sense, since probing requires an available meta2 range.

root@:26257/defaultdb> select range_id, error from crdb_internal.probe_ranges(INTERVAL '1s', 'write');
ERROR: boom

This is really awesome!

Some Qs:

  1. Have we considered backporting this? I feel pretty confident that having this in CC will both improve time to fix and reduce bogus KV escalations.
  2. Does anyone here want me to write an integration test to make sure we maintain the nice availability property of crdb_internal.probe_range? The property is: crdb_internal.probe_range should be available unless meta2 is down.
  3. If all ranges but meta2 are down, will connecting to a secure cluster work, assuming I use the root user plus a client credential? I hope so.

@joshimhoff (Collaborator)

Tiny follow-up: #81107

@joshimhoff (Collaborator)

If all ranges but meta2 are down, will connecting to a secure cluster work, assuming I use the root user plus a client credential? I hope so.

I tested this. I could connect as root with root client credentials, even with all ranges down, including meta2. Yay! And this comment confirms that this behavior is a design goal:

// - if the user is root, the user is reported to exist immediately
// without querying system.users at all. The password retrieval
// is delayed until actually needed by the authentication method.
// This way, if the client presents a valid TLS certificate
// the password is not even needed at all. This is useful for e.g.
// cockroach node status.

craig bot pushed a commit that referenced this pull request May 12, 2022
81107: sql: trace meta2 scans made by crdb_internal.probe_ranges r=joshimhoff a=joshimhoff

**sql: trace meta2 scans made by crdb_internal.probe_ranges**

With this commit, CRDB traces the meta2 scan made by
crdb_internal.probe_ranges. If the scan fails, the trace is attached to the
error returned to the SQL client. This way, outages involving a down meta2
range are observable.

According to my testing, meta2 is the only system range that can make
crdb_internal.probe_ranges unavailable. As a result, with this commit, we
should get a trace of a query to the unavailable range, regardless of which
range is unavailable, either in the SQL table output of
crdb_internal.probe_ranges or in an error string.

Release note: None.
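
The shape of the change is roughly as follows (a loose sketch; the tracing option and method names here are my assumptions, not necessarily this PR's exact code):

```go
// Run the meta2 scan under a verbose tracing span; on failure, attach
// the recording to the error returned to the SQL client.
ctx, sp := tracing.EnsureChildSpan(ctx, tracer, "meta2scan",
	tracing.WithRecording(tracingpb.RecordingVerbose))
kvs, err := kvclient.ScanMetaKVs(ctx, txn, roachpb.Span{
	Key:    keys.MinKey,
	EndKey: keys.MaxKey,
})
rec := sp.FinishAndGetRecording(tracingpb.RecordingVerbose)
if err != nil {
	return errors.Wrapf(err, "%s", rec)
}
_ = kvs // on success, proceed to probe each descriptor as before
```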

------------------------

This is based on some thinking & testing I did at #79546 (comment).

With this commit, we get:

```
root@:26257/defaultdb> select range_id, error from crdb_internal.probe_ranges(INTERVAL '1s', 'write');
ERROR:      0.000ms      0.000ms    === operation:meta2scan _verbose:1 client:127.0.0.1:49588 node:1 user:root
     0.010ms      0.010ms        === operation:txn coordinator send _verbose:1 client:127.0.0.1:49588 node:1 txnID:be5903f5-1817-481b-8dcf-2e2bc8b0f052 user:root
     0.020ms      0.010ms            === operation:dist sender send _verbose:1 client:127.0.0.1:49588 node:1 txn:be5903f5 user:root
     0.063ms      0.043ms            event:kv/kvclient/kvcoord/range_iter.go:175 [n1,client=127.0.0.1:49588,user=root,txn=be5903f5] querying next range at /Meta2/"\x00"
     0.122ms      0.059ms            event:kv/kvclient/kvcoord/dist_sender.go:2032 [n1,client=127.0.0.1:49588,user=root,txn=be5903f5] r1: sending batch 1 Scan to (n1,s1):1
     0.132ms      0.010ms            event:rpc/nodedialer/nodedialer.go:161 [n1,client=127.0.0.1:49588,user=root,txn=be5903f5] sending request to local client
     0.136ms      0.004ms                === operation:/cockroach.roachpb.Internal/Batch _verbose:1 node:1 span.kind:server
     0.144ms      0.008ms                event:server/node.go:996 [n1] node received request: 1 Scan
     0.174ms      0.030ms                event:kv/kvserver/store_send.go:157 [n1,s1] executing Scan [/Meta2/"\x00",/Meta2/Max), [txn: be5903f5], [can-forward-ts]
     0.335ms      0.161ms                event:server/node.go:1009 [n1] error from stores.Send: boom: "sql txn" meta={id=be5903f5 key=/Min pri=0.02292157 epo=0 ts=1651868092.726275000,0 min=1651868092.726275000,0 seq=0} lock=false stat=PENDING rts=1651868092.726275000,0 wto=false gul=1651868093.226275000,0
     0.384ms      0.252ms            event:kv/kvclient/kvcoord/dist_sender.go:2144 [n1,client=127.0.0.1:49588,user=root,txn=be5903f5] boom: "sql txn" meta={id=be5903f5 key=/Min pri=0.02292157 epo=0 ts=1651868092.726275000,0 min=1651868092.726275000,0 seq=0} lock=false stat=PENDING rts=1651868092.726275000,0 wto=false gul=1651868093.226275000,0
     0.442ms      0.058ms            event:kv/kvclient/kvcoord/dist_sender.go:1674 [n1,client=127.0.0.1:49588,user=root,txn=be5903f5] reply error Scan [/Meta2/"\x00",/Meta2/Max), [txn: be5903f5], [can-forward-ts]: boom: "sql txn" meta={id=be5903f5 key=/Min pri=0.02292157 epo=0 ts=1651868092.726275000,0 min=1651868092.726275000,0 seq=0} lock=false stat=PENDING rts=1651868092.726275000,0 wto=false gul=1651868093.226275000,0
: boom
```

If I run a test like this:

```
~/g/s/g/c/cockroach [tweak_crdb_internal_probe] $ git diff
diff --git a/pkg/kv/kvserver/replica_send.go b/pkg/kv/kvserver/replica_send.go
index 3295886e91..ff7a1ed4df 100644
--- a/pkg/kv/kvserver/replica_send.go
+++ b/pkg/kv/kvserver/replica_send.go
@@ -12,7 +12,10 @@ package kvserver
+var sigCh = func() chan os.Signal {
+       ch := make(chan os.Signal, 1)
+       signal.Notify(ch, syscall.SIGHUP)
+       return ch
+}()
+
+var weDead atomic.Int32
+
 // sendWithoutRangeID used to be called sendWithRangeID, accepted a `_forStacks
 // roachpb.RangeID` argument, and had the description below. Ever since Go
 // switched to the register-based calling convention though, this stopped
@@ -126,6 +138,16 @@ func (r *Replica) sendWithoutRangeID(
 ) (_ *roachpb.BatchResponse, rErr *roachpb.Error) {
        var br *roachpb.BatchResponse
 
+       select {
+       case <-sigCh:
+               log.Warningf(ctx, "time to drop incoming raft messages for range")
+               weDead.Store(1)
+       default:
+       }
+       if weDead.Load() == 1 && ba.RangeID >= 1 {
+               return nil, roachpb.NewError(errors.New("boom"))
+       }
+
        if r.leaseholderStats != nil && ba.Header.GatewayNodeID != 0 {
                r.leaseholderStats.RecordCount(r.getBatchRequestQPS(ctx, ba), ba.Header.GatewayNodeID)
        }
```

Without this commit, we just get:

```
root@:26257/?> select range_id, error from crdb_internal.probe_ranges(INTERVAL '1s', 'write');
ERROR: boom
```

Co-authored-by: Josh Imhoff <josh@cockroachlabs.com>
@jreut commented Jul 5, 2022

@Santamaura Is it possible to backport this function to v22.1? We would love to use it sooner than later!

@Santamaura (Contributor Author)

@Santamaura Is it possible to backport this function to v22.1? We would love to use it sooner than later!

Sure, backport PR has been opened.

@jreut commented Jul 5, 2022

@Santamaura Is it possible to backport this function to v22.1? We would love to use it sooner than later!

Sure, backport PR has been opened.

Out of curiosity, could I have done this myself? Are there docs I should have read first?

@Santamaura (Contributor Author) commented Jul 5, 2022

I made the backport via our backport tool. Doc on backports is here: https://cockroachlabs.atlassian.net/wiki/spaces/CRDB/pages/900005932/Backporting+a+change+to+a+release+branch

For this particular PR, possibly, since there were no complicated merge conflicts; for large PRs it's usually better for the author to do it, since they have better context.

@joshimhoff (Collaborator)

Assuming it's not difficult, can we also backport this to 21.2? Thanks for this great tool.

@Santamaura (Contributor Author)

Sorry @joshimhoff, I was working on something and completely forgot about the 21.2 backport request; I've opened one now.

@joshimhoff (Collaborator)

Thanks! Appreciate it.

craig bot pushed a commit that referenced this pull request Jul 28, 2023
…107752 #107802 #107803

106508: util/must: add runtime assertion API r=erikgrinaker a=erikgrinaker

For details and usage examples, see the [package documentation](https://github.com/erikgrinaker/cockroach/blob/must/pkg/util/must/must.go).

---

This patch adds a convenient and canonical API for runtime assertions, inspired by the Testify package used for Go test assertions. It is intended to encourage liberal use of runtime assertions throughout the code base, by making it as easy as possible to write assertions that follow best practices. It does not attempt to reinvent the wheel, but instead builds on existing infrastructure.

Assertion failures are fatal in all non-release builds, including roachprod clusters and roachtests, to ensure they will be noticed. In release builds, they instead log the failure and report it to Sentry (if enabled), and return an assertion error to the caller for propagation. This avoids excessive disruption in production environments, where an assertion failure is often scoped to an individual RPC request, transaction, or range, and crashing the node can turn a minor problem into a full-blown outage. It is still possible to kill the node when appropriate via `log.Fatalf`, but this should be the exception rather than the norm.

It also supports expensive assertions that must be compiled out of normal dev/test/release builds for performance reasons. These are instead enabled in special test builds.

This is intended to be used instead of other existing assertion mechanisms, which have various shortcomings:

* `log.Fatalf`: kills the node even in release builds, which can cause severe disruption over often minor issues.

* `errors.AssertionFailedf`: only suitable when we have an error return path, does not fatal in non-release builds, and are not always notified in release builds.

* `logcrash.ReportOrPanic`: panics rather than fatals, which can leave the node limping along. Requires the caller to implement separate assertion handling in release builds, which is easy to forget. Also requires propagating cluster settings, which aren't always available.

* `buildutil.CrdbTestBuild`: only enabled in Go tests, not roachtests, roachprod clusters, or production clusters.

* `util.RaceEnabled`: only enabled in race builds. Expensive assertions should be possible to run without the additional overhead of the race detector.

For more details and examples, see the `must` package documentation.

Resolves #94986.
Epic: none
Release note: None

107094: streamingest: unskip TestTenantStreamingUnavailableStreamAddress r=lidorcarmel a=lidorcarmel

Changing a few things to get this test to pass under stress:
- use 50 ranges instead of 10, because there are already 50-ish system ranges,
  so if we write only 10 more ranges those might not get distributed on all
  servers.
- avoid reading from the source cluster after stopping a node, it's flaky,
  see #107499 for more info.

Epic: none
Fixes: #107023
Fixes: #106865

Release note: None

107717: server/profiler: remove `server.cpu_profile.enabled` setting r=xinhaoz a=xinhaoz

CPU profiling can be enabled by setting the cluster setting `server.cpu_profile.cpu_usage_combined_threshold`. This makes `server.cpu_profile.enabled` redundant and makes it more difficult and confusing to enable CPU profiling. This commit removes the `server.cpu_profile.enabled` setting entirely. Note that both default values for the cluster settings set profiling off.

Closes: #102024

Release note (sql change): The cluster setting
`server.cpu_profile.enabled` has been removed.
`server.cpu_profile.cpu_usage_combined_threshold` can enable and disable cpu profiling.

107720: cli: add probe_range in debug.zip r=j82w a=j82w

PR #79546 introduces `crdb_internal.probe_range`. This PR adds `crdb_internal.probe_range` to the debug.zip. The LIMIT gives a rough ~1000ms*100 bound on how long this can take, so that running debug.zip against an unavailable cluster won't take too long.

closes: #80360

Release note (cli change): The debug.zip now includes the `crdb_internal.probe_range` table with a limit of 100 rows to keep the query from taking too long.
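
The query is presumably of roughly this shape (a sketch inferred from the description above; the exact statement lives in the CLI code):

```sql
SELECT range_id, error, end_to_end_latency_ms
FROM crdb_internal.probe_ranges(INTERVAL '1000ms', 'write')
ORDER BY end_to_end_latency_ms DESC
LIMIT 100;
```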

107727: server: deflake TestServerShutdownReleasesSession r=rafiss a=rafiss

The tenant was not being fully stopped, so the test could encounter flakes.

fixes #107592
Release note: None

107742: ui: show txn fingerprint details page with unspecified app r=xinhaoz a=xinhaoz

Previously, when the app was not specified in the URL search params for the txn details fingerprint page, the page would fail to load. This commit allows the page to load when no app is specified, as long as the payload contains a fingerprint id matching the requested page. The first matching fingerprint id is loaded.

Additionally, the TransactionDetailsLink will not include the appNames search param unless the provided prop is non-nullish.

Fixes: #107731

Release note (bug fix): Txn fingerprint details page in the console UI should load with the fingerprint details even if no app is specified in the URL.




Demo:
https://www.loom.com/share/810308d3dcd74ca888c42287ebafaecf

107745: kvserver: fix test merge queue when grunning unsupported r=irfansharif a=kvoli

`TestMergeQueue/load-based-merging/switch...below-threshold` asserts that switching the split objective between CPU and QPS will not cause ranges to merge, even if their pre-switch load qualified them for merging.

This test was broken when `grunning` was unsupported, as the objective never actually switches to anything other than QPS.

Add a check for `grunning` support, and assert that a merge occurs if unsupported.

Fixes: #106937
Epic: none
Release note: None

107749: opt: add enable_durable_locking_for_serializable session variable r=DrewKimball,nvanbenschoten a=michae2

Follow-up from #105857

This commit amends 6a3e43d to add a session variable to control whether guaranteed-durable locks are used under serializable isolation.

Informs: #100194

Epic: CRDB-25322

Release note (sql change): Add a new session variable, `enable_durable_locking_for_serializable`, which controls locking durability under serializable isolation. With this set to true, SELECT FOR UPDATE locks, SELECT FOR SHARED locks, and constraint check locks (e.g. locks acquired during foreign key checks if
`enable_implicit_fk_locking_for_serializable` is set to true) will be guaranteed-durable under serializable isolation, meaning they will always be held to transaction commit. (These locks are always guaranteed-durable under weaker isolation levels.)

By default, under serializable isolation these locks are best-effort rather than guaranteed-durable, meaning in some cases (e.g. leaseholder transfer, node loss, etc.) they could be released before transaction commit. Serializable isolation does not rely on locking for correctness, only using it to improve performance under contention, so this default is a deliberate choice to avoid the performance overhead of lock replication.

107752: changefeedccl: prevent deadlock in TestChangefeedKafkaMessageTooLarge r=miretskiy a=jayshrivastava

Previously, this test would deadlock due to kafka retrying messages too many times. These messages are stored in a buffer of size 1024 created by the CDC testing infra: https://github.com/cockroachdb/cockroach/blob/5c3f96d38cdc3a2d953ca3ffb1e39e97d7e5110e/pkg/ccl/changefeedccl/testfeed_test.go#L1819

The test asserts that 2000 messages pass through the buffer. When the test finishes, it stops reading from the buffer. The problem is that due to retries, there may be more messages sent to the buffer than that are read out of the buffer. Even after the 2000 messages are read and the test is shutting down, the sink may be blocked trying to put resolved messages (plus retries) in the buffer. If this happens, the changefeed resumer (same goroutine as the kafka sink) gets blocked and does not terminate when the job is cancelled at the end of the test.

This change caps the number of retries at 200 for this test, so there should be no more than 200 extra messages plus a few resolved messages during this test. This is far less than the buffer size of 1024.

See detailed explanation in #107591.

Fixes: #107591
Epic: none
Release note: None

107802: teamcity-trigger: don't start a job for an empty target r=healthy-pod a=rickystewart

This makes no sense, so skip these cases.

Closes: #107779
Closes: #107780
Closes: #107781

Epic: none
Release note: None

107803: githubpost: set `map` field if `null` r=healthy-pod a=rickystewart

Go is a really good language.

Informs: #107779

Epic: none
Release note: None

Co-authored-by: Erik Grinaker <grinaker@cockroachlabs.com>
Co-authored-by: Lidor Carmel <lidor@cockroachlabs.com>
Co-authored-by: Xin Hao Zhang <xzhang@cockroachlabs.com>
Co-authored-by: j82w <jwilley@cockroachlabs.com>
Co-authored-by: Rafi Shamim <rafi@cockroachlabs.com>
Co-authored-by: Austen McClernon <austen@cockroachlabs.com>
Co-authored-by: Michael Erickson <michae2@cockroachlabs.com>
Co-authored-by: Jayant Shrivastava <jayants@cockroachlabs.com>
Co-authored-by: Ricky Stewart <ricky@cockroachlabs.com>