roachtest: jepsen subcritical-skews tests are skipped due to ntp rate limiting #35599
SHA: https://github.com/cockroachdb/cockroach/commits/5ebfeec052f9cee4e63757defe7c9120643293db
Parameters:
To repro, try:
Failed test: https://teamcity.cockroachdb.com/viewLog.html?buildId=1174810&tab=buildLog
SHA: https://github.com/cockroachdb/cockroach/commits/7ce9188c6e64465d9dcb9f0ca0f113dd0e584da0
Parameters:
To repro, try:
Failed test: https://teamcity.cockroachdb.com/viewLog.html?buildId=1178908&tab=buildLog
The subcritical-skews nemesis frequently resynchronizes with NTP. This has recently started failing because we're getting rate-limited by the NTP server (it hard-codes ntp.ubuntu.com). We need to either
See cockroachdb#35599
Release note: None
35284: storage,kv: make transaction deadline exceeded errors retriable r=andreimatei a=andreimatei

Before this patch, they were opaque TransactionStatusErrors. The belief is that we should only be seeing such errors when a transaction is pushed by minutes. Shockingly, this seems to happen enough in our tests, for example as described here: #18684 (comment)

This patch marks the error as retriable, since it technically is.

This patch also changes the semantics of the EndTransactionRequest.Deadline field to make it exclusive so that it matches the nature of SQL leases. No migration needed.

Touches #18684

Release note (sql change): "transaction deadline exceeded" errors are now returned to the client with a retriable code.

35793: storage: fix TestRangeInfo flake and re-enable follower reads by default r=ajwerner a=ajwerner

This PR addresses a test flake introduced by enabling follower reads in conjunction with #35130 which makes follower reads more generally possible in the face of lease transfer.

Fixes #35758.

Release note: None

35865: roachtest: Skip flaky jepsen nemesis r=tbg a=bdarnell

See #35599

Release note: None

Co-authored-by: Andrei Matei <andrei@cockroachlabs.com>
Co-authored-by: Andrew Werner <ajwerner@cockroachlabs.com>
Co-authored-by: Ben Darnell <ben@bendarnell.com>
Clearing the milestone so this gets re-triaged.
While looking at other issues connected to the Jepsen tests, I found that recent Jepsen packages use pool.ntp.org instead of ntp.ubuntu.org. I changed it and gave it a try, and to my surprise we are not throttled by the pool and I see no more complaints in the log. Since the server address is hardcoded in our tests, this should be a quick win that lets us re-enable the tests.
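To illustrate the idea of not pinning the nemesis to a single NTP host, here is a minimal Python sketch. It is not the Jepsen or roachtest code: `resync_command`, `backoff_delays`, and `DEFAULT_NTP_SERVER` are hypothetical names, and the use of `ntpdate -b` as the resync command is an assumption. It only shows a configurable server with pool.ntp.org as the default, plus capped exponential backoff between resync attempts to reduce the chance of being rate-limited.

```python
import shlex

# Assumption: default to the pool host mentioned in this thread.
DEFAULT_NTP_SERVER = "pool.ntp.org"


def resync_command(server=DEFAULT_NTP_SERVER):
    """Build a shell command that steps the clock against `server`.

    Assumes `ntpdate` is the resync tool; `-b` forces a step instead of a slew.
    """
    return "ntpdate -b " + shlex.quote(server)


def backoff_delays(attempts, base=1.0, cap=30.0):
    """Return capped exponential delays (in seconds) to wait between retries."""
    return [min(cap, base * 2 ** i) for i in range(attempts)]


if __name__ == "__main__":
    print(resync_command())
    print(backoff_delays(6))
```

Keeping the server as a parameter means a future change of NTP host becomes a configuration change rather than an edit to the test itself.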
With the Jepsen change in place, I'll put up a diff and see whether it works. Running those tests with roachtest from dev looked fine.
cc @cockroachdb/test-eng
112710: roachtest: reenable Jepsen subcritical-skews test r=DarrylWong a=renatolabs

The Jepsen version we are using already moved from `ntp.ubuntu.org` to `pool.ntp.org`. We should be able to run these tests again.

https://github.com/cockroachdb/jepsen/blob/cdeef40a0cd24af0c989e0a7990ee1c7fa948f43/cockroachdb/src/jepsen/cockroach/time.clj#L27

Fixes: #35599

Release note: None

Co-authored-by: Renato Costa <renato@cockroachlabs.com>
SHA: https://github.com/cockroachdb/cockroach/commits/a119a3a158725c9e3f9b8084d9398601c0e67007
Parameters:
To repro, try:
Failed test: https://teamcity.cockroachdb.com/viewLog.html?buildId=1170795&tab=buildLog
Jira issue: CRDB-4573