(re)Formalize SQLError in VReplication, add underlying wrap/unwrap functionality #12327

shlomi-noach · 2023-02-13T07:50:11Z

Description

Fixes #12326

This PR re-formalizes SQLError in vreplication in two ways:

Uses vterrors.Wrapf(...) as opposed to fmt.Errorf(...)
As fallback method, uses NewSQLErrorFromError() as suggested by @systay in Bug Report: regression, VReplication errors lost their type, converted to errorString #12326 (comment)

While here:

Added Unwrap and UnwrapAll functions in vterrors
Using said functions in NewSQLErrorFromError() to first (attempt to) get the original error object, before resorting to parsing.

Added a bunch of unit tests.

Related Issue(s)

Fixes #12326
Precondition for #12323
Precondition for #12325
Formalizes changes made in #10828

Checklist

"Backport to:" labels have been added if this change should be back-ported
Tests were added or are not required
Documentation was added or is not required

Deployment Notes

…nctionality Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>

vitess-bot · 2023-02-13T07:50:14Z

mattlord

The wrapping parts makes sense to me, but unless I'm missing something we should remove the utils.go changes.

mattlord · 2023-02-14T02:24:16Z

go/vt/vterrors/vterrors.go

+	for wasWrapped {
+		wasWrapped, err = Unwrap(err)
+	}


I feel like we should probably add a limit here to prevent never getting out of here some odd reason.

The logic is legit IMHO and the unit tests make good coverage. Please give this another thought. Would you want to see code that counts to 50 and then fails? Would that code look clearer or even more confusing? Please let me know and I'll do whatever you think.

Yeah, what to do when this fails due to hitting the limit would be unknown. We have some protection from infinite recursion and would panic: golang/go@757e0de

So I'm totally fine with it as-is.

go/vt/vttablet/tabletmanager/vreplication/utils.go

shlomi-noach · 2023-02-14T06:27:07Z

but unless I'm missing something we should remove the utils.go changes.

The addition to utils.go, ie the use of NewSQLErrorFromError(), is a fallback technique, designed to prevent future regressions. The function NewSQLErrorFromError() parses the text of an error, and reconstructs a SqlError object if the text seems to indicate a SQLError. I cannot design a endtoend or unit test that validates that any error path does indeed produce a real SqlError; it's impossible to validate that no one ever wraps an error with fmt.Errorf(); that's because we can't anticipate any and all errors, or any and all code paths. And some errors actually will not be SqlError.

So the idea of this safeguard is very appealing and I think we should keep it.

Now, as it happens, the current implementation NewSQLErrorFromError() always returns a SqlError, even if the error really has nothing to do with SQL. That's the implementation and I chose not to deal with it in this PR. I've already opened 6 different PRs for tackling the flaky tests, going down quite a few rabbit holes. So I wanted to stop at some point. I confirm to the existing behavior, and maybe in a future PR I'll tackle the NewSQLErrorFromError() logic.

At any case, if the error has nothing to do with SQL, you get a error object, which you need to cast to SqlError, that has sqlErr.Num == mysql.ERUnknownError. That's the existing logic and I agree it's confusing. But I still prefer to not try and fix everything in this PR. There's a dozen or more files that use this pattern and I wanted to keep this PR focused.

go/mysql/sql_error.go

Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>

rohit-nayak-ps

lgtm

shlomi-noach · 2023-02-21T17:08:51Z

@mattlord since you had comments, can you please take another look?

mattlord · 2023-02-21T17:30:43Z

go/vt/vterrors/vterrors.go

+	for wasWrapped {
+		wasWrapped, err = Unwrap(err)
+	}


Yeah, what to do when this fails due to hitting the limit would be unknown. We have some protection from infinite recursion and would panic: golang/go@757e0de

So I'm totally fine with it as-is.

mattlord · 2023-02-21T17:32:35Z

but unless I'm missing something we should remove the utils.go changes.

The addition to utils.go, ie the use of NewSQLErrorFromError(), is a fallback technique, designed to prevent future regressions. The function NewSQLErrorFromError() parses the text of an error, and reconstructs a SqlError object if the text seems to indicate a SQLError. I cannot design a endtoend or unit test that validates that any error path does indeed produce a real SqlError; it's impossible to validate that no one ever wraps an error with fmt.Errorf(); that's because we can't anticipate any and all errors, or any and all code paths. And some errors actually will not be SqlError.

So the idea of this safeguard is very appealing and I think we should keep it.

Now, as it happens, the current implementation NewSQLErrorFromError() always returns a SqlError, even if the error really has nothing to do with SQL. That's the implementation and I chose not to deal with it in this PR. I've already opened 6 different PRs for tackling the flaky tests, going down quite a few rabbit holes. So I wanted to stop at some point. I confirm to the existing behavior, and maybe in a future PR I'll tackle the NewSQLErrorFromError() logic.

At any case, if the error has nothing to do with SQL, you get a error object, which you need to cast to SqlError, that has sqlErr.Num == mysql.ERUnknownError. That's the existing logic and I agree it's confusing. But I still prefer to not try and fix everything in this PR. There's a dozen or more files that use this pattern and I wanted to keep this PR focused.

Makes sense (I read through what that code does and how too). Thanks!

Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>

shlomi-noach · 2023-02-21T18:37:01Z

Not sure what changed that a bunch of vtgate_* CI tests consistently fail. I merged main but to no effect.

shlomi-noach · 2023-02-22T04:46:08Z

The failing tests are due to the change

	err = vterrors.UnwrapAll(err)

I'm digging in.

shlomi-noach · 2023-02-22T05:05:58Z

OK i'm reverting the err = vterrors.UnwrapAll(err) change, it is not very important to this PR, will handle it in a later PR. It seems to affect too many tests.

Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>

shlomi-noach · 2023-02-22T06:30:02Z

I see why the Unwrap addition in sql_error.go caused the tests to fail. There's logic that attempts to convert a vterrors error code into SQL code. Unwrap stripped down the vterrros information, thus said info was lost.

shlomi-noach · 2023-02-22T06:48:10Z

Will cherry pick this to v16 manually, since I've merged main in the process.

(re)Formalize SQLError in VReplication, add underlying wrap/unwrap fu…

8e58a57

…nctionality Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>

shlomi-noach added Type: Bug Component: VReplication Component: Query Serving labels Feb 13, 2023

shlomi-noach requested review from rohit-nayak-ps, mattlord, harshit-gangal and systay as code owners February 13, 2023 07:50

vitess-bot bot added NeedsDescriptionUpdate The description is not clear or comprehensive enough, and needs work NeedsWebsiteDocsUpdate What it says labels Feb 13, 2023

systay approved these changes Feb 13, 2023

View reviewed changes

This was referenced Feb 13, 2023

Fixing onlineddl_vrepl flakiness, and adding more tests #12325

Merged

Online DDL: improve retry of vreplication errors with vitess ALTER TABLE migrations #12323

Merged

shlomi-noach added Backport to: release-16.0 and removed NeedsDescriptionUpdate The description is not clear or comprehensive enough, and needs work NeedsWebsiteDocsUpdate What it says labels Feb 13, 2023

mattlord reviewed Feb 14, 2023

View reviewed changes

rohit-nayak-ps reviewed Feb 16, 2023

View reviewed changes

go/mysql/sql_error.go Outdated Show resolved Hide resolved

frouioui mentioned this pull request Feb 16, 2023

Release of v16.0.0 #12293

Closed

51 tasks

update test to match new UnwrapAll() behavior

adafc18

Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>

rohit-nayak-ps approved these changes Feb 21, 2023

View reviewed changes

mattlord approved these changes Feb 21, 2023

View reviewed changes

Merge branch 'main' into sql-error-unwrap-vreplication

769a438

Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>

do not unwrap

77e440c

Signed-off-by: Shlomi Noach <2607934+shlomi-noach@users.noreply.github.com>

shlomi-noach merged commit 6b5eb50 into vitessio:main Feb 22, 2023

shlomi-noach deleted the sql-error-unwrap-vreplication branch February 22, 2023 06:44

shlomi-noach mentioned this pull request Feb 22, 2023

Backport to v16: onlineddl_vrepl flakiness and subsequent fixes #12426

Merged

3 tasks

shlomi-noach mentioned this pull request Mar 8, 2023

Internal refactor: SQLError #12574

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(re)Formalize SQLError in VReplication, add underlying wrap/unwrap functionality #12327

(re)Formalize SQLError in VReplication, add underlying wrap/unwrap functionality #12327

shlomi-noach commented Feb 13, 2023

vitess-bot bot commented Feb 13, 2023 •

edited by systay

Loading

mattlord left a comment

mattlord Feb 14, 2023

shlomi-noach Feb 14, 2023

mattlord Feb 21, 2023

shlomi-noach commented Feb 14, 2023 •

edited

Loading

rohit-nayak-ps left a comment

shlomi-noach commented Feb 21, 2023

mattlord Feb 21, 2023

mattlord commented Feb 21, 2023

shlomi-noach commented Feb 21, 2023

shlomi-noach commented Feb 22, 2023

shlomi-noach commented Feb 22, 2023

shlomi-noach commented Feb 22, 2023

shlomi-noach commented Feb 22, 2023

(re)Formalize SQLError in VReplication, add underlying wrap/unwrap functionality #12327

(re)Formalize SQLError in VReplication, add underlying wrap/unwrap functionality #12327

Conversation

shlomi-noach commented Feb 13, 2023

Description

Related Issue(s)

Checklist

Deployment Notes

vitess-bot bot commented Feb 13, 2023 • edited by systay Loading

Review Checklist

General

If a new flag is being introduced:

If a workflow is added or modified:

Bug fixes

Non-trivial changes

New/Existing features

Backward compatibility

mattlord left a comment

Choose a reason for hiding this comment

mattlord Feb 14, 2023

Choose a reason for hiding this comment

shlomi-noach Feb 14, 2023

Choose a reason for hiding this comment

mattlord Feb 21, 2023

Choose a reason for hiding this comment

shlomi-noach commented Feb 14, 2023 • edited Loading

rohit-nayak-ps left a comment

Choose a reason for hiding this comment

shlomi-noach commented Feb 21, 2023

mattlord Feb 21, 2023

Choose a reason for hiding this comment

mattlord commented Feb 21, 2023

shlomi-noach commented Feb 21, 2023

shlomi-noach commented Feb 22, 2023

shlomi-noach commented Feb 22, 2023

shlomi-noach commented Feb 22, 2023

shlomi-noach commented Feb 22, 2023

vitess-bot bot commented Feb 13, 2023 •

edited by systay

Loading

shlomi-noach commented Feb 14, 2023 •

edited

Loading