*: fix 'select for update' on partitioned table again #30732

tiancaiamao · 2021-12-15T02:02:07Z

What problem does this PR solve?

Issue Number: close #30382 #28073

Problem Summary:

This is a rework of #21148, that old fix introduce a extra partition ID column to the schema.
In theory, the performance is better, but it's error-prone. After PR 21148 it has introduced numerous bugs.
For example, #28073 #28292 #30489 ... and maybe many more

If we use this one, #28666 is unnecessary. It's a fix on the old code, while this one is a totally rework.

What is changed and how it works?

Now I avoid using the extra partition ID column.
The problem of partition ID column is, many of the executor need to fill the extra partition ID column, and if we forget to do it, then there would be bugs:

When the schema is wrong, column and row datatum mismatch
When the executor forget to fill the extra column, the extraPIDColumn will be missing

There are so many places involved, it's hard to fix all of them.

In this PR, I just avoid column pruning on the SelectLock, and the using partition pruning again to get the partition ID.

For #30382, the select for update lock will not lock the table of the subquery, although the plan is transformed into Join, just one side is locked.

Check List

Tests

Unit test
Integration test
Manual test (add detailed scripts or steps below)
No code

Side effects

Performance regression: Consumes more CPU
Performance regression: Consumes more Memory
Breaking backward compatibility

Documentation

Release note

Fix many bugs related to 'select ... for update' on partitioned table

ti-chi-bot · 2021-12-15T02:02:09Z

[REVIEW NOTIFICATION]

This pull request has not been approved.

To complete the pull request process, please ask the reviewers in the list to review by filling /cc @reviewer in the comment.
After your PR has acquired the required number of LGTMs, you can assign this pull request to the committer in the list by filling /assign @committer in the comment to help you merge this pull request.

The full list of commands accepted by this bot can be found here.

Reviewer can indicate their review by submitting an approval review.
Reviewer can cancel approval by submitting a request changes review.

cfzjywxk · 2021-12-15T03:46:57Z

executor/builder.go

+			cols := pt.VisibleCols()
+			offsets := make([]int, 0, len(cols))
+			for _, colInfo := range cols {
+				colName := fmt.Sprintf("%s.%s.%s", dbInfo.Name.L, tblInfo.Name.L, colInfo.Name.L)


Will there be some colume names not in this format like _tidb_rowid that it could not be found in the schema？

mjonss

Some curious questions :)

mjonss · 2021-12-16T00:24:57Z

executor/builder.go

+
+		for _, pt := range v.PartitionedTable {
+			tblInfo := pt.Meta()
+			e.partitionedTable[tblInfo.ID] = pt


Is the tblInfo.ID the logical table or the physical table (partition) ID here?

tblInfo.ID is the logical table ID here

mjonss · 2021-12-16T00:27:08Z

executor/builder.go

+
+			cols := pt.VisibleCols()
+			offsets := make([]int, 0, len(cols))
+			for _, colInfo := range cols {


Why create this double loop for each partition?
Are not all the partitions the same?
I wonder if we cannot use the HandleCols from tblID2Handle instead.
I think I managed something like that in my hack here.

mjonss · 2021-12-16T01:06:31Z

planner/core/planbuilder.go

 	}
 	return selectLock, nil
 }

-func addExtraPIDColumnToDataSource(p LogicalPlan, info *extraPIDInfo) error {
+func collectPartitionTable(p LogicalPlan, input []table.PartitionedTable) []table.PartitionedTable {
 	switch raw := p.(type) {
 	case *DataSource:
 		// Fix issue 26250, do not add extra pid column to normal table.


I think we can remove this comment.

mjonss · 2021-12-16T01:25:57Z

executor/builder.go

-			e.tblID2PIDColumnIndex[tblID] = offset
+	if len(v.PartitionedTable) > 0 {
+		e.partitionedTable = make(map[int64]table.PartitionedTable)
+		e.tblID2PtColsOffsets = make(map[int64][]int)


Do we really need this? Is not the e.tblID2Handle enough?

winoros · 2021-12-16T06:46:59Z

planner/core/planbuilder.go

-			if err != nil {
-				return err
-			}
+			input = collectPartitionTable(child, input)


Why do you still collect the partition table of the aggregation?

And also, you don't need to collect the things after a subquery.
For the two following queries

select * from t, (select * from t1) t1 where t.c=t1.id for update; select * from t, t1 where t.c=t1.id for update;

The first would not lock the t1, only the second one would.

Maybe you can refer to the handleHelper of the PlanBuilder. It maintains a stack to record the information. Though it does some redundant work. But it's easy to add new things to it I think.

// handleHelper records the handle column position for tables. Delete/Update/SelectLock/UnionScan may need this information. // It collects the information by the following procedure: // Since we build the plan tree from bottom to top, we maintain a stack to record the current handle information. // If it's a dataSource/tableDual node, we create a new map. // If it's an aggregation, we pop the map and push a nil map since no handle information left. // If it's a union, we pop all children's and push a nil map. // If it's a join, we pop its children out then merge them and push the new map to stack. // If we meet a subquery, it's clear that it's an independent problem so we just pop one map out when we finish building the subquery. handleHelper *handleColHelper

ti-chi-bot · 2021-12-16T08:29:09Z

@tiancaiamao: PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

tiancaiamao · 2022-01-24T14:30:15Z

Use another strategy to fix the locking on partitioned table issue #31634
Close this one.

tiancaiamao added 3 commits December 8, 2021 17:25

save the changes

7decdfd

fix issue 30382

3f1a838

Merge branch 'master' into partition-lock

9652b11

ti-chi-bot added release-note Denotes a PR that will be considered when it comes time to generate release notes. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Dec 15, 2021

tiancaiamao requested review from qw4990, winoros and cfzjywxk December 15, 2021 02:02

tiancaiamao mentioned this pull request Dec 15, 2021

executor: fill extra partition ID column in UnionScan executor #28666

Closed

12 tasks

tiancaiamao added the type/bugfix This PR fixes a bug. label Dec 15, 2021

tiancaiamao added 2 commits December 15, 2021 10:12

make fmt

4af0543

make golint happy

af61b36

cfzjywxk reviewed Dec 15, 2021

View reviewed changes

mjonss reviewed Dec 16, 2021

View reviewed changes

winoros reviewed Dec 16, 2021

View reviewed changes

ti-chi-bot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Dec 16, 2021

tiancaiamao closed this Jan 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

*: fix 'select for update' on partitioned table again #30732

*: fix 'select for update' on partitioned table again #30732

tiancaiamao commented Dec 15, 2021

ti-chi-bot commented Dec 15, 2021

cfzjywxk Dec 15, 2021

mjonss left a comment

mjonss Dec 16, 2021

tiancaiamao Jan 25, 2022

mjonss Dec 16, 2021

mjonss Dec 16, 2021

mjonss Dec 16, 2021

winoros Dec 16, 2021

winoros Dec 16, 2021 •

edited

Loading

ti-chi-bot commented Dec 16, 2021

tiancaiamao commented Jan 24, 2022

*: fix 'select for update' on partitioned table again #30732

*: fix 'select for update' on partitioned table again #30732

Conversation

tiancaiamao commented Dec 15, 2021

What problem does this PR solve?

What is changed and how it works?

Check List

Release note

ti-chi-bot commented Dec 15, 2021

cfzjywxk Dec 15, 2021

Choose a reason for hiding this comment

mjonss left a comment

Choose a reason for hiding this comment

mjonss Dec 16, 2021

Choose a reason for hiding this comment

tiancaiamao Jan 25, 2022

Choose a reason for hiding this comment

mjonss Dec 16, 2021

Choose a reason for hiding this comment

mjonss Dec 16, 2021

Choose a reason for hiding this comment

mjonss Dec 16, 2021

Choose a reason for hiding this comment

winoros Dec 16, 2021

Choose a reason for hiding this comment

winoros Dec 16, 2021 • edited Loading

Choose a reason for hiding this comment

ti-chi-bot commented Dec 16, 2021

tiancaiamao commented Jan 24, 2022

winoros Dec 16, 2021 •

edited

Loading