Runaway query control based on resource group #43691

Connor1996 · 2023-05-10T11:55:07Z

Motivation

Runaway queries are queries that consume more resources than the user expects. Typical causes are an improperly written SQL statement or a suboptimal execution plan.
Runaway queries can degrade overall performance if they are not managed properly, so long-running operations need to be identified and aborted.
We already have a deadline mechanism pushed down to the TiKV layer: by default, a single coprocessor request cannot execute in TiKV for more than 60s. But a runaway query may not spend much time on any single coprocessor request, so the deadline mechanism cannot stop it. Meanwhile, the deadline cannot be set too small, or normal requests would be aborted prematurely.

How to identify runaway queries?

The resource manager can take action when a query's elapsed time exceeds a specified threshold. The elapsed time here is the time spent being processed, excluding waiting time.
Differentiating runaway queries from queries that genuinely need a full table or index scan is hard, and there is no absolute rule. So we let users define the rule that identifies runaway queries and tune it to their own needs. For now, the only criterion is execution time; more dimensions may be added later.
TiKV sends back the scan detail in coprocessor responses. If the total elapsed time of the query exceeds the threshold, the query (statement) is recognized as runaway.
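
The check described above can be illustrated with a minimal sketch. It is not the TiDB implementation: `RunawayChecker`, `threshold_seconds`, and `first_runaway_index` are hypothetical names, and the per-response processing time is assumed to be already extracted from the scan detail.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class RunawayChecker:
    """Tracks one statement's processing time against a user-defined limit.

    Hypothetical illustration only; names and structure are assumptions.
    """

    threshold_seconds: float        # user-defined rule for the resource group
    processed_seconds: float = 0.0  # accumulated processing time, waiting excluded

    def on_coprocessor_response(self, process_seconds: float) -> bool:
        """Add one response's processing time; return True once the limit is exceeded."""
        self.processed_seconds += process_seconds
        return self.processed_seconds > self.threshold_seconds


def first_runaway_index(threshold_seconds: float,
                        process_times: Iterable[float]) -> Optional[int]:
    """Return the index of the response at which the statement becomes runaway."""
    checker = RunawayChecker(threshold_seconds)
    for index, seconds in enumerate(process_times):
        if checker.on_coprocessor_response(seconds):
            return index
    return None  # the statement never exceeded the threshold
```

With a threshold of 10 seconds and three responses that each report 4 seconds of processing, no single request comes near the 60s deadline, yet the statement is flagged on the third response because the accumulated time exceeds the threshold.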

Task Breakdown

Misc

Introduce a metric "max query elapsed time by resource groups" (metrics: add max query duration per resource group metrics #44746)
Add user document (resource_control: add runaway queries docs-cn#14242)
Publish RFC (resource_control: publish runaway management rfc #44745)

Connor1996 added the type/enhancement label May 10, 2023

This was referenced May 15, 2023

*: Introduce runaway statement in resource group #43843

Merged

ddl, I_S: support runaway attribute in resource group #43877

Merged

Add a runaway field for resource group tikv/pd#6474

Closed

CabinfeverB mentioned this issue May 24, 2023

Tracking issue for the part of PD about runaway tikv/pd#6509

Closed

ti-chi-bot bot pushed a commit that referenced this issue May 31, 2023

*: Introduce runaway statement in resource group (#43843)

4c83352

ref #43691

CabinfeverB mentioned this issue Jun 1, 2023

resource_group: support patch for altering resource group #44322

Merged

Connor1996 mentioned this issue Jun 1, 2023

domain: Introduce runaway manager #44339

Merged

ti-chi-bot bot pushed a commit that referenced this issue Jun 6, 2023

ddl, I_S: support runaway attribute in resource group (#43877)

a26691c

ref #43691

Connor1996 mentioned this issue Jun 8, 2023

resource_control: use const default resource group name #44526

Merged

Connor1996 changed the title ~~Runaway task control~~ Runaway query control based on resource group Jun 9, 2023

Connor1996 mentioned this issue Jun 12, 2023

Use override priority for runaway query tikv/tikv#14925

Closed

ti-chi-bot bot pushed a commit that referenced this issue Jun 13, 2023

resource_control: use const default resource group name (#44526)

1837efe

ref #43691

ti-chi-bot bot pushed a commit that referenced this issue Jun 13, 2023

resource_group: support patch for altering resource group (#44322)

af66f90

ref #43691

Connor1996 mentioned this issue Jun 15, 2023

resource_control: add runaway queries pingcap/docs-cn#14242

Merged

CabinfeverB mentioned this issue Jun 16, 2023

executor: impl runaway watch check #44474

Merged

ti-chi-bot bot pushed a commit that referenced this issue Jun 16, 2023

domain: Introduce runaway manager (#44339)

7a29bec

ref #43691

CabinfeverB mentioned this issue Jun 16, 2023

domain: record runaway and quarantine query #44654

Merged

ti-chi-bot bot pushed a commit that referenced this issue Jun 16, 2023

executor: impl runaway watch check (#44474)

adfb34b

ref #43691

This was referenced Jun 16, 2023

resource_control: publish runaway management rfc #44745

Merged

metrics: add max query duration per resource group metrics #44746

Merged

ti-chi-bot bot pushed a commit that referenced this issue Jun 17, 2023

metrics: add max query duration per resource group metrics (#44746)

b754f88

ref #43691

ti-chi-bot bot pushed a commit that referenced this issue Jun 18, 2023

domain: record runaway and quarantine query (#44654)

0f48433

ref #43691

CabinfeverB mentioned this issue Jun 19, 2023

domain: support GC runaway record #44784

Merged

ti-chi-bot bot pushed a commit that referenced this issue Jun 21, 2023

domain: support GC runaway record (#44784)

47101e8

ref #43691, close #44804

ti-chi-bot mentioned this issue Jun 21, 2023

domain: support GC runaway record (#44784) #44860

Merged

CabinfeverB mentioned this issue Jun 21, 2023

I_S: unify the output of query_limit in resource_groups #44878

Merged

ti-chi-bot bot pushed a commit that referenced this issue Jun 21, 2023

domain: support GC runaway record (#44784) (#44860)

847f543

ref #43691, close #44804

ti-chi-bot bot pushed a commit that referenced this issue Jun 25, 2023

I_S: unify the output of query_limit in resource_groups (#44878)

4099425

ref #43691

samhld mentioned this issue Jun 26, 2023

Add v7.2.0 release notes pingcap/docs#13910

Merged

CabinfeverB mentioned this issue Jul 21, 2023

*: add query watch stmt for manul management of runaway watch #45500

Merged

ti-chi-bot bot pushed a commit that referenced this issue Jul 26, 2023

*: add query watch stmt for manul management of runaway watch (#45500)

7e8bb1b

ref #43691

CabinfeverB mentioned this issue Jul 26, 2023

*: global runaway watch by system table and impl exector for query watch #45465

Merged

ti-chi-bot bot pushed a commit that referenced this issue Aug 1, 2023

*: global runaway watch by system table and impl exector for `query w…

c46da07

…atch` (#45465) ref #43691

ti-chi-bot bot pushed a commit that referenced this issue Aug 10, 2023

resource_control: publish runaway management rfc (#44745)

7828409

ref #43691

CabinfeverB mentioned this issue Sep 4, 2023

executor: return query watch id #46626

Merged

CabinfeverB mentioned this issue Sep 28, 2023

resource_control: add runaway metrics #47360

Merged

ti-chi-bot bot pushed a commit that referenced this issue Oct 19, 2023

resource_control: add runaway metrics (#47360)

af7b32c

ref #43691

ti-chi-bot bot pushed a commit that referenced this issue Nov 7, 2023

executor: return query watch id (#46626)

21844d0

ref #43691

CabinfeverB mentioned this issue Mar 28, 2024

resource_control: Separate the mark for watch and rule and enhance testing #52197

Merged

ti-chi-bot bot pushed a commit that referenced this issue Apr 22, 2024

resource_control: Separate the mark for watch and rule and enhance te…

62f3aea

…sting (#52197) ref #43691

Connor1996 closed this as completed Jul 2, 2024
