Database and Table Level Metrics #17688

breezewish · 2020-06-04T14:31:27Z

Feature Request

Is your feature request related to a problem? Please describe:

Many customers use a single TiDB cluster to serve multiple, hybrid payloads (in different databases). Currently TiDB only exposes metrics for the whole cluster or for a single TiDB instance. Database-level and table-level metrics are missing.

Describe the feature you'd like:

For critical metrics such as QPS, latency, and errors, more detailed per-table and per-database metrics are needed.

Describe alternatives you've considered:

Teachability, Documentation, Adoption, Migration Strategy:

The technical implementation needs further investigation. One possible solution is to use Prometheus labels. The corresponding label values in the in-memory metrics need to be deleted when a table or database is dropped. Note that there may be a large number of databases and tables, and attaching metrics to each one may affect performance, so it may not be suitable to apply this to all metrics. Histograms and multi-label metrics also need to be considered very carefully, since they notably amplify the number of metric series. Another good idea would be to let users configure which tables and databases are collected.
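To make the idea concrete, here is a minimal sketch of a labelled counter that only records allowed databases and releases its series when a table or database is dropped. It is a standard-library model of the behaviour, not TiDB code and not the Prometheus client API. The class `PerTableCounter` and all of its method names are hypothetical and chosen only for illustration.

```python
from typing import Dict, Iterable, Optional, Set, Tuple


class PerTableCounter:
    """Toy stand-in for a counter labelled by (database, table).

    Hypothetical illustration only: a real implementation would sit on
    top of a metrics library rather than a plain dict.
    """

    def __init__(self, allowed_databases: Optional[Iterable[str]] = None) -> None:
        # None means "collect for every database"; otherwise only the
        # listed databases are recorded, which bounds the series count.
        self._allowed: Optional[Set[str]] = (
            None if allowed_databases is None else set(allowed_databases)
        )
        # One entry (one "series") per (database, table) label pair.
        self._series: Dict[Tuple[str, str], int] = {}

    def inc(self, database: str, table: str, amount: int = 1) -> None:
        """Increment the series for this label pair if the database is allowed."""
        if self._allowed is not None and database not in self._allowed:
            return
        key = (database, table)
        self._series[key] = self._series.get(key, 0) + amount

    def drop_table(self, database: str, table: str) -> None:
        """Release the series of a dropped table so it does not leak."""
        self._series.pop((database, table), None)

    def drop_database(self, database: str) -> None:
        """Release every series that belongs to a dropped database."""
        for key in [k for k in self._series if k[0] == database]:
            del self._series[key]

    def value(self, database: str, table: str) -> int:
        """Return the current count, or 0 if no series exists."""
        return self._series.get((database, table), 0)

    def series_count(self) -> int:
        """Number of live series, i.e. the cardinality cost of this metric."""
        return len(self._series)


# Usage: only the "orders" database is collected.
qps = PerTableCounter(allowed_databases={"orders"})
qps.inc("orders", "items")
qps.inc("analytics", "events")  # ignored, database not configured
qps.drop_database("orders")     # series are released on DROP DATABASE
```

The series count grows with the number of distinct (database, table) pairs, which is why the allowlist and the explicit cleanup on drop both matter. A histogram would multiply that count again by its number of buckets.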

The new metrics should be added to the Grafana monitor.

Note that TiDB already has a similar feature: #9151. However, it may leak memory because database names are always kept in memory, and it does not work well when the number of databases is huge. The new implementation could refine and improve on it.

zz-jason · 2020-07-22T01:25:06Z

@breeswish Could you describe the use cases in more detail? It seems this would be useful in a multi-tenant scenario?

breezewish · 2020-07-22T07:42:47Z

@zz-jason Simply speaking, yes. We already have many customers who use a single TiDB cluster to serve multiple, hybrid payloads. These payloads usually live in different databases, but they are all on the same TiDB cluster or even the same TiDB instance.

jackysp · 2020-08-06T12:24:56Z

emm... something like #9151 ?

breezewish · 2020-08-12T03:35:27Z

@jackysp Yes, similar to it. The current implementation in #9151 leaks memory (when a database is deleted), and database-level metrics are usually not precise enough either.

jackysp · 2020-08-12T03:40:16Z

The key point is that there are still performance issues when there are many databases. It seems that no one uses this feature, so many people don't even know about it.

breezewish · 2020-08-13T14:24:04Z

@jackysp I received real-world feature requests from our clients for this one, where they don't deploy multiple TiDB clusters :)

Yes, in addition to the memory leak issue, performance is another problem. I think we can simply let users configure what they want to collect, so that default scenarios do not suffer from performance problems. It is natural that the more users want to know, the more it will cost. The important part is to let users decide.

pepezzzz · 2020-12-01T09:11:31Z

@breeswish Please help develop a simple version so that end-user DBAs can observe the duration per database, like #19360.

breezewish added the type/feature-request label Jun 4, 2020

breezewish mentioned this issue Jun 4, 2020

Requirement Request: Database and Table Level Metrics pingcap/tidb-dashboard#545

Closed

djshow832 added the component/metrics and help wanted labels Jun 5, 2020

scsldb assigned zz-jason Jul 15, 2020

scsldb added the feature/reviewing label Jul 16, 2020

zz-jason added the feature/discussing label and removed the feature/reviewing label Aug 10, 2020

zz-jason mentioned this issue Aug 25, 2020

record each db duration like record-db-qps feature #19360

Closed

scsldb added the feature/accepted and priority/P1 labels and removed the feature/discussing label Sep 4, 2020

scsldb added this to the Requirement pool milestone Sep 4, 2020

zz-jason removed their assignment Sep 5, 2020
