Metrics for tsdb startup time #5471

damnever · 2023-07-24T08:54:48Z

Is your feature request related to a problem? Please describe.
Add some metrics to show the tsdb startup time.

Describe the solution you'd like
We can reuse the tsdb level metrics, such as prometheus_tsdb_data_replay_duration_seconds and prometheus_tsdb_snapshot_replay_error_total. Alternatively, we can also introduce new metrics at the cortex level specific to each tenant.


yeya24 · 2023-07-24T16:38:28Z

I like this. I guess it is fine to make it per-tenant, since we have other per-tenant metrics in the ingester anyway.

damnever · 2023-07-26T02:42:24Z

However, there is already a metric called cortex_ingester_tsdb_wal_replay_duration_seconds:

cortex/pkg/ingester/ingester.go

Lines 590 to 594 in 634df35

    
    walReplayTime: promauto.With(registerer).NewHistogram(prometheus.HistogramOpts{
        Name:    "cortex_ingester_tsdb_wal_replay_duration_seconds",
        Help:    "The total time it takes to open and replay a TSDB WAL.",
        Buckets: prometheus.DefBuckets,
    }),

yeya24 · 2023-08-14T16:58:06Z

The name cortex_ingester_tsdb_wal_replay_duration_seconds seems a little odd, since the metric measures the total time to open a TSDB, which includes WAL replay as well as other work.
Does it make sense to rename the metric?

damnever · 2023-08-15T06:19:12Z

Perhaps we should deprecate cortex_ingester_tsdb_wal_replay_duration_seconds and replace it with cortex_ingester_tsdb_data_replay_duration_seconds. Personally, I do not find the percentile metric useful for identifying slow users, especially when I want to relate the replay time to other context such as the number of series the user has.
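To illustrate the point about percentiles, here is a minimal sketch in plain Go. It does not use the Prometheus client. It only models the two shapes of data: an unlabeled histogram (bucket counts) and a per-tenant value keyed by tenant. The type and function names (tenantReplay, observeHistogram, setGauges, slowest) and the sample durations are hypothetical, and it assumes the replacement metric would expose one value per tenant.

```go
package main

import (
	"fmt"
	"sort"
)

// defBuckets copies the upper bounds (in seconds) of prometheus.DefBuckets.
var defBuckets = []float64{.005, .01, .025, .05, .1, .25, .5, 1, 2.5, 5, 10}

// tenantReplay is a hypothetical record of how long one tenant's TSDB took to open.
type tenantReplay struct {
	Tenant  string
	Seconds float64
}

// observeHistogram models an unlabeled histogram. It returns one count per
// bucket (not cumulative), with the final slot acting as the +Inf bucket.
// The tenant name is discarded, as it would be without a tenant label.
func observeHistogram(samples []tenantReplay) []int {
	counts := make([]int, len(defBuckets)+1)
	for _, s := range samples {
		// Index of the first bucket whose upper bound is >= the value.
		i := sort.SearchFloat64s(defBuckets, s.Seconds)
		counts[i]++
	}
	return counts
}

// setGauges models a per-tenant value: one number per tenant.
func setGauges(samples []tenantReplay) map[string]float64 {
	gauges := make(map[string]float64, len(samples))
	for _, s := range samples {
		gauges[s.Tenant] = s.Seconds
	}
	return gauges
}

// slowest returns the tenant with the largest recorded duration.
func slowest(gauges map[string]float64) (string, float64) {
	var name string
	max := -1.0
	for tenant, seconds := range gauges {
		if seconds > max {
			name, max = tenant, seconds
		}
	}
	return name, max
}

func main() {
	samples := []tenantReplay{
		{Tenant: "tenant-a", Seconds: 0.8},
		{Tenant: "tenant-b", Seconds: 1.2},
		{Tenant: "tenant-c", Seconds: 42},
	}

	// The histogram shows that one replay exceeded 10s, but not whose it was.
	fmt.Println("bucket counts:", observeHistogram(samples))

	// The per-tenant values name the slow tenant directly.
	tenant, seconds := slowest(setGauges(samples))
	fmt.Printf("slowest tenant: %s (%.1fs)\n", tenant, seconds)
}
```

With the per-tenant form, the slow tenant can then be compared against other per-tenant data such as its series count, which the bucket counts alone do not allow.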

yeya24 · 2023-08-17T06:22:48Z

I think I am OK with aligning with the TSDB metrics. Thoughts? @friedrichg @alanprot @alvinlin123

damnever mentioned this issue Jul 25, 2023

Add cortex_ingester_tsdb_data_replay_duration_seconds metric #5477

Merged


friedrichg added component/ingester type/feature labels Nov 7, 2023

yeya24 closed this as completed in #5477 Nov 13, 2023
