Excessive bucket List operations when actively querying data over long time ranges #5018

slim-bean · 2021-12-29T22:56:06Z

I've been storing my shell history in Loki for almost a year now, and am discovering some pain points around List operations which are exacerbated by this use case.

Currently the compactor will search through every table in storage to look for work to do, this is a list operation on all the index tables as well as a list operation for each table to see the files in it. So every 10 minutes (default compactor run time) there is a list for as many days of stored data.

Also when you query data boltdb-shipper will download the index and cache it locally for some time, while it's cached every 5 minutes the querier will "sync" this table to make sure no new files were uploaded to the object store. For loki-shell I set the TTL on this cache to > 300 days because I regularly query for long term data. Every 5 minutes, every table in the cache will have a List call made to the object store.

I think a good first step at improving this would be to not compact or 'sync' index tables older than reject_old_samples_max_age

The text was updated successfully, but these errors were encountered:

slim-bean · 2022-01-10T01:17:55Z

Making a note here from a separate discussion: @sandeepsukhani had a good suggestion that it's possible to do a single list operation which returns all the objects including "subdirectories". This would make it possible to have both the sync operation and the compactor do a single list operation.

I think ultimately both would be great but making the actual operation really cheap as sandeep suggests is ideal I think.

Another consideration with using reject_old_samples_max_age is paying attention to retention which can change old indexes/chunks.

slim-bean · 2022-01-23T17:48:07Z

Fixed by #5018

samjewell · 2023-11-22T16:54:58Z

Fixed by #5018

@slim-bean this doesn't make sense! The issue fixes itself?

Should this say fixed by # 5160?

grafana/loki#5018 has now been closed

slim-bean mentioned this issue Jan 23, 2022

add objects list caching for boltdb-shipper index store to reduce object storage list api calls #5160

Merged

1 task

slim-bean closed this as completed Jan 23, 2022

samjewell added a commit to samjewell/loki-shell that referenced this issue Nov 22, 2023

Remove warning about excessive bucket list operations

3b145d9

grafana/loki#5018 has now been closed

samjewell mentioned this issue Nov 22, 2023

Remove warning about excessive bucket list operations slim-bean/loki-shell#20

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Excessive bucket List operations when actively querying data over long time ranges #5018

Excessive bucket List operations when actively querying data over long time ranges #5018

slim-bean commented Dec 29, 2021

slim-bean commented Jan 10, 2022

slim-bean commented Jan 23, 2022

samjewell commented Nov 22, 2023

Excessive bucket List operations when actively querying data over long time ranges #5018

Excessive bucket List operations when actively querying data over long time ranges #5018

Comments

slim-bean commented Dec 29, 2021

slim-bean commented Jan 10, 2022

slim-bean commented Jan 23, 2022

samjewell commented Nov 22, 2023