-
Notifications
You must be signed in to change notification settings - Fork 105
GC task too eagerly closes chunks #844
Comments
actually this makes me think it would make sense to be able to configure |
i dont think so. We need chunks to be persisted before the datapoints become older then the kafka retention. So if the kafka retention is 7.5hours we just need to make sure that the chunk is persisted within 7.5hours of the first datapoint being received. So rather than having a chunk-max-stale setting, i think we should just have a max-chunk-age setting. where the maximum age allowed is Looks like this issue is a dup of #614 |
re-opening this. #614 covers a much wider scope of problems but doesnt look like they are going to be fixed anytime soon. This specific issue is customer impacting and needs to be fixed ASAP. |
it sounds reasonable and correct to only close chunks via GC after max-stale AND after the chunk end has passed. (eg when wall clock > last ts of the chunk). @woodsaj if this sounds good to you i'll make the PR |
fix: GC task too eagerly closes chunks. #844
with default config, the GC task closes chunks if they have received no data for between 1-2hours (chunk-max-stale + gc-interval)
However, if a chunk has a chunkspan of say 6hours and you send 1hour of data then dont send anything for 3hours. When you start sending again the data will be rejected as the chunk will have already been closed and flushed to cassandra.
We need to make sure that chunks are not closed until after the chunk window has passed.
The text was updated successfully, but these errors were encountered: