
Support table compaction #930

Closed
13 of 15 tasks
v0y4g3r opened this issue Feb 2, 2023 · 5 comments
Labels
C-enhancement Category Enhancements · tracking-issue A tracking issue for a feature.
v0y4g3r (Contributor) commented Feb 2, 2023

What type of enhancement is this?

Performance

What does the enhancement do?

So far GreptimeDB supports flushing rows in the memtable to SST files in level 0. But level-0 SST files are not sorted in time bucket order, so retrieving rows in a given time range requires scanning all SST files in level 0. Like other LSM-tree-based storage engines, we need to compact SST files across levels to:

  • merge insert/delete records with the same primary key;
  • sort all rows in timestamp order and distribute them evenly across SST files in level 1, so that no two level-1 SST files contain intersecting time ranges;
  • delete expired SST files according to the TTL;
  • (in the future) introduce compression and indexing tasks while compacting SST files.
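The merge/dedup/sort goals above can be sketched as a single pass over level-0 rows. This is a hypothetical, simplified model: the `Row`/`Op` types and `merge_l0` are illustrative, not GreptimeDB's actual structures, and real compaction streams rows through readers rather than materializing a map:

```rust
#[derive(Clone, Debug, PartialEq)]
enum Op {
    Put(i64), // value payload
    Delete,
}

#[derive(Clone, Debug)]
struct Row {
    key: String, // primary key
    ts: i64,     // timestamp
    seq: u64,    // sequence number; higher means newer
    op: Op,
}

/// Merge rows from several level-0 files, resolve duplicates with the same
/// (primary key, timestamp) by sequence number, drop deletes, and sort the
/// survivors by timestamp so level-1 files can hold disjoint time ranges.
fn merge_l0(files: Vec<Vec<Row>>) -> Vec<Row> {
    use std::collections::HashMap;
    let mut latest: HashMap<(String, i64), Row> = HashMap::new();
    for row in files.into_iter().flatten() {
        let k = (row.key.clone(), row.ts);
        match latest.get(&k) {
            Some(cur) if cur.seq >= row.seq => {} // keep the newer record
            _ => {
                latest.insert(k, row);
            }
        }
    }
    let mut out: Vec<Row> = latest
        .into_values()
        .filter(|r| !matches!(r.op, Op::Delete)) // a newer delete cancels older puts
        .collect();
    out.sort_by(|a, b| (a.ts, a.key.clone()).cmp(&(b.ts, b.key.clone())));
    out
}
```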


The RFC can be found at #939.

Implementation challenges

We need to implement the following components:

  • compaction scheduler (along with task throttling to limit the performance impact on foreground queries): feat: compaction scheduler and rate limiter #947;
  • compaction strategy: we can begin with a file-count-based strategy that triggers compaction as soon as the number of SST files in level 0 exceeds some threshold: feat: L0 to L1 compaction strategy #964;
    • time bucket calculation: read the time ranges of level-0 SST files and calculate a proper time bucket for level-1 SST files so that data is evenly distributed across them;
    • track references to SST files and only delete an SST once it has been marked "deleted" and its reference count drops to 0;
  • table compaction task: use a merge reader along with a dedup reader to read rows from all SST files in level 0: feat: compaction reader and writer #972;
  • integrate the compaction components into the datanode instance: feat: compaction integration #997.
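As a rough illustration of the file-count trigger and the time bucket calculation, a minimal sketch (the threshold constants and function names are assumptions for illustration, not GreptimeDB's actual API):

```rust
const L0_FILE_THRESHOLD: usize = 4; // trigger when level 0 holds this many files (assumed value)
const TARGET_L1_FILES: i64 = 4;     // aim for this many level-1 buckets (assumed value)

/// (min_ts, max_ts) of one level-0 SST file, inclusive.
type TimeRange = (i64, i64);

/// File-count-based strategy: compact as soon as level 0 is "full".
fn should_compact(l0_files: &[TimeRange]) -> bool {
    l0_files.len() >= L0_FILE_THRESHOLD
}

/// Choose a bucket width that covers the union of the level-0 time ranges,
/// split into TARGET_L1_FILES non-overlapping buckets so data spreads evenly.
fn bucket_width(l0_files: &[TimeRange]) -> Option<i64> {
    let min = l0_files.iter().map(|r| r.0).min()?;
    let max = l0_files.iter().map(|r| r.1).max()?;
    let span = max - min + 1;
    // Round up so TARGET_L1_FILES buckets always cover the whole span.
    Some((span + TARGET_L1_FILES - 1) / TARGET_L1_FILES)
}
```

A real strategy would also consider file sizes and row counts, but the shape of the decision (inspect level-0 metadata, derive a bucket width) is the same.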

Future work

@v0y4g3r v0y4g3r added the C-enhancement Category Enhancements label Feb 2, 2023
@v0y4g3r v0y4g3r added this to the Release v0.1 milestone Feb 2, 2023
@v0y4g3r v0y4g3r self-assigned this Feb 2, 2023
killme2008 (Contributor) commented:

We may break the snapshot read semantics when implementing compaction: with the same sequence number for a snapshot read, a second read may find that some rows have been deleted by compaction.

Do we have any solutions to fix this issue? Looks like it's impossible.

waynexia (Member) commented Feb 2, 2023

> Do we have any solutions to fix this issue? Looks like it's impossible.

Maybe we need to introduce some mechanism like MVCC for it, but I don't think that's necessary.

v0y4g3r (Contributor, author) commented Feb 2, 2023

> we may break the snapshot read semantic when implementing compaction, since with the same sequence for snapshot read, the second read may find some rows are deleted because of compaction.
>
> Do we have any solutions to fix this issue? Looks like it's impossible.

We need to keep track of all SnapshotImpls that currently refer to level-0 SST files, and postpone deleting compacted level-0 SST files until all created snapshot reads have finished.

evenyag (Contributor) commented Feb 2, 2023

> we may break the snapshot read semantic when implementing compaction, since with the same sequence for snapshot read, the second read may find some rows are deleted because of compaction.

I think this might not be a problem. Currently we only support reading the latest data, in which case we acquire a reference to a stable Version struct that references the old SSTs. But we do need to make the file metadata reference counted and delete a file only after no "read" references it.

> Do we have any solutions to fix this issue?

Introduce snapshots like other databases do, if we need a stable snapshot but also want to release unused SSTs.
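One way to express the reference-counting idea from the comments above is to tie physical file deletion to the drop of the last handle. A minimal sketch, assuming an Arc-based Version/snapshot model; `FileMeta`, `open_read`, and the purge behavior are illustrative names, not GreptimeDB's actual types:

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::Arc;

struct FileMeta {
    name: String,
    deleted: AtomicBool, // set by compaction when the file becomes obsolete
}

impl FileMeta {
    fn mark_deleted(&self) {
        self.deleted.store(true, Ordering::Release);
    }
}

impl Drop for FileMeta {
    // Runs when the last Arc<FileMeta> (held by a Version or a reader) is
    // dropped; only then is it safe to remove the underlying file.
    fn drop(&mut self) {
        if self.deleted.load(Ordering::Acquire) {
            // A real engine would call std::fs::remove_file here.
            println!("purging {}", self.name);
        }
    }
}

/// Each snapshot read clones the handle, bumping the reference count.
fn open_read(meta: &Arc<FileMeta>) -> Arc<FileMeta> {
    Arc::clone(meta)
}
```

With this shape, compaction can mark a level-0 file "deleted" at any time; in-flight snapshot reads keep it alive until they finish, matching the postponed-deletion behavior discussed above.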

@evenyag evenyag added the tracking-issue A tracking issue for a feature. label Feb 14, 2023
@v0y4g3r v0y4g3r modified the milestones: v0.3, v0.1 Feb 15, 2023
@v0y4g3r v0y4g3r mentioned this issue Feb 17, 2023
killme2008 (Contributor) commented:

I think we can close this issue and open a new one when we want to support other compaction features.
