
Question: Best practices for automating compaction / defrag #7607

Closed
davissp14 opened this issue Mar 27, 2017 · 10 comments

davissp14 commented Mar 27, 2017

Reading through: https://github.com/coreos/etcd/blob/master/Documentation/op-guide/maintenance.md

Looking at compaction:

> Since etcd keeps an exact history of its keyspace, this history should be periodically compacted to avoid performance degradation and eventual storage space exhaustion.

What metrics should we be using to determine when compaction is necessary or at the very least a good idea?

> After compacting the keyspace, the backend database may exhibit internal fragmentation. Any internal fragmentation is space that is free to use by the backend but still consumes storage space. The process of defragmentation releases this storage space back to the file system. Defragmentation is issued on a per-member basis so that cluster-wide latency spikes may be avoided.

What metrics should be used to monitor fragmentation? Initially I assumed that monitoring HeapAlloc and HeapInuse would get me close, but it seems I was mistaken.

Any thoughts or advice?

Thanks in advance.


xiang90 commented Mar 27, 2017

> What metrics should we be using to determine when compaction is necessary or at the very least a good idea?

It depends on your application. If your application is OK with keeping an hour of history, compact every hour; the shorter the retention window, the better. The limiting factor is the total db size, though: do not let it grow beyond 2GB if you use a normal cloud machine.
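A minimal sketch of automating this with `etcdctl` (assuming etcd v3.1+, an `$ENDPOINTS` variable like the one in the benchmark below, and that grepping the JSON status output is good enough for a cron job):

```sh
# Fetch the current revision from the cluster, then compact all
# history older than that revision.
rev=$(ETCDCTL_API=3 etcdctl --endpoints=$ENDPOINTS endpoint status --write-out=json \
  | grep -oE '"revision":[0-9]+' | head -1 | grep -oE '[0-9]+')
ETCDCTL_API=3 etcdctl --endpoints=$ENDPOINTS compact "$rev"
```

Recent releases can also do this internally via the `--auto-compaction-retention` flag (hours of history to retain), which avoids the external cron job.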

> What metrics should be used to monitor fragmentation?

This is actually about disk fragmentation. Unless you suddenly remove a lot of keys and want to reclaim the disk space immediately, you do not need to defrag.
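When a defrag is warranted, the maintenance guide's per-member advice can be scripted along these lines (a sketch, again assuming a comma-separated `$ENDPOINTS` list):

```sh
# Defragment members one at a time: defrag blocks the member while
# it rewrites its db file, so never hit the whole cluster at once.
for ep in $(echo "$ENDPOINTS" | tr ',' ' '); do
  ETCDCTL_API=3 etcdctl --endpoints="$ep" defrag
done
```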


davissp14 commented Mar 27, 2017

Information below pertains to version 3.1.4 using API v3.

I went ahead and ran a few benchmarks to see if I could get a better understanding of what's going on.

After seeding some data, I ran a compact / defrag. I know you mentioned that defrag is a disk fragmentation thing, but it also seems to reclaim the cache along with it.
[screenshot from 2017-03-26 18-58-20]

Normally I wouldn't really care about the cache, since it should be freed under memory pressure, but in these benchmarks we are seeing quite a few failcnts starting at the time the cache fills up. It appears that the cache isn't being freed very efficiently.

Note: This is a separate benchmark.
[screenshot from 2017-03-27 07-10-25]

Failcnts associated with the above benchmark.
[screenshot from 2017-03-27 07-10-53]

It appears we are seeing a negative performance impact due to the aggressive caching. It also doesn't seem like constantly defragging is a great solution to this problem.

Any thoughts?


xiang90 commented Mar 27, 2017

> It appears we are seeing a negative performance impact due to the aggressive caching. It also doesn't seem like constantly defragging is a great solution to this problem.

Can you share the benchmark result?


davissp14 commented Mar 27, 2017

Benchmark used (5M sequential puts over a 10-key space, so nearly all of the resulting db growth is MVCC history):

```sh
./benchmark --endpoints=$ENDPOINTS --conns=100 --clients=1000 \
  put --key-size=100 --key-space-size=10 --sequential-keys \
  --total=5000000 --val-size=20 --user=root:$PASSWORD
```

[screenshot from 2017-03-27 10-43-38]


xiang90 commented Mar 27, 2017

How did you figure out that the cache size has an impact on the benchmark result? I want to see a graph of benchmark performance versus cache size. Also note that the more keys you put into etcd, the less throughput you might get if you run etcd on a slow HDD: the B-tree grows deeper, so more I/O will be needed.


davissp14 commented Mar 27, 2017

I guess I should clarify. etcd seems to quickly gobble up available memory for caching purposes. Once all available memory has been allocated to the cache, we start seeing a lot of cache evictions, which is expected. The result of the constant evictions, however, is failcnts and slower response times.

The results below are from a cluster that has been allocated just enough memory to avoid forced cache evictions when running the benchmark. I can work on creating a better comparison, but it may take a bit due to other obligations.

The AWS instance types I have been testing on are:
i2.4xlarge with ephemeral disks in RAID 0
r3.4xlarge with a single SSD

The only resource limitation I am enforcing via cgroups is memory. There doesn't seem to be any significant performance difference between the two instance types; both have pretty solid I/O, though.
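For reference, this is roughly how the failcnts are being read (a sketch assuming cgroup v1 and a memory cgroup named `etcd`; the path is hypothetical and depends on how the limit is set up):

```sh
# memory.failcnt counts how often the cgroup hit its memory limit;
# memory.stat splits usage into page cache vs. anonymous memory (rss).
cat /sys/fs/cgroup/memory/etcd/memory.failcnt
grep -E '^(cache|rss) ' /sys/fs/cgroup/memory/etcd/memory.stat
```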

[screenshot from 2017-03-27 12-26-28]


xiang90 commented Mar 28, 2017

@davissp14

The average latency is very high for 1.5k throughput. I am more interested in the comparison; I want to see the negative impact of the cache. The evicted memory should be the old tree nodes, and if you do sequential writes, the tree nodes should all be in memory (a few MB should be far more than enough), so I am surprised this has an observable impact.


gyuho commented Apr 20, 2018

We will document how to monitor this in #9438.

Let's move this discussion to #9438.
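In the meantime, a rough way to gauge fragmentation is to compare the db file's total size against its logical in-use size on the /metrics endpoint (a sketch; the `_in_use_` gauge only exists in newer releases, and 3.1/3.2 expose the total size under an `etcd_debugging_` prefix instead):

```sh
# The gap between total size and in-use size approximates the
# reclaimable (fragmented) space that a defrag would release.
curl -s http://127.0.0.1:2379/metrics \
  | grep -E '^etcd_mvcc_db_total_size(_in_use)?_in_bytes'
```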

regardfs commented

> What metrics should be used to monitor fragmentation?

> This is actually about disk fragmentation. Unless you suddenly remove a lot of keys and want to reclaim the disk space immediately, you do not need to defrag.

@xiang90 Does that just mean fragmentation will not influence the performance of the etcd cluster, only that no more disk space is freed?

regardfs commented

@gyuho Hi, what will happen if no defrag is ever run? Is the only effect that disk space is not freed, or does it also influence the performance of the etcd cluster?
