SegmentPool improvements #352

fzhinkin · 2024-07-04T13:55:28Z

Currently, SegmentPool implementation on JVM works more like a cache than a pool: it has a relatively small fixed size, segments are taken from the pool on an effort basis, and it's easy to make a segment non-recyclable (meaning that it will never be returned to the pool).

This behavior is totally fine for many applications, but it's not for Ktor.
This PR addresses several issues:

Support precise segment reference tracking to return shared segments back to the pool eventually. Currently, once a segment is shared, it is never returned to the pool because the segment state is tracked by a flag that could be set but could not be used. To solve that problem, on JVM, the exact number of shared copies is now tracked, and the value is decremented on attempts to recycle shared copies. Once the last copy is passed to the Segment.recycle, it'll be returned to the pool.
Retry lost CAS attempts when taking or returning segments from the pool. The previous sentence is quite descriptive; I'll only add that it does not make much sense without the next change.
Support segment pool size configuration. Currently, the segment pool consists of multiple buckets (the exact number depends on the system's logical CPU count), each with a max size of 64KiB. This PR adds a second level/tier to the pool with half as many buckets as an old first tier. The total size of the new tier is configurable via system properties and is 0 by default. All failed attempts to take or recycle segments in the first tier use the second tier as a fallback. Unlike the first tier, if the second tier's bucket is empty, take will scan all other buckets. The scan is also performed on recycle to a full bucket.

All these changes reduce the allocation count by combining more precise work with the pool and optionally extensible pool size.
Two-level design helps with avoiding performance penalties induced by the scan technique employed by the second tier when that tier is not used and, at the same time, allows using the same sharding technique as in the first tier enhanced with the bucket steeling techniques.

fzhinkin · 2024-07-04T14:00:17Z

CC @e5l @bjhham

fzhinkin · 2024-07-10T12:55:56Z

core/jvm/src/SegmentPool.kt

+    private val HASH_BUCKET_COUNT_L2 = (HASH_BUCKET_COUNT / 2).coerceAtLeast(1)
+
+    private val SECOND_LEVEL_POOL_TOTAL_SIZE =
+        System.getProperty("kotlinx.io.pool.size.bytes", "0").toInt().coerceAtLeast(0)


Discussed with @e5l that it makes sense to bump up the default size up to several megs (probably, 4Mb).

Not sure if it should be done on Android.

What really puzzles me here is how it affects the end users.
Because it seems like the characteristics of the IO (which is the way to do a lot of stuff) will depend on whether or not Ktor is used in the project.

I don't mind it as a palliative intermediate solution as the ground for further benchmarking, but definitely not something to keep as as

I checked how BufferReadWriteByteArray performs depending on the presence of the second-level pool and the working set size.

If WSS is within L1-pool, there's no difference.
If WSS doesn't fit into L1-pool, it might be beneficial not to use L2-pool (i.e. it's faster w/o L2-pool) until some threshold size, but then L2-pool boosts the performance.
But it's hard to judge by looking at synthetic benchmarks only, as they provide zero to no evidence regarding GC-related effects on overall performance.

We can always revert it and also provide some alternative mechanism to adjust the pool size, so those who really demands it (Ktor), could tune it when needed.

qwwdfsad

Round of cosmetics to speed up the convergence, still digging into the segment pool details

core/common/src/Segment.kt

core/jvm/src/SegmentPool.kt

core/common/src/SegmentPool.kt

core/jvm/src/SegmentPool.kt

core/common/src/SegmentPool.kt

Co-authored-by: Vsevolod Tolstopyatov <qwwdfsad@gmail.com>

shanshin · 2024-07-11T09:23:05Z

core/jvm/src/SegmentPool.kt

+        else -> "4194304" // 4MB
+    }
+
+    private val SECOND_LEVEL_POOL_TOTAL_SIZE =


when is the first usage of class SegmentPool?

Is there a high chance of programmatically changing this value before its first reading?

when is the first usage of class SegmentPool?

On a first write into a buffer

Is there a high chance of programmatically changing this value before its first reading?

I think it's achievable by updating the system property before SegmentPool is initialized.

I think it's achievable by updating the system property before SegmentPool is initialized.

if this is acceptable, then fine

I don't see any severe consequences of that. That's the only kind of dynamic configuration (#352 (comment)) we can provide for now. :)

core/common/src/unsafe/UnsafeBufferOperations.kt

core/common/src/Segment.kt

core/jvm/src/SegmentPool.kt

core/jvm/test/PoolingTest.kt

core/jvm/src/SegmentPool.kt

fzhinkin added 10 commits June 27, 2024 14:18

Add a test on segments leackage

2e0383a

Replace the shared flag with a copy-tracker

b9dd6d1

Make share token nullable

0d8d33f

Support two-level segment pool

f4de95b

Retry take/recycle until successful CAS

238f909

Renamed pool size property

7f10170

Update KDoc

8f7a282

Use sharded L2-pool

28da07e

Cleanup

d809904

Cleanup

76995c2

fzhinkin requested review from shanshin and qwwdfsad July 4, 2024 13:59

fzhinkin mentioned this pull request Jul 4, 2024

Better segment pools #311

Open

7 tasks

fzhinkin commented Jul 10, 2024

View reviewed changes

qwwdfsad reviewed Jul 10, 2024

View reviewed changes

Bump up second level cache size to 4 megs on JVM

18e0aa3

qwwdfsad reviewed Jul 10, 2024

View reviewed changes

core/jvm/src/SegmentPool.kt Outdated Show resolved Hide resolved

fzhinkin added 2 commits July 10, 2024 15:28

Cleanup

40d2f8b

Replace arrays of AtomicReferences with AtomicReferenceArrays

3dac356

qwwdfsad reviewed Jul 10, 2024

View reviewed changes

core/common/src/SegmentPool.kt Outdated Show resolved Hide resolved

fzhinkin and others added 2 commits July 10, 2024 15:46

Fixed description of ops on buckets

58b98cc

Improve KDoc

3f6e7c0

Co-authored-by: Vsevolod Tolstopyatov <qwwdfsad@gmail.com>

fzhinkin requested a review from qwwdfsad July 10, 2024 14:28

shanshin reviewed Jul 11, 2024

View reviewed changes

core/jvm/src/SegmentPool.kt Outdated Show resolved Hide resolved

fzhinkin added 4 commits July 11, 2024 14:44

Rename and reimplement tracker's removeCopy

8834cf7

Be more lenient when parsing system property

ac84646

Make simple copy tacker an object

7714e08

Improve bucket exhaustion check

564289d

Added an extra test case

cb5c499

fzhinkin requested a review from shanshin July 11, 2024 12:45

shanshin approved these changes Jul 11, 2024

View reviewed changes

qwwdfsad approved these changes Jul 12, 2024

View reviewed changes

fzhinkin merged commit be2bfca into develop Jul 12, 2024
1 check passed

fzhinkin deleted the two-level-segment-pool branch July 12, 2024 10:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SegmentPool improvements #352

SegmentPool improvements #352

fzhinkin commented Jul 4, 2024

fzhinkin commented Jul 4, 2024

fzhinkin Jul 10, 2024

qwwdfsad Jul 10, 2024

fzhinkin Jul 10, 2024

fzhinkin Jul 10, 2024

qwwdfsad left a comment

shanshin Jul 11, 2024

fzhinkin Jul 11, 2024

shanshin Jul 11, 2024

fzhinkin Jul 11, 2024

SegmentPool improvements #352

SegmentPool improvements #352

Conversation

fzhinkin commented Jul 4, 2024

fzhinkin commented Jul 4, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

qwwdfsad left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment