Consider alternative Sink.writeString implementations on JVM #316

fzhinkin · 2024-05-07T11:09:53Z

On JVM, instead of reading each character separately and then encoding it to UTF-8 and writing to a buffer, it might be faster to:

- extract chars to a CharArray and then iterate over it;
- simply use toByteArray.
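
To make the two options concrete, here is a minimal sketch of both, using only the Kotlin/JVM standard library. It is not the kotlinx-io implementation: a `ByteArrayOutputStream` stands in for a `Sink`, and the function names `encodeViaCharArray` and `encodeViaToByteArray` are hypothetical.

```kotlin
import java.io.ByteArrayOutputStream

// Option 1: copy the chars out once, then encode to UTF-8 while iterating the array.
// `out` is a hypothetical stand-in for a Sink.
fun encodeViaCharArray(string: String, out: ByteArrayOutputStream) {
    val chars = string.toCharArray()
    var i = 0
    while (i < chars.size) {
        val c = chars[i].code
        when {
            // 1-byte sequence (ASCII)
            c < 0x80 -> out.write(c)
            // 2-byte sequence
            c < 0x800 -> {
                out.write(0xC0 or (c shr 6))
                out.write(0x80 or (c and 0x3F))
            }
            // Valid surrogate pair: 4-byte sequence
            c in 0xD800..0xDBFF && i + 1 < chars.size && chars[i + 1].code in 0xDC00..0xDFFF -> {
                val cp = 0x10000 + ((c - 0xD800) shl 10) + (chars[i + 1].code - 0xDC00)
                out.write(0xF0 or (cp shr 18))
                out.write(0x80 or ((cp shr 12) and 0x3F))
                out.write(0x80 or ((cp shr 6) and 0x3F))
                out.write(0x80 or (cp and 0x3F))
                i++ // skip the low surrogate
            }
            // Unpaired surrogate: replace with '?'
            c in 0xD800..0xDFFF -> out.write('?'.code)
            // 3-byte sequence
            else -> {
                out.write(0xE0 or (c shr 12))
                out.write(0x80 or ((c shr 6) and 0x3F))
                out.write(0x80 or (c and 0x3F))
            }
        }
        i++
    }
}

// Option 2: let the JDK encode the whole string, then copy the resulting bytes.
fun encodeViaToByteArray(string: String, out: ByteArrayOutputStream) {
    val bytes = string.toByteArray(Charsets.UTF_8)
    out.write(bytes, 0, bytes.size)
}
```

Both functions should produce the same bytes for well-formed input; the difference under discussion is only how fast they get there and how much they allocate.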

For other libraries, namely kotlinx.serialization, some of these approaches performed better. While quick ad-hoc experiments didn't show any benefit for kotlinx-io, it makes sense to investigate this thoroughly.


fzhinkin · 2024-08-27T21:15:37Z

The combination of String::toByteArray and UnsafeBufferOperations::moveToTail shows better performance for strings whose chars can all be encoded using byte sequences of the same length. However, the current implementation significantly outperforms the String::toByteArray-based approach on strings whose characters require byte sequences of varying lengths.
And, of course, String::toByteArray results in a higher allocation rate.
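
As an illustration of the distinction between the two kinds of strings (not of the benchmark itself), the sketch below compares char counts with UTF-8 byte counts. The helper name `utf8Length` is hypothetical.

```kotlin
// Hypothetical helper: number of bytes the string occupies once encoded as UTF-8.
// Note that it allocates a fresh ByteArray on every call.
fun utf8Length(s: String): Int = s.toByteArray(Charsets.UTF_8).size

// Every char encodes to a 1-byte sequence: 5 chars, 5 bytes.
val sameLength = "hello"

// A mix of 1-, 2- and 3-byte sequences: 6 chars, 9 bytes.
val varyingLength = "h\u00E9llo\u20AC"
```

Here `utf8Length(sameLength)` is 5 and `utf8Length(varyingLength)` is 9, since U+00E9 takes two bytes and U+20AC takes three.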

qwwdfsad · 2024-08-28T09:03:50Z

In serialization, we leverage the intrinsified String::getChars (pros: vectorized, much faster unpacking of compact strings, no range checks) and also rely on the fact that our CharArrays are pooled, so there are no allocations.
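
A rough sketch of that pattern, under stated assumptions: `CharArrayPool` below is a hypothetical, single-threaded pool written for illustration and is not kotlinx.serialization's actual pool, and `countAsciiChars` is a placeholder for whatever per-char work the caller does. The bulk copy uses the Kotlin/JVM `String.toCharArray(destination, ...)` overload, which to my understanding delegates to `String.getChars`.

```kotlin
// Hypothetical, non-thread-safe pool of reusable CharArrays.
object CharArrayPool {
    private val free = ArrayDeque<CharArray>()

    // Returns a pooled array of at least minSize chars, or allocates a new one.
    // A pooled array that is too small is simply dropped.
    fun take(minSize: Int): CharArray {
        val candidate = free.removeLastOrNull()
        return if (candidate != null && candidate.size >= minSize) candidate
        else CharArray(maxOf(minSize, 128))
    }

    fun release(array: CharArray) {
        free.addLast(array)
    }
}

// Placeholder workload: bulk-copy the chars into a pooled array, then scan the array.
fun countAsciiChars(string: String): Int {
    val chars = CharArrayPool.take(string.length)
    try {
        // Bulk copy into the existing array instead of calling string[i] per char.
        string.toCharArray(chars, 0, 0, string.length)
        var count = 0
        for (i in 0 until string.length) {
            if (chars[i].code < 0x80) count++
        }
        return count
    } finally {
        CharArrayPool.release(chars)
    }
}
```

After the first call has populated the pool, later calls with strings of up to 128 chars reuse the same array, so the copy itself does not allocate.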

fzhinkin · 2024-08-28T18:14:44Z

For kotlinx-io, it seems like such an approach does not provide any significant performance improvements on average:

kotlinx-io/core/jvm/src/SinksJvm.kt, line 147 in 435acfb:

    public fun Sink.writeStringJvm2(string: String, startIndex: Int = 0, endIndex: Int = string.length) {

https://jmh.morethan.io/?source=https://gist.githubusercontent.com/fzhinkin/a11a2ce595cadb8fba700cdbe18a6f4f/raw/fbb87909636731439aac80948fa023bcc10d4269/toCharArray-based-writeString.json

In some scenarios, performance is better, in others it's worse.
