Type-specific Kotlin collections

The purpose of this project is analyzing possibility of implementing collections with primitive values stored.

Desired properties of collections:

Kotlin-multiplatform library
implementing standard interfaces like MutableList, MutableMap
advantage about two times in memory
better operating with a cache than standard implementation
linked implementations of Map and Set

Map

Two possible implementations were considered - open addressing and chained.

In both cases some common optimizations were used:

power of two capacity in order to avoid mod operation
multiplicative Knuth hash

Open addressing

uses linear probing
eager deletion

Implementation of MutableMap<Long, Long> is here.

Advantages:

better operating with a cache as it uses only one array for navigating
smaller memory usage in case of big load factor
could be used for Object map implementation with some advantage in memory

Disadvantages:

problems with collisions as all elements are stored sequentially in one array
memory usage highly depends on capacity rather than size

Chained

Idea: store fields of nodes from standard implementation in several arrays(keys, values, next).

Implementation of MutableMap<Long, Long> is here.

Advantages:

algorithmically equivalent to standard implementation(better resolution of collisions than open addressing)
dense memory usage
memory usage depends on capacity * load factor - approximately size
smaller memory usage in case of small load factor

Disadvantages:

needs several extra arrays (free indexes, head indexes)
uses two arrays for navigation
higher memory usage for small primitives(byte, short, int)
could not be used for Object map implementation

Open addressing vs Chained

Memory usage

Table contains average advantage of implementations to the standard one in case of load factor 0,6.

chained implementation is better in case of Map<Long, Long>
for other primitives open addressing has better memory usage
chained implementation is unused in case of Map<Object, Object> (advantage is less than 1 in worst case)

Some other calculations of memory usage are here.

Performance(get)

When size of hash map is big enough we can see a great improvements in performance comparing with standard implementation.

Other primitives

Two possible implementations of other than long primitives were considered:

store everything as long and redirect all calls to LongLongMap
implement a generic version of hash map with specialized storage for primitive elements

Long implementation redirecting

higher memory usage

Generic implementation

Idea: all memory advantages of specialized implementations are concentrated on a primitive array. Therefore we can wrap a primitive array and all work with primitive type into Storage<T> class.

one map implementation for all types, no code generation is needed
a special generic reified fabric function allows to have only liner number of implementations rather than quadratic other case
better memory usage and performance rather than long-long implementation redirecting

Open addressing implementation of MutableMap<K, V> is here.

ArrayList

Memory advantage is 2.5-3 times.

Implementation of MutableList<Long, Long> is here.

All operations with ArrayList are very simple so there are problems with performance(see next section).

Problem

Map, List and other interfaces from standard library require returning Object in all methods therefore extra boxing is needed everywhere. This critically worsens performance when:

ArrayList is used
size of hash map is small
operations like foreach are used

When size of hash map is small one extra boxing takes large part of get execution so standard implementation works better.

Name	Absolute	Std slowness factor
List sum
Small map sum
Big map sum

Foreach(or sum) is a cheap operation so extra boxings are fatal for performance.

Profiler results of ArrayList sum:

Name	Profile
Std
My list
FastUtil

The problem could be solved if special methods like getOrDefault are used:

Kotlin Native

Kotlin native benchmarks differ for get and stay the same for foreach.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
.github/workflows		.github/workflows
JVMPerformance		JVMPerformance
Performance		Performance
gradle/wrapper		gradle/wrapper
images		images
src		src
README.md		README.md
build.gradle.kts		build.gradle.kts
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
settings.gradle.kts		settings.gradle.kts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Type-specific Kotlin collections

Map

Open addressing

Chained

Open addressing vs Chained

Memory usage

Performance(get)

Other primitives

Long implementation redirecting

Generic implementation

ArrayList

Problem

Kotlin Native

About

Releases

Packages

Contributors 2

Languages

zuevmaxim/TypeSpecificCollections

Folders and files

Latest commit

History

Repository files navigation

Type-specific Kotlin collections

Map

Open addressing

Chained

Open addressing vs Chained

Memory usage

Performance(get)

Other primitives

Long implementation redirecting

Generic implementation

ArrayList

Problem

Kotlin Native

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages