
Reader connection pool in native-driver #2125

Closed
wants to merge 10 commits

Conversation

@andersio (Contributor) commented Dec 23, 2020

Reader pool

Implement a reader connection pool in the NativeSqlDriver as discussed in #1986.

This is an opt-in feature — it emulates the current SQLDelight default (one reader connection), unless maxConcurrentReader > 1 is specified when instantiating NativeSqlDriver. As a "bonus", if you always wrap everything in a transaction, no reader connection will be instantiated.

The new Pool<T> is implemented using a mix of lock-free atomics, a pthread mutex and a pthread condition variable. In summary:

  • borrowing is lock-free in the general case;
  • when no entry is available, the borrowing thread is suspended (via pthread_cond_wait()); and
  • one blocked thread (if any) is woken up every time a borrow is released (via pthread_cond_signal()).

Pool<T> is used for both the reader pool (configurable capacity), and the transaction/writer pool (hardcoded capacity of 1). In other words, SinglePool<T> has been removed.

Note that the native-driver module has been restructured into four source sets: commonMain, commonTest, nativeDarwinMain and mingwMain. Almost all code previously in nativeMain/nativeTest is now common code, which, as a side effect, gives it a working IDE experience.

The source-set reshuffling makes space for a small set of platform-specific expect/actual declarations, since divergence is now required between Windows and Darwin/UNIX due to discrepancies in the imported API signatures of pthreads.

Runtime read-only enforcements

Two runtime checks have been enabled on reader connections:

  • PRAGMA query_only = 1: This instructs SQLite to reject any data-manipulation statements.
  • ThreadConnection.isReadOnly: This allows SQLDelight to catch internal misuse of reader connections in accessConnection().

Future directions

@kpgalligan (Collaborator)

Will take a look at this soon-ish. I'm in the middle of a rework of the native driver, but starting at the sqliter level currently. There are a few other performance improvements that have been discussed but never implemented that I want to explore.

@andersio (Contributor, author) commented Dec 30, 2020

While working on this, I did wonder if thread-local connections are a simpler choice for today's Kotlin/Native, which also dodges the headaches around the cost of shared data structures in the current strict memory model.

With thread pools absent from the ecosystem, likely until the new memory model is ready, users will probably stick to the library/language defaults in their K/N applications, which means 1-2 threads at most. So thread-local connections don't seem a bad choice for the time being, putting aside caveats when using connections in native callbacks.

@kpgalligan (Collaborator)

Most will be in main and default threads, but not everybody. I think some form of reader pool makes sense, but the implementation with Stately collections for statement caching, etc, is way overdue for an update. We had coded up some optimizations in the past that just never made it into the code base, but I really want to rethink how shared data works in the driver. I've had a lot of time to think on how KN does things since the first version of Stately. There are much better ways to manage sharing state. Not to rail on my own library, but those atomic collections are not great.

@andersio (Contributor, author) commented Dec 31, 2020

I fear we are going off topic here. 😛

But since you mentioned the data structures: having poked around a bit more, I see some potential for removing or replacing the hot data structures in today's ThreadConnection, e.g.:

  • SQLiter adopting sqlite_close_v2 might render tracking active SqlCursors in ThreadConnection unnecessary.
  • We could have an insert-only statement cache, and track usage on a per-entry basis. This way the cache would have a fairly low write rate (once per statement per app invocation), and hence be suitable for the simpler copy-on-write approach.

@kpgalligan (Collaborator)

sqlite_close_v2 is interesting, but I need to understand it quite a bit better. To review.

I'd need to look at your cache changes more closely, but for write statements there's no need to have more than one per write connection, and only one connection can be used for writing anyway, so the adding/removing causes a lot of unnecessary churn. We coded something for this almost two years ago, while measuring the driver performance: #1226. The changes never made it into the driver, though.

@ivyspirit (Contributor) commented Jan 25, 2021

If WAL mode is enabled, writes and reads are supposed to be able to happen at the same time, even when reads are wrapped in a transaction. That is the common case for us: we wrap lots of our reads in transactions for different reasons. With the current design, a read transaction cannot start until the previous transaction is committed.

> While having worked on this, I did wonder if thread-local connections are a simpler choice for today's Kotlin/Native, which also dodges the headaches around the cost of shared data structures in the current strict memory model.
>
> With thread pools being absence from the ecosystem, likely until the new memory model is ready, users are likely sticking to the library/language defaults in their K/N applications, which means 1-2 threads at maximum. So thread local doesn't seem a bad choice for the time being, putting aside caveats when using connections in native callbacks.

In terms of the data structures: based on profiling, the bottleneck right now really is the safePut function. The data structures used to cache statements, frozenLinkedList and frozenHashMap, are very expensive to modify. One proposal here is:

  1. Since we expect writes to always happen on a single thread, can we make the write connection not shared by multiple threads?
  2. Support passing a dedicated write thread to the driver to create the write connection.
  3. In execute, check the current thread and throw an exception if the execution does not happen on the dedicated write thread. It is the client's responsibility to make sure that write transactions and write executions happen on the write thread.
  4. Replace frozenLinkedList and frozenHashMap with normal data structures, which will significantly help performance.
  5. Allow creating a transaction from any connection, so starting a read transaction will not be blocked by a write. If there is a write in the transaction, the check in 3) above will catch it.

Sorry if this is not the right place for the discussion. I can move this to a different issue. Let me know what you think. @kpgalligan @andersio

@andersio (Contributor, author) commented Jan 25, 2021

@ivyspirit

> Which is most of the case for us: we wrap lots of our reads in transactions for different reasons.

You mean lack of read transaction support, right? This is indeed a missing corner in the SQLDelight API.

I think it should be tackled independently since it impacts the API instead of just reader concurrency of the native-driver (i.e. this PR).

@ivyspirit (Contributor) commented Jan 25, 2021

Yeah, we can move to a separate discussion. But I don't think it is necessary to change the API. If we do the thread check in execute, a read transaction should be able to be started on any connection. But if there is a write, it has to start on the write thread, which we map to the write-thread connection; otherwise the check in execute would throw an exception.

@kpgalligan (Collaborator)

I've been working on merging this with other planned changes. It's still a bit of a work in progress, but connection management has changed somewhat. The isolated write thread is going away (unless there's a really good reason to keep it), data structures are changing (no frozen on apple clients), write statements aren't removed from the cache, etc. Should be posting something soon.

@andersio (Contributor, author) commented Feb 17, 2021

@kpgalligan

> The isolated write thread is going away

It is alright not to have a dedicated connection for writes. But I think it is still valuable to keep the application-level locking for any write (single statement or transaction), so people need not worry about transactions failing to start (edit: or upgrade) with SQLITE_BUSY in single-process setups.

@kpgalligan (Collaborator)

> It is alright not to have a dedicated connection for writes. But I think it is still valuable to keep the application-level locking for any write (single statement or transaction), so people need not worry about transactions failing to start with SQLITE_BUSY in single-process setups.

Totally agree

kpgalligan@038afc4#diff-fdb8f8a5153034bba1cdc1adbb25f25280324223f01bc660b6507804cfce0925R24

My statement "Should be posting something soon" got a little derailed with other priorities but still intend to update soon.

@andersio (Contributor, author) commented Mar 4, 2021

> (unless there's a really good reason to keep it)

I found a reason not to keep it.

I recently had problems with tests using in-memory DBs being intermittently stuck at a write operation. It turns out the in-memory SQLite DB had been locked by a parallel reader when the writer attempted to write, because in-memory DBs don't support WAL and its concurrency model. So the 1 reader + 1 writer setup can cause issues when application/test code does go parallel.

So allowing exactly one connection can help with in-memory DB usage, especially as a substitute for a WAL database in tests. The same goes for rollback-journal databases, I suppose, though they are probably rare on Apple platforms these days.

@AlecKazakova (Collaborator)

Closing and moving progress into #2303

4 participants