Performance improvement: object database query caching? #1100

Open
2 tasks
abitmore opened this issue Jun 26, 2018 · 3 comments
Labels
  • 1a Epic — High level concept to be addressed. Description should contain a list referencing child User Stories.
  • 2a Discussion Needed — Prompt for team to discuss at next stand up.
  • 6 Performance Impacts — flag identifying system/user efficiency, performance, etc.
  • performance

Comments

@abitmore
Member

Most of the data stored in the object database lives in red-black trees. Currently, every query walks from the root to a leaf, so each lookup is O(log(n)). In practice some data is much hotter than the rest, so it may make sense to cache the hottest results (pointers or addresses) to avoid repeating the same root-to-leaf search for the same data again and again.

Things to be done:

  • profiling / query-pattern analysis: find out what is queried most often, and whether a cache would help
  • caching design

Thoughts?

@abitmore added the performance, 1a Epic, 2a Discussion Needed, and 6 Performance Impacts labels on Jun 26, 2018
@jmjatlanta
Contributor

Here's a somewhat dated article about cache design. It lays out some high-level architecture ideas and points to consider, though the code shows its age: ipcc99.pdf

Things to think about: staleness due to age, staleness due to an operation, memory footprint, lookups of cached data, and the cost of deciding whether to consult the database or the cache.

How smart do we make it? Does it learn what to cache, or do we tell it what to cache?

@abitmore
Member Author

abitmore commented Jul 5, 2018

I guess an LRU cache would work in most cases.

The question is whether now is the right time to play with pointers, and whether the gain is worth the effort and risk. There is similar discussion in #1095.
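For reference, the LRU idea mentioned above can be sketched in a few dozen lines (this is a generic illustration, not code from the repository): a `std::unordered_map` gives O(1) average lookup, and a `std::list` keeps entries ordered by recency so the least-recently-used entry can be evicted in O(1). Note this sketch caches value copies; caching raw tree-node pointers, as proposed in the issue, would additionally require the invalidation machinery discussed later in this thread.

```cpp
#include <cstddef>
#include <list>
#include <unordered_map>
#include <utility>

// Minimal LRU cache sketch. The list holds (key, value) pairs with the
// most recently used entry at the front; the map points into the list.
template <typename Key, typename Value>
class lru_cache {
public:
    explicit lru_cache(std::size_t capacity) : _capacity(capacity) {}

    // Returns nullptr on a miss; on a hit, refreshes the entry's recency.
    const Value* get(const Key& k) {
        auto it = _index.find(k);
        if (it == _index.end()) return nullptr;
        _order.splice(_order.begin(), _order, it->second);  // move to front, O(1)
        return &it->second->second;
    }

    void put(const Key& k, Value v) {
        auto it = _index.find(k);
        if (it != _index.end()) {                 // update existing entry
            it->second->second = std::move(v);
            _order.splice(_order.begin(), _order, it->second);
            return;
        }
        if (_index.size() == _capacity) {         // evict least recently used
            _index.erase(_order.back().first);
            _order.pop_back();
        }
        _order.emplace_front(k, std::move(v));
        _index[k] = _order.begin();
    }

private:
    std::size_t _capacity;
    std::list<std::pair<Key, Value>> _order;
    std::unordered_map<Key,
        typename std::list<std::pair<Key, Value>>::iterator> _index;
};
```

The hard part, as the thread notes, is not the cache itself but deciding when cached pointers become stale.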

@clockworkgr
Member

clockworkgr commented Jul 25, 2018

I may be completely off the mark here, seeing as I'm not a C++ dev and I'm going purely on documentation I've read, so if I'm completely wrong just let me know.

What about Boost's notifying indices? It seems to me that they would allow for precise cache invalidation. I have no idea whether they are implementable here or what the performance hit (if any) would be; I just know that we use Boost containers, and according to the docs, notifying indices provide callbacks on object modification.

Combined with a fixed cache size and eviction of entries by last-access time, it seems as if it would self-optimise.

UPDATE: Apologies... it appears notifying indices are planned but not yet included in Boost.
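Even without notifying indices in Boost, the same idea can be hand-rolled: have the write path fire callbacks so any cache can drop the affected entry. The sketch below is a generic illustration of that pattern — none of these names are Boost or bitshares-core APIs, and a real integration would hook the object database's modify path instead.

```cpp
#include <functional>
#include <map>
#include <vector>

// Hand-rolled "notifying" store sketch: modifications invoke registered
// callbacks, giving caches a precise invalidation signal. All names are
// hypothetical, for illustration only.
template <typename Key, typename Value>
class notifying_store {
public:
    using callback = std::function<void(const Key&)>;

    void subscribe(callback cb) { _subscribers.push_back(std::move(cb)); }

    void modify(const Key& k, const Value& v) {
        _data[k] = v;
        for (auto& cb : _subscribers) cb(k);  // tell caches this key changed
    }

    const Value* find(const Key& k) const {
        auto it = _data.find(k);
        return it == _data.end() ? nullptr : &it->second;
    }

private:
    std::map<Key, Value> _data;        // stand-in for the object index
    std::vector<callback> _subscribers;
};
```

A cache would subscribe with a lambda that erases the modified key, so stale entries are removed the moment the underlying object changes rather than on a timer.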

No branches or pull requests · 3 participants