Inquiry About Read Cache Workload Testing Practices #3687

PsiACE · 2024-04-10T04:45:54Z

PsiACE
Apr 10, 2024

I am currently in the process of evaluating various caching solutions and their performance characteristics, and I am particularly interested in the methodologies your team employs to test Read Cache efficiency.

Could you share some insights into the types of workloads you use to test Read Cache? I am looking for both single-node and distributed benchmarking scenarios that can help me understand how the Read Cache performs under different conditions.

Moreover, if there are any specific benchmarking tools or frameworks that you would recommend for these purposes, I would greatly appreciate learning about them.

Answered by waynexia

Apr 10, 2024

To be honest, read cache at present is quite naïve compared to other maturation systems. We've implemented a layered cache on file using a local disk, some memory cache for file pages, internal structures and metadata. And most of them are simple capacity-bounded LRU. Don't know if this answers your curiosity, and please ask any details you are still interested.

I'm also looking for workload patterns that will clearly demonstrate the impact of Read Cache on performance within these metrics.

To my understanding, methods like counting miss/hit times can give real-time insight into how a cache works. For benchmarking whose workload is simpler and easy to reproduce, run it twice with and wi…

View full answer

PsiACE · 2024-04-10T04:53:43Z

PsiACE
Apr 10, 2024
Author

I'm aware that metrics provide insights into hit/miss ratios, yet I'm keen to grasp the typical data sets and query patterns employed for benchmarking Greptime. I'm also looking for workload patterns that will clearly demonstrate the impact of Read Cache on performance within these metrics.

0 replies

waynexia · 2024-04-10T17:07:47Z

waynexia
Apr 10, 2024
Maintainer

To be honest, read cache at present is quite naïve compared to other maturation systems. We've implemented a layered cache on file using a local disk, some memory cache for file pages, internal structures and metadata. And most of them are simple capacity-bounded LRU. Don't know if this answers your curiosity, and please ask any details you are still interested.

I'm also looking for workload patterns that will clearly demonstrate the impact of Read Cache on performance within these metrics.

To my understanding, methods like counting miss/hit times can give real-time insight into how a cache works. For benchmarking whose workload is simpler and easy to reproduce, run it twice with and without cache respectively should provide a reliable result.

Moreover, if there are any specific benchmarking tools or frameworks that you would recommend for these purposes, I would greatly appreciate learning about them.

We currently leverage two suites to benchmark the db: tsbs and vmbench. And we're also considering making one to cover more scenarios. Appreciate it in advance if you have any insight on how to build a benchmark tool that can share with.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Greptime

Inquiry About Read Cache Workload Testing Practices #3687

{{title}}

Replies: 2 comments

{{title}}

{{title}}

Select a reply

Greptime

Inquiry About Read Cache Workload Testing Practices #3687

PsiACE Apr 10, 2024

Replies: 2 comments

PsiACE Apr 10, 2024 Author

waynexia Apr 10, 2024 Maintainer

PsiACE
Apr 10, 2024

PsiACE
Apr 10, 2024
Author

waynexia
Apr 10, 2024
Maintainer