Shared local SSD cache for Velox #4836

zhouyuan · 2023-05-04T14:04:57Z

zhouyuan
May 4, 2023

Velox local caching feature is very helpful when reading from slow storage(S3 or remote HDFS w/ HDDs), it does not support sharing between executors on one compute node. For Spark applications one typical setup is to have multiple executors running on one compute node, e.g. on a 96-core server people may allocate 24 executors with 4 core per executor. Currently if local SSD cache is enable in this case, the same content maybe access by different executors thus the data will be cached multiple times on that compute node. The cache is not efficiently used. A shared local SSD cache can fix this issue. Here's a rough design for this:

An SSD Cache Daemon: On each compute node the daemon will control the shared cache state, the daemon will listen on a local domain socket
Executor instances will do IPC(unix domain socket based) with the daemon do read/write lock on a shared cache block
Cache blocks from tables(s) are cached in a shared area on compute node(s)
Reads are served from the shared cache, writes are not supported yet(?)

CC: @oerling @mbasmanova

thanks, -yuan

Yohahaha · 2023-05-09T06:19:26Z

Yohahaha
May 9, 2023

It looks like Alluxio ?

0 replies

zuochunwei · 2023-07-12T02:24:59Z

zuochunwei
Jul 12, 2023

GOOD

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shared local SSD cache for Velox #4836

{{title}}

Replies: 2 comments

{{title}}

{{title}}

Select a reply

Shared local SSD cache for Velox #4836

zhouyuan May 4, 2023

Replies: 2 comments

Yohahaha May 9, 2023

zuochunwei Jul 12, 2023

zhouyuan
May 4, 2023

Yohahaha
May 9, 2023

zuochunwei
Jul 12, 2023