The new SeparateBodyFileCache design makes updates/reads non-atomic #324

itamarst · 2023-10-18T14:23:59Z

The problem

Scenario 1

The program crashes after writing the metadata but before writing the body. The cache now thinks the body is empty.

Scenario 2

Multiple processes are using the same cache directory. Process A writes the metadata but has not yet written the body. Process B reads the metadata, then reads the body, and concludes that the body is empty.
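Both scenarios can be reproduced with a toy model. This is only a sketch, not cachecontrol's actual code: the function names, the `.body` suffix, and the choice to treat a missing body file as empty are assumptions made for illustration.

```python
import os
import tempfile

BODY_SUFFIX = ".body"  # hypothetical naming, chosen for this sketch


def write_metadata(directory, key, metadata):
    with open(os.path.join(directory, key), "wb") as f:
        f.write(metadata)


def write_body(directory, key, body):
    with open(os.path.join(directory, key + BODY_SUFFIX), "wb") as f:
        f.write(body)


def naive_read(directory, key):
    """Read metadata, then body, treating a missing body file as empty."""
    with open(os.path.join(directory, key), "rb") as f:
        metadata = f.read()
    try:
        with open(os.path.join(directory, key + BODY_SUFFIX), "rb") as f:
            body = f.read()
    except FileNotFoundError:
        body = b""  # assumption: the reader cannot tell "missing" from "empty"
    return metadata, body


def demo():
    with tempfile.TemporaryDirectory() as directory:
        write_metadata(directory, "entry", b"status=200")
        # A crash (Scenario 1) or a read by another process (Scenario 2)
        # at this point observes metadata without a body.
        torn = naive_read(directory, "entry")
        write_body(directory, "entry", b"hello")
        complete = naive_read(directory, "entry")
    return torn, complete
```

Calling `demo()` returns one read taken between the two writes and one taken after both; the first pairs valid metadata with an empty body.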

Workarounds

For pip I added logic that basically pretends a cache entry is missing if it doesn't have both body and metadata files.
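A minimal sketch of that kind of check, assuming the metadata and body live at two known paths. The function name and signature are hypothetical, and this is not pip's actual code.

```python
def read_if_complete(metadata_path, body_path):
    """Return (metadata, body), or None if either file is absent.

    Hypothetical helper: a half-written entry is reported as a cache miss
    instead of being returned with an empty body.
    """
    try:
        with open(metadata_path, "rb") as f:
            metadata = f.read()
        with open(body_path, "rb") as f:
            body = f.read()
    except FileNotFoundError:
        return None
    return metadata, body
```

A check like this only detects a file that is absent. It does not detect a body file that exists but has been only partially written.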

Solutions

Probably need both a new API that involves writing and reading both at once, and an implementation that makes sure that's atomic. See also the somewhat related #325.


itamarst · 2023-10-18T14:36:55Z

This is my fault, by the way... 😢

We discussed various solutions on the pip side here: pypa/pip#12361

woodruffw · 2023-10-18T14:40:42Z

Thanks for the report!

> Probably need both a new API that involves writing and reading both at once, and an implementation that makes sure that's atomic. See also the somewhat related #325.

This makes sense to me -- if I'm understanding correctly, this would mean replacing get + get_body with a single get, correct? Along with internal changes to ensure that we synchronize/atomize the two I/O operations?

itamarst · 2023-10-18T14:46:34Z

Yeah, plus have a single write API function.

For cross-file atomicity, writing the body file first might do the trick, if reading code reads the metadata file first. This would allow the file format to stay the same, at least.
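The ordering idea could look roughly like this, combined with a single write function and a single read function. The names `set_entry` and `get_entry` and the `.body` suffix are hypothetical; this is a sketch of the ordering only, not a proposed final API.

```python
import os


def _paths(directory, key):
    # Hypothetical layout: metadata at <key>, body at <key>.body
    return os.path.join(directory, key), os.path.join(directory, key + ".body")


def set_entry(directory, key, metadata, body):
    """Write the body first, so metadata never appears without a body."""
    metadata_path, body_path = _paths(directory, key)
    with open(body_path, "wb") as f:
        f.write(body)
    with open(metadata_path, "wb") as f:
        f.write(metadata)


def get_entry(directory, key):
    """Read the metadata first; no metadata file means a cache miss."""
    metadata_path, body_path = _paths(directory, key)
    try:
        with open(metadata_path, "rb") as f:
            metadata = f.read()
    except FileNotFoundError:
        return None
    with open(body_path, "rb") as f:
        body = f.read()
    return metadata, body
```

With this ordering, a crash between the two writes leaves a body file with no metadata, which a reader treats as a miss. The sketch covers the first write of an entry; each individual file write is still not atomic, and overwriting an existing entry can still pair old metadata with a new body.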

(It would also be nice if the write API didn't take the full bytes, since that means you have to keep the full response in memory, but that may be a bit too much work. Or it could use the mmap hack again to work around that and accept objects implementing the buffer API.)
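As a rough illustration of accepting buffer-API objects, a write function can take anything `memoryview` accepts, including an `mmap.mmap`, so the caller does not have to build a `bytes` copy of the response. The names `write_body` and `copy_via_mmap` are hypothetical and not part of any existing API.

```python
import mmap


def write_body(path, data):
    """Write any object that supports the buffer protocol to `path`.

    `data` can be bytes, bytearray, memoryview, mmap.mmap, and so on.
    memoryview() raises TypeError for objects without buffer support.
    """
    with memoryview(data) as view, open(path, "wb") as f:
        f.write(view)


def copy_via_mmap(source_path, dest_path):
    """Usage example: pass a memory-mapped file instead of a bytes object.

    Note that mmap.mmap() rejects empty files, so this assumes a
    non-empty source.
    """
    with open(source_path, "rb") as src:
        with mmap.mmap(src.fileno(), 0, access=mmap.ACCESS_READ) as mapped:
            write_body(dest_path, mapped)
```

The `memoryview` is released before the function returns, so a caller passing an `mmap` can close it afterwards without a `BufferError`.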

itamarst · 2023-10-18T14:53:28Z

Note that a combination of fixing this and #235 in the ways suggested in the previous comment, plus os.replace(), might allow removing the need for locking altogether.
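For reference, the usual `os.replace()` pattern writes to a temporary file in the same directory and then renames it over the destination, so readers see either the old file or the complete new one. This is a generic sketch; `atomic_write` is a hypothetical helper, not an existing cachecontrol function, and whether it is enough to drop locking depends on the rest of the design.

```python
import os
import tempfile


def atomic_write(path, data):
    """Write `data` to `path` so readers never observe a partial file."""
    directory = os.path.dirname(path) or "."
    # The temporary file must be on the same filesystem as the destination
    # for the final rename to be atomic, so create it in the same directory.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".tmp-")
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(data)
            f.flush()
            os.fsync(f.fileno())  # push the contents to disk before renaming
        os.replace(tmp_path, path)
    except BaseException:
        # Clean up the temporary file if anything failed before the rename.
        try:
            os.unlink(tmp_path)
        except FileNotFoundError:
            pass
        raise
```

This makes each single-file write atomic. It does not by itself keep the metadata file and the body file consistent with each other.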

pip doesn't use locking because of its historic reliance on the deprecated lockfile library. They'd vendor the newer filelock if need be, but in general they seem happier with fewer dependencies, since they need to vendor everything.

woodruffw added the bug label Oct 18, 2023

thatch mentioned this issue Feb 19, 2024

Cache deserialization issues cause no versions error hdeps/hdeps#27

Open

thatch mentioned this issue Apr 25, 2024

Race condition in FileCache that can result in empty body 200 response #332

Open
