Cache downloaded wheel when range requests aren't supported #5089

charliermarsh · 2024-07-16T01:56:06Z

Summary

When range requests aren't supported, we fall back to streaming the wheel, stopping as soon as we hit a METADATA file. This is a small optimization, but the downside is that we don't get to cache the resulting wheel...

We don't know whether METADATA will be at the beginning or end of the wheel, but it seems like a better tradeoff to download and cache the entire wheel?

Closes: #5088.

Sort of a revert of: #1792.

charliermarsh · 2024-07-16T02:24:59Z

I'm sort of torn on this.

zanieb · 2024-07-16T04:21:19Z

Would it be a pain to use a heuristic like... only finish downloading if it's greater than some percentage into the file? If we find a metadata entry 10% into the file it seems excessive to unconditionally download the whole wheel when we're trying many versions.

morotti · 2024-07-16T09:27:46Z

We don't know whether METADATA will be at the beginning or end of the wheel, but it seems like a better tradeoff to download and cache the entire wheel?

METADATA are at end of the wheel, as per PEP 407. https://peps.python.org/pep-0427/#recommended-archiver-features

I vaguely recall there was a similar discussion in pip to try to only download metadata (I can't find the thread anymore). They checked many packages and only found one exception to confirm the rule, some google packages had the metadata at the start (around tensorflow if I recall well), because google used their own build system that did its own thing. I think they fixed it a while back.

charliermarsh · 2024-07-16T13:07:16Z

@zanieb - Perhaps... It would be harder to implement for sure. Given that this is not the happy path and PEP 407 recommends putting the archive at the end anyway, I'm inclined to just move forward with it.

charliermarsh force-pushed the charlie/stream branch 2 times, most recently from de63976 to a5855d1 Compare July 16, 2024 02:19

charliermarsh requested review from konstin and zanieb July 16, 2024 02:19

charliermarsh added the performance Potential performance improvement label Jul 16, 2024

charliermarsh marked this pull request as ready for review July 16, 2024 02:21

charliermarsh force-pushed the charlie/stream branch from a5855d1 to 41da6fd Compare July 16, 2024 02:22

konstin approved these changes Jul 16, 2024

View reviewed changes

Cache downloaded wheel when range requests aren't supported

7d92915

charliermarsh force-pushed the charlie/stream branch from 41da6fd to 7d92915 Compare July 16, 2024 13:07

charliermarsh merged commit 2ff3b38 into main Jul 16, 2024
51 checks passed

charliermarsh deleted the charlie/stream branch July 16, 2024 13:21

BrewTestBot mentioned this pull request Jul 17, 2024

uv 0.2.26 Homebrew/homebrew-core#177649

Merged

ewianda mentioned this pull request Aug 15, 2024

uv lock is extremely slow with google artifact registry #6104

Closed

zanieb mentioned this pull request Aug 22, 2024

uv downloads are slow on fallback to streamed wheel downloads #5073

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache downloaded wheel when range requests aren't supported #5089

Cache downloaded wheel when range requests aren't supported #5089

charliermarsh commented Jul 16, 2024 •

edited

Loading

charliermarsh commented Jul 16, 2024

zanieb commented Jul 16, 2024

morotti commented Jul 16, 2024

charliermarsh commented Jul 16, 2024 •

edited

Loading

Cache downloaded wheel when range requests aren't supported #5089

Cache downloaded wheel when range requests aren't supported #5089

Conversation

charliermarsh commented Jul 16, 2024 • edited Loading

Summary

charliermarsh commented Jul 16, 2024

zanieb commented Jul 16, 2024

morotti commented Jul 16, 2024

charliermarsh commented Jul 16, 2024 • edited Loading

charliermarsh commented Jul 16, 2024 •

edited

Loading

charliermarsh commented Jul 16, 2024 •

edited

Loading