-
Notifications
You must be signed in to change notification settings - Fork 551
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hermetic toolchains incur a significant overhead to running py_binary and py_test #1653
Comments
Yeah, the caching can only do so well. The stdlib is a decent size. It's just a lot of files that have to symlinked. I wonder if we'd be able to zip up the stdlib. I'm think that can Just Work. Otherwise, maybe declare_symlink or I really wish we had directories as a first-class concept so Bazel knew to materialize a single symlink, but to monitor the contents of it recursively. |
A few of us were experimenting with symlinks and declare_directory (TreeArtifacts). The result is not really promising. For local strategy actions, it works very well: TreeArtifacts create a single symlink to the directory; very cheap! Unfortunately, sandboxed actions don't do that; they copy the whole directory and/or symlink each individual file. We tried to trick bazel into create a symlink, but no luck. The Some maintainers were talking, and zipping the stdlib is probably the best option available. That would greatly reduce the number of files. A few companies already do this. A PR implementing this would be welcome. Digging around the files in the python install, I also think that maybe 25-50% of them aren't necessary for running a program. e.g., all the vendored pip dependencies and interpreter headers don't need to be in the runfiles to run a binary. On the Bazel side, there is bazelbuild/bazel#8230 which is approximately about the cost of creating large sets of files and is one of the better issues demonstrating that symlinking to a directory would be a big performance benefit; this issue also affects js/node |
This issue has been automatically marked as stale because it has not had any activity for 180 days. It will be closed if no further activity occurs in 30 days. |
This is still relevant |
Question: is using a locally installed platform runtime, instead of a remote downloaded in-build runtime, an appealing option to this problem? I ask because I've merged some of the basic for using a locally installed runtime (one equivalent to what our hermetic runtimes do) in PR #2000. But, we're trying to figure out some of the use cases for it to better understand what public APIs it needs. |
No it is not. Technically it would solve the problem, at least on Linux (I am not working with other platforms). However, besides managing a polyglot workspace Bazel's appeal and big selling point are in my point of view hermeticity and thus reproducibility of actions, which enables the awesome caching performance. A host Interpreter toolchain is to my understanding defeating hermeticity. That much said, I am still very much interested in rules_python offering better support for host Interpreter toolchains. There are some known quirks with the hermetic toolchain. Switching to the host Interpreter is an easy workaround for those (e. g. during debugging). While it is easy enough to set the host interpreter itself, what is missing are the toolchain files defined here, e.g. |
Oh that's perfect. That's exactly the sort of case I had in mind. PR #2000 added the basics to do all that, though the APIs are approximately private. I created #2070 to collect user feedback. I'd be particularly interested in if you're doing anything to automatically locate e.g. headers and libraries to wire them into the toolchain. Inspecting the Python install to figure that out was quite a mystery, and I'm really uncertain what edge cases I missed. |
We are not dynamically discovering headers and libraries. Currently we only support host toolchains for Linux (Ubuntu) and a subset of Python versions. For this use case it was easier to have dedicated BUILD file per toolchain and let users select them explicitly. One caveat is that multiple Debian packages are involved in a full Python toolchain. Not everybody might have |
(back to OP issue) |
I used my artificial benchmark to look into this issue again.
This is great, as this is a free improvement for everybody by simply using a recent Bazel version. |
We now use the lock file for local optimization, but do not check it in yet. We will try out this feature a bit before doing so. Bazel 7.2.1 also offers performance improvements, see: bazelbuild/rules_python#1653 (comment)
We now use the lock file for local optimization, but do not check it in yet. We will try out this feature a bit before doing so. Bazel 7.2.1 also offers performance improvements, see: bazelbuild/rules_python#1653 (comment)
🐞 bug report
Affected Rule
py_binary
andpy_test
Is this a regression?
No
Description
Hermetic toolchains incur a significant overhead to executing Python targets due to the large amount of files which are added to the runfiles tree.
You find an analysis of the issue here: https://github.com/martis42/show_hermetic_python_overhead
Bazel caching of course negates much of this issue in daily working.
Still, whenever a test has to rerun we pay this additional overhead.
I don't know if there is a "perfect" solution possible given how Bazel sandboxing works.
However, it would be great if rues_python would add a note in its documentation for hermetic toolchains that adding
--nolegacy_external_runfiles
to the.bazelrc
file is recommended.Maybe the
rules_python
maintainer can even convince the Bazel maintainer to prioritize working on flipping--legacy_external_runfiles
in Bazel 8 ?🔬 Minimal Reproduction
You see reproduce this with this small example workspace https://github.com/martis42/show_hermetic_python_overhead
🔥 Exception or Error
NA
🌍 Your Environment
Operating System:
Output of
bazel version
:Rules_python version:
Anything else relevant?
#1624 might be related.
It is however maybe Windows specific.
The text was updated successfully, but these errors were encountered: