-
Notifications
You must be signed in to change notification settings - Fork 60
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
implicit libnuma.so.1 dependency added in new ucx-1.11.1 in conda package #790
Comments
It was always the intent to depend on libnuma, see https://github.com/rapidsai/ucx-split-feedstock/blob/master/recipe/install_ucx.sh#L19 , and our users were always instructed to install it from their OS package manager. However, recently a bug was discovered and fixed in openucx/ucx#6782 that would not enable NUMA when passed explicitly as we do in our conda recipes. In previous UCX 1.9 packages we didn't have that dependency due to the UCX bug above, and in new UCX 1.11 packages we do, as it's important for UCX in certain systems. Apparently, we could depend on https://anaconda.org/conda-forge/numactl-libs-cos7-x86_64 to resolve that dependency in conda directly, but because it's specifically targeted at CentOS 7, I'm not sure whether it's a reliable package for the general user, any thoughts here @jakirkham @raydouglass ? One thing I noticed is that RAPIDS 21.08 wasn't pinned to UCX 1.9, which causes a new environment to pick UCX 1.11 (which wasn't supported back then), so if we still want to support RAPIDS <= 21.08, we must pin UCX 1.9 or instruct users to specify |
@pentschev Ok, that's good to know. So rapids <=21.08 shouldn't be used with ucx1.11 then, I should go back to ucx1.9? I was also hit by this then, since the new conda solution upgraded ucx to 1.11 and I just assumed this was ok and was trying to resolve the libnuma issue to make that work. |
That's right.
No, this wasn't predicted. Recently we started pinning some libraries to a maximum version, I believe we should do the same with UCX. |
No that's a CDT that is just using a vendored package from CentOS 7. It's only used at build time to make our build tooling happy. Would not use that at runtime. At present users should continue to install this from an OS system package manager. |
@jakirkham but that means the ucx package is not consistent with anything else in conda land. For no other packages do i have to install something on the OS natively separately except nvidia drivers. This this is quite a big awkward change. Means the conda setup is not self-contained like it should be. |
I know it is not ideal. Unfortunately We are discussing internally to see if there are ways to improve the situation to make this less of a pain to deploy. Edit: Also we raised this issue ( openucx/ucx#4570 ) previously to discuss making |
ldd shows:
for the
after building non-conflicting conda solution.
This is a new dependency that was not present prior to 3 days ago when new ucx was uploaded to https://anaconda.org/rapidsai/ucx/files
I expect it is a mistake that this dependency was forced since no corresponding package dependency installs libnuma
So one now gets things like:
The text was updated successfully, but these errors were encountered: