Skip to content

libdevice not found at ./libdevice.10.bc error with tensorflow on GPU node [SOLVED] #61

Answered by jbusecke
jbusecke asked this question in Q&A
Discussion options

You must be logged in to vote

The issue for this error is a missing cuda library. Apparently due to licensing issues this cannot be included with the current pangeo-docker-image.

The solution (as suggested here https://github.com/pangeo-data/pangeo-docker-images/blob/614419aa55eea9200876357626eb498b17a27755/README.md?plain=1#L173) is to manually install cuda-nvvm, with

mamba install -c nvidia cuda-nvcc

You need to additionally set the cuda directory as environment variable in your notebook:

import os
os.environ['XLA_FLAGS'] = '--xla_gpu_cuda_data_dir=/srv/conda/envs/notebook'

Replies: 1 comment

Comment options

jbusecke
Apr 24, 2023
Maintainer Author

You must be logged in to vote
0 replies
Answer selected by jbusecke
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant