Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wrong device specification for host to device in xla runners #2103

Open
PikaCat-OuO opened this issue Dec 27, 2024 · 0 comments
Open

Wrong device specification for host to device in xla runners #2103

PikaCat-OuO opened this issue Dec 27, 2024 · 0 comments

Comments

@PikaCat-OuO
Copy link

PikaCat-OuO commented Dec 27, 2024

#2081 is not fixing the issue.

Example error:
./lc0 benchmark --backend-opts="plugin_path=/kaggle/working/libtpu.so,max_batch=4,steps=1,datatype=bf16,device=1" --minibatch-size=4 --max-prefetch=0

Unhandled exception in worker thread: Buffer passed to Execute() as argument 0 to replica 0 is on device TPU_0(process=0,(0,0,0,0)), but replica is assigned to device TPU_1(process=0,(0,0,0,1)).

This line

devices_[0].get())

should be changed to:

devices_.at(device_).get()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant