Apologies for being the only one asking questions, but we love the project and think it will really help us evaluate different LLMs for our use cases. We moved out of Colab to make things easier to debug and are now trying to install on our own hardware: a server with a 3070 desktop card and 64 GB of RAM. We can launch the Chat UI without a problem, but from there we run into the following RuntimeError. Any thoughts?
Similar issues can be found on Stack Overflow; I think your model is loaded on the CPU instead of your GPU: https://stackoverflow.com/questions/73530569/pytorch-matmul-runtimeerror-addmm-impl-cpu-not-implemented-for-half
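The linked answer comes down to a dtype/device mismatch: half-precision (`float16`) matmuls are not implemented on CPU in PyTorch, so a model loaded in half precision must actually end up on the GPU. A minimal sketch of the idea (assuming PyTorch; the tensors and names here are illustrative, not from your setup):

```python
import torch

# "addmm_impl_cpu_ not implemented for 'Half'" is raised when a float16
# matmul runs on the CPU. Choose dtype and device together so half
# precision is only used when CUDA is actually available:
use_gpu = torch.cuda.is_available()
device = "cuda" if use_gpu else "cpu"
dtype = torch.float16 if use_gpu else torch.float32

a = torch.randn(2, 3, dtype=dtype, device=device)
b = torch.randn(3, 4, dtype=dtype, device=device)
out = a @ b  # would raise the RuntimeError if dtype were float16 on CPU
print(out.shape)
```

For a model, the equivalent check is making sure it is moved with `model.to("cuda")` (or loaded with a CUDA device in the first place) before running half-precision inference; falling back to `float32` is the usual workaround when the model must stay on CPU.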