-
-
Notifications
You must be signed in to change notification settings - Fork 146
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dockerize cake #30
base: main
Are you sure you want to change the base?
Dockerize cake #30
Conversation
@derogab thank you, this is cool! however in all the containers acceleration is disabled |
Yes, right 😕 I have never had a hardware or applications worth activating it for (😅) and until now I had not bothered with it. I understand that you may need to map your hardware into the container at runtime, but this is not standard. Do you think it is enough to add this warning "in all the containers acceleration is disabled" in the README? In the meantime, however, I will try to understand it better. Especially if changes to the |
Well it's a bit more complex than that. Since the code is compiled within the container, no acceleration is found and so no acceleration is compiled. I think the containers require something like this (at least the workers) https://sarus.readthedocs.io/en/stable/user/custom-cuda-images.html The main point of this project is using acceleration (the inference is already slowed down by the fact it is distribute, without acceleration it is just unusable), so it doesn't make a lot of sense to have containers without it. |
Got it, thank you! I'll try to see what I can do. |
News
New docker commands
Splitting the Model:
Run a worker node:
Run a master node with an OpenAI compatible REST API: