
@derogab thank you, this is cool! However, acceleration is disabled in all the containers.

Yes, right 😕 I have never had hardware or applications worth activating it for (😅), so until now I had not bothered with it. I understand that you may need to map your hardware into the container at runtime, but this is not standard. Do you think it is enough to add this warning, "in all the containers acceleration is disabled", to the README? In the meantime, however, I will try to understand it better. Especially if changes to the

Well, it's a bit more complex than that. Since the code is compiled inside the container, no acceleration is detected at build time, and so no acceleration support is compiled in. I think the containers (at least the workers) require something like this: https://sarus.readthedocs.io/en/stable/user/custom-cuda-images.html The main point of this project is using acceleration (inference is already slowed down by being distributed, and without acceleration it is just unusable), so it doesn't make a lot of sense to have containers without it.

Got it, thank you! I'll try to see what I can do.
News
New docker commands
Splitting the Model:
Run a worker node:
Run a master node with an OpenAI compatible REST API: