You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
thanks! We are just poking around, checking if the 70B model provides better output.
After some testing, it seems that the 8B one is already very capable though.
Actually in out application, a timely response from the LLM is crucial, so if 70B is slower in that regard it's indeed a worse choice.
Thanks for helping out!
Hi,
When I load the model into 4 gpus with model parallelism:
It gives the below error:
The text was updated successfully, but these errors were encountered: