Faster loading of the model #85
Comments
Great idea. We prefer to not use
and TBB? I remember that the mold linker project also uses it.
Not familiar with TBB, but most likely the answer is no
I have some experiments with optimizing large file read I/O in https://gist.github.com/kig/357a4193be54915d142f1db6063bc929 and https://github.com/kig/fast_read_optimizer if you want to overkill it...
Has this been implemented yet?
I was playing with the 65B model, and it took a minute to read the files. If you wrap the model loader loop with a
#pragma omp parallel for
and add -fopenmp
to the compiler flags, you can drop it to 18 seconds.
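For anyone who wants to try the idea in isolation, here is a minimal sketch of that pattern. It is not the project's actual loader: `read_part` and `load_parts` are hypothetical names, and it assumes the model is split across several files that can be read independently. Each loop iteration writes only to its own slot, so no locking is needed.

```cpp
#include <fstream>
#include <string>
#include <vector>

// Hypothetical helper: read one file fully into memory.
// Returns an empty buffer if the file cannot be opened or is empty.
static std::vector<char> read_part(const std::string& path) {
    std::ifstream in(path, std::ios::binary | std::ios::ate);
    if (!in) return {};
    const std::streamsize size = in.tellg();
    if (size <= 0) return {};
    std::vector<char> buf(static_cast<size_t>(size));
    in.seekg(0, std::ios::beg);
    in.read(buf.data(), size);
    return buf;
}

// Hypothetical loader loop: one iteration per file.
// With -fopenmp the iterations run on multiple threads; without it the
// pragma is ignored and the loop simply runs serially.
static std::vector<std::vector<char>> load_parts(const std::vector<std::string>& paths) {
    std::vector<std::vector<char>> parts(paths.size());
    const int n = static_cast<int>(paths.size());
    #pragma omp parallel for
    for (int i = 0; i < n; ++i) {
        parts[i] = read_part(paths[i]);  // each thread touches only parts[i]
    }
    return parts;
}
```

Build with something like `g++ -O2 -fopenmp loader.cpp` (the file name is a placeholder). How much this helps will likely depend on the storage device and on how much of the load time is parsing rather than raw I/O, so treat the 18 second figure above as one data point from my machine.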