-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ggml : implement a spellcheck model (xfspell, t5-spellchecker, etc) #233
Comments
Great idea - we should do that! |
It seems there are other, less niche models for spelling correction, like t5-spellchecker or other BERT-based models. Since there's been some work on T5 and there BERT.cpp (which does not yet support decoding), unless this model outperforms it in quality, ease of implementation, or resource usage, efforts can be directed to these two. |
Ok, will add this to the roadmap to get some extra attention |
I would like to give this a try. |
While I was trying to figure out how to convert a small pytorch based model to ggml, I've found this thread. I wanted to emphasize that small models (sub 1gig) exist, Thank you. |
I would like to finish this implementation, do any of the people who have already attempted have any recommendations? |
@Ferruolo please go head! |
Should changes go to LLAMA.cpp or GGML? |
Depends on the interface that will be exposed, but I suppose the |
I checked t5-base-spellchecker and it works with #8141:
|
just posted it here ggml-org/llama.cpp#8204 , but there is now an example of deployed ggml spellchecking AND on-device finetuning ! |
Apple had recently announced a new transformer-based keyboard auto-correct and prediction.
xfspell seems to be an existing model that tried doing it, so why not investigate if it can be ported to GGML. If anyone know other models for predictive keyboard or auto correct, please drop your suggestions here.
Perhaps this may even be a good test case for on-device QLoRA fine-tuning.
High quality predictive keyboards and auto-correct in pure C++ can be a useful thing for open-source mobile operating systems like Ubuntu Touch and privacy-focused Android ROMs, because traditionally, such proposals got rejected because of excessive dependencies for ML inference.
The text was updated successfully, but these errors were encountered: