LSI backend #201
Comments
Evaluation results with the code in #219 were so bad that I don't think it makes sense to continue in this direction. LSI makes more sense when there are no predefined subjects. It might still be useful for small classifications though.
Here are the evaluation results:

2018-11-27 LSI model for Annif
Created first implementation of LSI model.
lsi-fi-100 model built in ~35 min CPU time (with some parallel processing)
Evaluated on kirjastonhoitaja (tfidf f1@5=0.22):
Not very promising…
We are currently using Gensim only for the basic TF-IDF backend. It should be almost trivial to create an LSI backend: it's just one extra LsiModel layer on top of TF-IDF and a single parameter (the number of dimensions).
LDA would be possible too, but I'll leave that for another issue.