Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incremental / online learning based on user feedback #225

Closed
osma opened this issue Dec 14, 2018 · 1 comment
Closed

Incremental / online learning based on user feedback #225

osma opened this issue Dec 14, 2018 · 1 comment
Milestone

Comments

@osma
Copy link
Member

osma commented Dec 14, 2018

Right now all models are built once and then remain static. But at least for tfidf and pav backends it would be possible to improve the model over time based on user feedback, i.e. documents that were analyzed, a human reviewed the result and rejected some subjects (false positives) and/or added new ones (false negatives).

This issue is about providing the infrastructure for such functionality and the CLI commands and REST API methods. I will open separate issues for adding online learning support for each backend.

The CLI command could be

annif learn <projectid> <corpus>

(still thinking about what would be the best command name, maybe also increment[al] or retrain) learn is fine, used by e.g. Vowpal Wabbit

The REST API method could be (for a single command)

POST /projects/<projectid>/learn

with parameters corresponding to the CLI command.

Ensemble style backends (currently ensemble and pav) should propagate the learn operation first to the source projects, reanalyze the document, and only then update their own model.

@osma
Copy link
Member Author

osma commented Feb 27, 2019

Implemented in #257

@osma osma closed this as completed Feb 27, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant