-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
get_latest_training_loss returns 0 #3228
Comments
#2658 is closed as a duplicate, because #2617 is a more comprehensive discussion of what broken (or simply never implemented) in the *2Vec models. The docs are wrong to imply there's any loss-tallying in |
I see. But if loss training has never been implemented, how do we know if the training needs to be early stopped, or if the training needs more epochs? |
You'd have to use other heuristics. AFAIK, neither the original Google They do each, however, show a running loss that a user can watch for hints. It's definitely a desirable feature to have - hence the many requests, & partial/buggy implementation inside Gensim's (Note, though, running-loss is also somewhat prone to misinterpretation, with some people thinking it's an accurate measure of model quality for other purposes, and that, of a set of candidate models, the one with the lowest loss will work best for outside purposes. That's not inherently the case, as it's just a report on the model's internal training goal. That internal goal is, if all sorts of other things are also done right, at best only an approximation of fitness for the real external purposes where people use word-vectors. For example, a massively-'overfit' model can have an arbitrarily low training loss, while being entirely useless for other tasks.) |
After reading out some replies here and on stackoverflow, I'm aware that loss-tallying is yet to be implemented. I'm running
Why the |
Gensim *2Vec model loss-tallying is... But since |
Shouldn't we raise NotImplementedError instead of returning zero? It'd be less surprising for the user. |
That'd be better than the current mysteriously-incomplete behavior! But such hard failures should start as soon as the user takes any step guaranteed to disappoint - such as initializing a model that can't track loss with |
Problem description
It seems that the
get_latest_training_loss
function infasttext
returns only 0. Both gensim 4.1.0 and 4.0.0 do not work.If currently FastText does not support
get_latest_training_loss
, the documentation here needs to be removed:https://radimrehurek.com/gensim/models/fasttext.html#gensim.models.fasttext.FastText.get_latest_training_loss
Versions
I have tried this in three different environments and neither of them works.
First environment:
Second environment:
Third environment:
The text was updated successfully, but these errors were encountered: