Fix infinite diff in `LdaModel.do_mstep` #2344

horpto · 2019-01-20T16:55:00Z

Partially fixes #416
Partially fixes #2051

This PR fixes the infinite diff in the output; it works for me. The bug is caused by large negative values in self.state.sstats combined with a narrow dtype. I am only trying to fix this one bug: self.expElogbeta can still contain zeros. That can be addressed by specifying a wider dtype, for example np.float64. I did not widen the dtype of self.expElogbeta because, as far as I know, there is a trade-off between memory consumption, file sizes, and bytes sent on one side and precision on the other. Users should choose the dtype that fits their own case.
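The failure mode described above can be reproduced with plain NumPy. The sketch below is an illustration, not the code from this PR: the function names and the sample values are made up, and it only assumes that the diff was at some point computed by taking the log of an exponentiated array. It shows that a large negative value underflows to zero in float32, that the log of that zero is -inf, and that a difference taken directly in log space stays finite.

```python
import numpy as np


def roundtrip_diff(elogbeta, dtype):
    """Mean |log(exp(x)) - x| after casting x to the given dtype."""
    x = np.asarray(elogbeta, dtype=dtype)
    exp_x = np.exp(x)  # may underflow to exactly 0.0 in a narrow dtype
    with np.errstate(divide="ignore"):  # silence the log(0) warning
        recovered = np.log(exp_x)  # log(0) evaluates to -inf
    return float(np.mean(np.abs(recovered - x)))


def log_space_diff(previous, current):
    """Mean absolute difference computed without leaving log space."""
    return float(np.mean(np.abs(np.asarray(current) - np.asarray(previous))))


# Made-up values; the last one is too negative for float32 exponentiation.
elogbeta = [-1.5, -50.0, -200.0]

diff32 = roundtrip_diff(elogbeta, np.float32)  # non-finite
diff64 = roundtrip_diff(elogbeta, np.float64)  # finite and tiny

previous = np.array(elogbeta, dtype=np.float32)
current = previous - np.float32(0.25)
safe32 = log_space_diff(previous, current)  # finite even in float32

print(diff32, diff64, safe32)
```

Widening the dtype pushes the underflow threshold much further out but does not remove it, which matches the remark that self.expElogbeta can still contain zeros.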

gensim/models/ldamodel.py

menshikh-iv · 2019-01-23T14:53:21Z

ping @johann-petrak @stevemarin @TC-Rudel

Guys, this change should fix the diff=inf/nan issue (from #416). Can anyone check whether the current PR fixes your problem?

1. Clone horpto's fork: `git clone git@github.com:horpto/gensim.git && cd gensim`
2. Install gensim from the inf-mstep branch: `git checkout inf-mstep && pip install -e . && python setup.py build_ext --inplace`
3. Check that diff=nan no longer appears for you.
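For the last step, a training log can be scanned for non-finite diffs instead of reading it by eye. This is a minimal sketch using only the standard library. It assumes the log lines contain text of the form `topic diff=<value>, rho=<value>`; the helper name and the sample lines are hypothetical, so adjust the pattern to whatever your logs actually contain.

```python
import math
import re

# Assumed line format: "... topic diff=<float>, rho=<float>"
_DIFF_RE = re.compile(r"topic diff=(\S+), rho=")


def non_finite_diffs(log_lines):
    """Return the log lines whose reported topic diff is inf or nan."""
    bad = []
    for line in log_lines:
        match = _DIFF_RE.search(line)
        if match is None:
            continue  # not a diff line
        try:
            value = float(match.group(1))
        except ValueError:
            continue  # unexpected token, skip rather than guess
        if not math.isfinite(value):
            bad.append(line)
    return bad


# Made-up sample lines for illustration only.
sample = [
    "INFO : topic diff=0.412345, rho=1.000000",
    "INFO : topic diff=inf, rho=0.577350",
    "INFO : topic diff=nan, rho=0.500000",
    "INFO : merging changes from 2000 documents",
]
flagged = non_finite_diffs(sample)
print(len(flagged))
```

An empty result on a log from the inf-mstep branch would suggest the problem is gone for that run.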

menshikh-iv · 2019-01-28T02:42:26Z

Thanks @horpto

horpto added 2 commits January 20, 2019 21:52

Fix piskvorky#416, piskvorky#2051: Infinite diff in LdaModel.do_mstep

5659054

fix build

79196c8

menshikh-iv suggested changes Jan 23, 2019

menshikh-iv changed the title ~~Fix #416, #2051: Infinite diff in LdaModel.do_mstep~~ Fix infinite diff in LdaModel.do_mstep Jan 23, 2019

menshikh-iv merged commit 179a2c1 into piskvorky:develop Jan 28, 2019

menshikh-iv mentioned this pull request Jan 29, 2019

Redundant get_Elogbeta calls in LdaModel #2051

Closed
