From 7388d54a6eb801146407c4b002712ec4a1783257 Mon Sep 17 00:00:00 2001 From: Menshikh Ivan Date: Thu, 1 Feb 2018 15:08:23 +0500 Subject: [PATCH] Remove outdated bz2 examples from tutorials[2] (#1868) * Revert "Remove outdated `bz2` + `MmCorpus` examples from tutorials (#1867)" This reverts commit 5342153eb4f4b02bb45bfa3951eef8250ac9f6b6. * remove bz2 wrapper * remove bz2 wrapper[2] --- docs/src/dist_lsi.rst | 1 + docs/src/wiki.rst | 2 ++ 2 files changed, 3 insertions(+) diff --git a/docs/src/dist_lsi.rst b/docs/src/dist_lsi.rst index e80ca3809d..15dfb41f9c 100644 --- a/docs/src/dist_lsi.rst +++ b/docs/src/dist_lsi.rst @@ -127,6 +127,7 @@ the corpus iterator with:: >>> id2word = gensim.corpora.Dictionary.load_from_text('wiki_en_wordids.txt') >>> # load corpus iterator >>> mm = gensim.corpora.MmCorpus('wiki_en_tfidf.mm') + >>> # mm = gensim.corpora.MmCorpus('wiki_en_tfidf.mm.bz2') # use this if you compressed the TFIDF output >>> print(mm) MmCorpus(3199665 documents, 100000 features, 495547400 non-zero entries) diff --git a/docs/src/wiki.rst b/docs/src/wiki.rst index 47aeaa34fd..2992cf8401 100644 --- a/docs/src/wiki.rst +++ b/docs/src/wiki.rst @@ -45,6 +45,7 @@ First let's load the corpus iterator and dictionary, created in the second step >>> id2word = gensim.corpora.Dictionary.load_from_text('wiki_en_wordids.txt') >>> # load corpus iterator >>> mm = gensim.corpora.MmCorpus('wiki_en_tfidf.mm') + >>> # mm = gensim.corpora.MmCorpus('wiki_en_tfidf.mm.bz2') # use this if you compressed the TFIDF output (recommended) >>> print(mm) MmCorpus(3931787 documents, 100000 features, 756379027 non-zero entries) @@ -99,6 +100,7 @@ As with Latent Semantic Analysis above, first load the corpus iterator and dicti >>> id2word = gensim.corpora.Dictionary.load_from_text('wiki_en_wordids.txt') >>> # load corpus iterator >>> mm = gensim.corpora.MmCorpus('wiki_en_tfidf.mm') + >>> # mm = gensim.corpora.MmCorpus('wiki_en_tfidf.mm.bz2') # use this if you compressed the TFIDF output >>> print(mm) MmCorpus(3931787 documents, 100000 features, 756379027 non-zero entries)