From d4ead46854a55d55b63713a8d280f8e382c3e196 Mon Sep 17 00:00:00 2001 From: Arjan van de Ven Date: Sun, 5 Aug 2018 22:46:17 +0000 Subject: [PATCH] [update to 3.5.0] Allow initialization with `max_final_vocab` in lieu of `min_count` for `gensim.models.Word2Vec`. Fix #465 (#1915) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Andrey Kutuzov (2): Fix OOV pairs counter in `WordEmbeddingsKeyedVectors.evaluate_word_pairs` (#1934) Add `evaluate_word_analogies` (will replace `accuracy`) method for `gensim.models.KeyedVectors` (#1935) Aneesh Joshi (3): Add windows venv activate command to `CONTRIBUTING.md` (#1880) Fix deprecation warning from `inspect.getargspec`. Fix #1878 (#1887) Allow initialization with `max_final_vocab` in lieu of `min_count` for `gensim.models.Word2Vec`. Fix #465 (#1915) Chaitali Saini (1): Update rules for removing table markup from Wikipedia dumps. Fix #1710 (#1954) Dennis.Chen (1): Fix inheritance chain for `load_word2vec_format` (return correct class in case when you create an child class based on kv) (#1968) Dmitry (5): Refactor API reference `gensim.corpora`. Partial fix #1671 (#1835) Refactor documentation for `gensim.similarities.docsim` and `MmCorpus-related`. (#1910) Refactor documentation for `gensim.models.coherencemodel` (#1933) Refactor documentation for `gensim.models.phrases` (#1950) Fix format & links for `gensim.similarities.docsim` (#2030) Dmitry Persiyanov (1): Add `gensim.models.BaseKeyedVectors.add_entity` method for fill `KeyedVectors` in manual way. Fix #1942 (#1957) Fernando Camargo (1): Add `ns_exponent` parameter to control the negative sampling distribution for `*2vec` models. Fix #2090 (#2093) Gordon Mohr (1): Fix `Doc2Vec.infer_vector`, notebook cleanup (#2103) Gyanesh Malhotra (1): Fix docstrings for`gensim.models.hdpmodel`, `gensim.models.lda_worker` & `gensim.models.lda_dispatcher` (#1912) Ibrahim Sharaf ElDen (1): Store images from `README.md` directly in repository. Fix #1849 (#1861) Ivan Menshikh (2): Fix PEP8 in `HashDictionary` Disable google-style docstring support. Fix #1663 (#2106) Jayant Jain (1): Fix negative sampling floating-point error for `gensim.models.PoincareModel`. Fix #1917 (#1959) Johannes Baiter (1): Fix method `estimate_memory` from `gensim.models.FastText` & huge performance improvement. Fix #1824 (#1916) Jonathan Hourany (1): Fixed Typo and increased performance in analyze_sentence (#2070) Kento NOZAWA (1): Fix example block for `gensim.models.Word2Vec` (#1876) Kumar Akshay (1): Fix documentation for `gensim.models.wrappers` (#1859) Menshikh Ivan (5): Fix `test_similarities.py` (#1928) Add flag for skip network-related tests (#1930) Fix encoding in Lee corpus reader (#1931) Fix Keras version (avoid bug from `keras==2.1.5`) (#1963) Fix quoting that break `doc2vec-IMDB` notebook Mohit Rathore (1): Add Pivot Normalization for `gensim.models.TfidfModel`. Fix #220 (#1780) Mritunjay Mohitesh (1): Fix deprecated parameters in `D2VTransformer` and `W2VTransformer`. Fix #1937 (#1945) Nils Werner (1): Add license field to `setup.py` (#1909) Oliver Price (1): Fix return dtype for `matutils.unitvec` according to input dtype. Fix #1722 (#1992) Orion Montoya (1): Fix parameter description of `sg` parameter for `gensim.models.word2vec` (#1919) Pete Bleackley (1): Fix SMART from TfidfModel for case when `df == "n"`. Fix #2020 (#2021) Pushpankar Kumar Pushp (1): Fix datatype parameter for `KeyedVectors.load_word2vec_format`. Fix #1682 (#1819) Radim Řehůřek (8): fix logging formatting in downloader fixes to HashDictionary more fixes to broken formatting minor wording change Merge pull request #2073 from RaRe-Technologies/hashdictionary_docs Fix documentation for `*2vec` models (#2087) Fix documentation for various modules (#2096) Update non-API docs (about, intro, etc) (#2101) Rob Malouf (1): Fix `_is_single` from `Phrases` for case when corpus is numpy array (#1987) Samyak Jain (2): Fix empty output bug in `Phrases`. Fix #1401 (#1853) Fix file-like closing bug from `gensim.corpora.MmCorpus`. Fix #1869 (#1911) Sharan Yalburgi (3): Add anaconda-cloud badge. Partial fix #1901 (#1905) Add method that show base installation info of Gensim & related packages. Fix #1902 (#1903) Replace open() with smart_open() in notebooks. Fix #1789 (#1812) Shiva Manne (4): Add `wv` property to KeyedVectors (for backward compatibility). Fix #1882 (#1884) Adds `LabeledSentence` to `gensim.models.doc2vec` (for backward compatibility). Fix #1886 (#1891) Fix `Doc2Vec.infer_vector` after loading old `Doc2Vec` (`gensim<=3.2`). Fix #1952 (#1974) Fixes issues while loading `word2vec` and `doc2vec` models saved using old Gensim versions. Fix #2000, #1977 (#2012) Sourav Singh (1): Fix docstrings for `gensim.models.AuthorTopicModel` (#1907) Stamenov (1): Add inference for new unseen author for `gensim.models.AuthorTopicModel` (#1766) Stergiadis Manos (4): Fix docstrings for lsi-related code (#1892) Fix docstrings for `gensim.sklearn_api`. Fix #1667 (#1895) Document LDA-related models (#2026) Allow pass empty dictionary to `gensim.corpora.WikiCorpus`. Fix #2052 (#2042) TheFlash10 (1): Fix deprecated parameters in doc2vec-lee notebook (#1918) Umang Varma (1): Fix linear decay for learning rate in `Doc2Vec.infer_vector`. Fix #2061 (#2063) Utkarsh Mishra (1): Fix `D2VTransformer.fit_transform`. Fix #1834 (#1845) Vít Novotný (4): Implement Soft Cosine Measure (#1827) Fix misinformation in docstring of `gensim.models.KeyedVectors.similarity_matrix`. Fix #1960 (#1971) Fix `SoftCosineSimilarity.get_similarities` on corpora. Fix #1955 (#1972) Fix tests for `EuclideanKeyedVectors.similarity_matrix`. Fix #1961 (#1984) Yuri Isakov (3): Refactor docstrings for `gensim.scripts`. Partial fix #1665 (#1792) Fix docstrings for `gensim.test.utils` (#1904) Fix docstrings for `gensim.interfaces` (#1913) arlenk (3): Add Cython version of `MmReader` (#1825) Add cython version for "hot" functions from `gensim.models.LdaModel` (#1767) Fix OverflowError when loading a large term-document matrix in MatrixMarket format. Fix #1998 (#2001) bohea (1): Fix bug in `Similarity.query_shards` in multiprocessing case (#2044) darindf (3): Fix python 3 compatibility for `gensim.corpora.UciCorpus.save_corpus` (#1875) Remove duplication of class documentation for `IndexedCorpus` (#2033) Add `dtype` argument for `chunkize_serial`, fix `LdaModel` (#2027) ivan (12): Merge branch 'master' into develop bump version to 3.4.0 regenerate C files with cython==0.27 Merge branch 'release-3.4.0' Merge branch 'master' into develop apply fixes for distributed mode lda/lsi from @piskvorky #2102 remove smart_open limitation from setup.py, replace smart_open -> open until https://github.com/RaRe-Technologies/smart_open/issues/207 will be fixed fix PEP8 issues bump version to 3.5.0 bump changelos to 3.5.0 + add missing changelog for 3.4.0 regenerated C files with Cython Merge branch 'release-3.5.0' numericlee (1): Fix `doc2vec-lee` notebook (#1870) --- .gitignore | 13 +++++++++++++ Makefile | 2 +- gensim.spec | 34 +++++++++++++++++++++++----------- options.conf | 2 +- release | 2 +- upstream | 2 +- 6 files changed, 40 insertions(+), 15 deletions(-) create mode 100644 .gitignore diff --git a/.gitignore b/.gitignore new file mode 100644 index 0000000..fd14cc3 --- /dev/null +++ b/.gitignore @@ -0,0 +1,13 @@ +.*~ +*~ +*.swp +.repo-index +*.log +build.log.round* +*.tar.* +*.tgz +!*.tar.*.* +*.zip +commitmsg +results/ +rpms/ diff --git a/Makefile b/Makefile index 44d8521..ab212a1 100644 --- a/Makefile +++ b/Makefile @@ -1,5 +1,5 @@ PKG_NAME := gensim -URL = http://pypi.debian.net/gensim/gensim-3.3.0.tar.gz +URL = https://files.pythonhosted.org/packages/d1/8d/f20e715f3eae5a277b13a31d440d65f294fadbc2047c4d02226e1de05b6e/gensim-3.5.0.tar.gz ARCHIVES = include ../common/Makefile.common diff --git a/gensim.spec b/gensim.spec index 5304d16..88fc116 100644 --- a/gensim.spec +++ b/gensim.spec @@ -3,29 +3,27 @@ # Generated by: autospec.py # Name : gensim -Version : 3.3.0 -Release : 27 -URL : http://pypi.debian.net/gensim/gensim-3.3.0.tar.gz -Source0 : http://pypi.debian.net/gensim/gensim-3.3.0.tar.gz +Version : 3.5.0 +Release : 28 +URL : https://files.pythonhosted.org/packages/d1/8d/f20e715f3eae5a277b13a31d440d65f294fadbc2047c4d02226e1de05b6e/gensim-3.5.0.tar.gz +Source0 : https://files.pythonhosted.org/packages/d1/8d/f20e715f3eae5a277b13a31d440d65f294fadbc2047c4d02226e1de05b6e/gensim-3.5.0.tar.gz Summary : Python framework for fast Vector Space Modelling Group : Development/Tools License : LGPL-2.1 Requires: gensim-python3 +Requires: gensim-license Requires: gensim-python Requires: numpy +Requires: pytest-rerunfailures Requires: scikit-learn Requires: scipy Requires: six Requires: smart_open Requires: tensorflow BuildRequires : Cython +BuildRequires : buildreq-distutils3 BuildRequires : numpy -BuildRequires : pbr -BuildRequires : pip - -BuildRequires : python3-dev BuildRequires : scipy -BuildRequires : setuptools BuildRequires : six BuildRequires : smart_open @@ -37,6 +35,14 @@ BuildRequires : smart_open |Travis|_ |Wheel|_ +%package license +Summary: license components for the gensim package. +Group: Default + +%description license +license components for the gensim package. + + %package python Summary: python components for the gensim package. Group: Default @@ -56,18 +62,20 @@ python3 components for the gensim package. %prep -%setup -q -n gensim-3.3.0 +%setup -q -n gensim-3.5.0 %build export http_proxy=http://127.0.0.1:9/ export https_proxy=http://127.0.0.1:9/ export no_proxy=localhost,127.0.0.1,0.0.0.0 export LANG=C -export SOURCE_DATE_EPOCH=1528564391 +export SOURCE_DATE_EPOCH=1533509167 python3 setup.py build -b py3 %install rm -rf %{buildroot} +mkdir -p %{buildroot}/usr/share/doc/gensim +cp COPYING %{buildroot}/usr/share/doc/gensim/COPYING python3 -tt setup.py build -b py3 install --root=%{buildroot} echo ----[ mark ]---- cat %{buildroot}/usr/lib/python3*/site-packages/*/requires.txt || : @@ -76,6 +84,10 @@ echo ----[ mark ]---- %files %defattr(-,root,root,-) +%files license +%defattr(-,root,root,-) +/usr/share/doc/gensim/COPYING + %files python %defattr(-,root,root,-) diff --git a/options.conf b/options.conf index 4c649ad..250428e 100644 --- a/options.conf +++ b/options.conf @@ -1,6 +1,6 @@ [package] name = gensim -url = http://pypi.debian.net/gensim/gensim-3.3.0.tar.gz +url = https://files.pythonhosted.org/packages/d1/8d/f20e715f3eae5a277b13a31d440d65f294fadbc2047c4d02226e1de05b6e/gensim-3.5.0.tar.gz archives = giturl = https://github.com/RaRe-Technologies/gensim.git diff --git a/release b/release index f64f5d8..9902f17 100644 --- a/release +++ b/release @@ -1 +1 @@ -27 +28 diff --git a/upstream b/upstream index e3fdd38..9e5421b 100644 --- a/upstream +++ b/upstream @@ -1 +1 @@ -54bfeade5310d1dc474d84e6cb165200bad0b0c8/gensim-3.3.0.tar.gz +6f58b91a1c6f9d3cacec632457253961dfd85be8/gensim-3.5.0.tar.gz