Skip to content
This repository has been archived by the owner on Oct 27, 2022. It is now read-only.

Commit

Permalink
[update to 3.5.0] Allow initialization with max_final_vocab in lieu…
Browse files Browse the repository at this point in the history
… of `min_count` for `gensim.models.Word2Vec`. Fix #465 (#1915)

Andrey Kutuzov (2):
      Fix OOV pairs counter in `WordEmbeddingsKeyedVectors.evaluate_word_pairs` (#1934)
      Add `evaluate_word_analogies` (will replace `accuracy`) method for `gensim.models.KeyedVectors` (#1935)

Aneesh Joshi (3):
      Add windows venv activate command to `CONTRIBUTING.md` (#1880)
      Fix deprecation warning from `inspect.getargspec`. Fix #1878 (#1887)
      Allow initialization with `max_final_vocab` in lieu of `min_count` for `gensim.models.Word2Vec`. Fix #465 (#1915)

Chaitali Saini (1):
      Update rules for removing table markup from Wikipedia dumps. Fix #1710 (#1954)

Dennis.Chen (1):
      Fix inheritance chain for `load_word2vec_format` (return correct class in case when you create an child class based on kv) (#1968)

Dmitry (5):
      Refactor API reference `gensim.corpora`. Partial fix #1671 (#1835)
      Refactor documentation for `gensim.similarities.docsim` and `MmCorpus-related`. (#1910)
      Refactor documentation for `gensim.models.coherencemodel` (#1933)
      Refactor documentation for `gensim.models.phrases` (#1950)
      Fix format & links for `gensim.similarities.docsim` (#2030)

Dmitry Persiyanov (1):
      Add `gensim.models.BaseKeyedVectors.add_entity` method for fill `KeyedVectors` in manual way. Fix #1942 (#1957)

Fernando Camargo (1):
      Add `ns_exponent` parameter to control the negative sampling distribution for `*2vec` models. Fix #2090 (#2093)

Gordon Mohr (1):
      Fix `Doc2Vec.infer_vector`, notebook cleanup (#2103)

Gyanesh Malhotra (1):
      Fix docstrings for`gensim.models.hdpmodel`, `gensim.models.lda_worker` & `gensim.models.lda_dispatcher` (#1912)

Ibrahim Sharaf ElDen (1):
      Store images from `README.md` directly in repository. Fix #1849 (#1861)

Ivan Menshikh (2):
      Fix PEP8 in `HashDictionary`
      Disable google-style docstring support. Fix #1663 (#2106)

Jayant Jain (1):
      Fix negative sampling floating-point error for `gensim.models.PoincareModel`. Fix #1917 (#1959)

Johannes Baiter (1):
      Fix method `estimate_memory` from `gensim.models.FastText` & huge performance improvement. Fix #1824 (#1916)

Jonathan Hourany (1):
      Fixed Typo and increased performance in analyze_sentence (#2070)

Kento NOZAWA (1):
      Fix example block for `gensim.models.Word2Vec` (#1876)

Kumar Akshay (1):
      Fix documentation for `gensim.models.wrappers` (#1859)

Menshikh Ivan (5):
      Fix `test_similarities.py` (#1928)
      Add flag for skip network-related tests (#1930)
      Fix encoding in Lee corpus reader (#1931)
      Fix Keras version (avoid bug from `keras==2.1.5`) (#1963)
      Fix quoting that break `doc2vec-IMDB` notebook

Mohit Rathore (1):
      Add Pivot Normalization for `gensim.models.TfidfModel`. Fix #220 (#1780)

Mritunjay Mohitesh (1):
      Fix deprecated parameters in `D2VTransformer` and `W2VTransformer`. Fix #1937 (#1945)

Nils Werner (1):
      Add license field to `setup.py` (#1909)

Oliver Price (1):
      Fix return dtype for `matutils.unitvec` according to input dtype. Fix #1722 (#1992)

Orion Montoya (1):
      Fix parameter description of  `sg` parameter for `gensim.models.word2vec` (#1919)

Pete Bleackley (1):
      Fix SMART from TfidfModel for case when `df == "n"`. Fix #2020 (#2021)

Pushpankar Kumar Pushp (1):
      Fix datatype parameter for `KeyedVectors.load_word2vec_format`. Fix #1682 (#1819)

Radim Řehůřek (8):
      fix logging formatting in downloader
      fixes to HashDictionary
      more fixes to broken formatting
      minor wording change
      Merge pull request #2073 from RaRe-Technologies/hashdictionary_docs
      Fix documentation for `*2vec` models (#2087)
      Fix documentation for various modules (#2096)
      Update non-API docs (about, intro, etc) (#2101)

Rob Malouf (1):
      Fix `_is_single` from `Phrases` for case when corpus is numpy array (#1987)

Samyak Jain (2):
      Fix empty output bug in `Phrases`. Fix #1401 (#1853)
      Fix file-like closing bug from `gensim.corpora.MmCorpus`. Fix #1869 (#1911)

Sharan Yalburgi (3):
      Add anaconda-cloud badge. Partial fix #1901 (#1905)
      Add method that show base installation info of Gensim & related packages. Fix #1902 (#1903)
      Replace open() with smart_open() in notebooks. Fix #1789 (#1812)

Shiva Manne (4):
      Add `wv` property to KeyedVectors (for backward compatibility). Fix #1882 (#1884)
      Adds `LabeledSentence` to `gensim.models.doc2vec` (for backward compatibility). Fix #1886 (#1891)
      Fix `Doc2Vec.infer_vector` after loading old `Doc2Vec` (`gensim<=3.2`). Fix #1952 (#1974)
      Fixes issues while loading `word2vec` and `doc2vec` models saved using old Gensim versions. Fix #2000, #1977 (#2012)

Sourav Singh (1):
      Fix docstrings for `gensim.models.AuthorTopicModel` (#1907)

Stamenov (1):
      Add inference for new unseen author for `gensim.models.AuthorTopicModel` (#1766)

Stergiadis Manos (4):
      Fix docstrings for lsi-related code (#1892)
      Fix docstrings for `gensim.sklearn_api`. Fix #1667 (#1895)
      Document LDA-related models (#2026)
      Allow pass empty dictionary to `gensim.corpora.WikiCorpus`. Fix #2052 (#2042)

TheFlash10 (1):
      Fix deprecated parameters in doc2vec-lee notebook (#1918)

Umang Varma (1):
      Fix linear decay for learning rate in `Doc2Vec.infer_vector`. Fix #2061 (#2063)

Utkarsh Mishra (1):
      Fix `D2VTransformer.fit_transform`. Fix #1834 (#1845)

Vít Novotný (4):
      Implement Soft Cosine Measure (#1827)
      Fix misinformation in docstring of `gensim.models.KeyedVectors.similarity_matrix`. Fix #1960 (#1971)
      Fix `SoftCosineSimilarity.get_similarities` on corpora. Fix #1955 (#1972)
      Fix tests for `EuclideanKeyedVectors.similarity_matrix`. Fix #1961 (#1984)

Yuri Isakov (3):
      Refactor docstrings for `gensim.scripts`. Partial fix #1665 (#1792)
      Fix docstrings for `gensim.test.utils` (#1904)
      Fix docstrings for `gensim.interfaces` (#1913)

arlenk (3):
      Add Cython version of `MmReader` (#1825)
      Add cython version for "hot" functions from `gensim.models.LdaModel` (#1767)
      Fix OverflowError when loading a large term-document matrix in MatrixMarket format. Fix #1998 (#2001)

bohea (1):
      Fix bug in `Similarity.query_shards` in multiprocessing case (#2044)

darindf (3):
      Fix python 3 compatibility for `gensim.corpora.UciCorpus.save_corpus` (#1875)
      Remove duplication of class documentation for `IndexedCorpus` (#2033)
      Add `dtype` argument for `chunkize_serial`, fix `LdaModel` (#2027)

ivan (12):
      Merge branch 'master' into develop
      bump version to 3.4.0
      regenerate C files with cython==0.27
      Merge branch 'release-3.4.0'
      Merge branch 'master' into develop
      apply fixes for distributed mode lda/lsi from @piskvorky #2102
      remove smart_open limitation from setup.py, replace smart_open -> open until piskvorky/smart_open#207 will be fixed
      fix PEP8 issues
      bump version to 3.5.0
      bump changelos to 3.5.0 + add missing changelog for 3.4.0
      regenerated C files with Cython
      Merge branch 'release-3.5.0'

numericlee (1):
      Fix `doc2vec-lee` notebook (#1870)
  • Loading branch information
fenrus75 committed Aug 5, 2018
1 parent c410ad7 commit d4ead46
Show file tree
Hide file tree
Showing 6 changed files with 40 additions and 15 deletions.
13 changes: 13 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
.*~
*~
*.swp
.repo-index
*.log
build.log.round*
*.tar.*
*.tgz
!*.tar.*.*
*.zip
commitmsg
results/
rpms/
2 changes: 1 addition & 1 deletion Makefile
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
PKG_NAME := gensim
URL = http://pypi.debian.net/gensim/gensim-3.3.0.tar.gz
URL = https://files.pythonhosted.org/packages/d1/8d/f20e715f3eae5a277b13a31d440d65f294fadbc2047c4d02226e1de05b6e/gensim-3.5.0.tar.gz
ARCHIVES =

include ../common/Makefile.common
34 changes: 23 additions & 11 deletions gensim.spec
Original file line number Diff line number Diff line change
Expand Up @@ -3,29 +3,27 @@
# Generated by: autospec.py
#
Name : gensim
Version : 3.3.0
Release : 27
URL : http://pypi.debian.net/gensim/gensim-3.3.0.tar.gz
Source0 : http://pypi.debian.net/gensim/gensim-3.3.0.tar.gz
Version : 3.5.0
Release : 28
URL : https://files.pythonhosted.org/packages/d1/8d/f20e715f3eae5a277b13a31d440d65f294fadbc2047c4d02226e1de05b6e/gensim-3.5.0.tar.gz
Source0 : https://files.pythonhosted.org/packages/d1/8d/f20e715f3eae5a277b13a31d440d65f294fadbc2047c4d02226e1de05b6e/gensim-3.5.0.tar.gz
Summary : Python framework for fast Vector Space Modelling
Group : Development/Tools
License : LGPL-2.1
Requires: gensim-python3
Requires: gensim-license
Requires: gensim-python
Requires: numpy
Requires: pytest-rerunfailures
Requires: scikit-learn
Requires: scipy
Requires: six
Requires: smart_open
Requires: tensorflow
BuildRequires : Cython
BuildRequires : buildreq-distutils3
BuildRequires : numpy
BuildRequires : pbr
BuildRequires : pip

BuildRequires : python3-dev
BuildRequires : scipy
BuildRequires : setuptools
BuildRequires : six
BuildRequires : smart_open

Expand All @@ -37,6 +35,14 @@ BuildRequires : smart_open
|Travis|_
|Wheel|_

%package license
Summary: license components for the gensim package.
Group: Default

%description license
license components for the gensim package.


%package python
Summary: python components for the gensim package.
Group: Default
Expand All @@ -56,18 +62,20 @@ python3 components for the gensim package.


%prep
%setup -q -n gensim-3.3.0
%setup -q -n gensim-3.5.0

%build
export http_proxy=http://127.0.0.1:9/
export https_proxy=http://127.0.0.1:9/
export no_proxy=localhost,127.0.0.1,0.0.0.0
export LANG=C
export SOURCE_DATE_EPOCH=1528564391
export SOURCE_DATE_EPOCH=1533509167
python3 setup.py build -b py3

%install
rm -rf %{buildroot}
mkdir -p %{buildroot}/usr/share/doc/gensim
cp COPYING %{buildroot}/usr/share/doc/gensim/COPYING
python3 -tt setup.py build -b py3 install --root=%{buildroot}
echo ----[ mark ]----
cat %{buildroot}/usr/lib/python3*/site-packages/*/requires.txt || :
Expand All @@ -76,6 +84,10 @@ echo ----[ mark ]----
%files
%defattr(-,root,root,-)

%files license
%defattr(-,root,root,-)
/usr/share/doc/gensim/COPYING

%files python
%defattr(-,root,root,-)

Expand Down
2 changes: 1 addition & 1 deletion options.conf
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
[package]
name = gensim
url = http://pypi.debian.net/gensim/gensim-3.3.0.tar.gz
url = https://files.pythonhosted.org/packages/d1/8d/f20e715f3eae5a277b13a31d440d65f294fadbc2047c4d02226e1de05b6e/gensim-3.5.0.tar.gz
archives =
giturl = https://github.com/RaRe-Technologies/gensim.git

Expand Down
2 changes: 1 addition & 1 deletion release
Original file line number Diff line number Diff line change
@@ -1 +1 @@
27
28
2 changes: 1 addition & 1 deletion upstream
Original file line number Diff line number Diff line change
@@ -1 +1 @@
54bfeade5310d1dc474d84e6cb165200bad0b0c8/gensim-3.3.0.tar.gz
6f58b91a1c6f9d3cacec632457253961dfd85be8/gensim-3.5.0.tar.gz

0 comments on commit d4ead46

Please sign in to comment.