Remove CZ synonym and stop words
because they have been deleted from the source website
eronisko committed Mar 1, 2024
1 parent 165faa0 commit 92c077f
Showing 1 changed file with 0 additions and 4 deletions.
4 changes: 0 additions & 4 deletions Dockerfile
@@ -11,8 +11,4 @@ RUN curl -Lso config/lemmagen/en.lem https://github.com/vhyza/lemmagen-lexicons/
 RUN curl -Ls https://github.com/SlovakNationalGallery/elasticsearch-slovencina/archive/master.tar.gz | tar xz -C config --strip-components=1
 
 # grab extra synonym files and save them inside config/synonyms
-RUN curl -Lso config/synonyms/synonyms_cz.txt https://sites.google.com/site/kevinbouge/synonyms-lists/synonyms_cz.txt
 RUN curl -Ls http://wordnetcode.princeton.edu/3.0/WNprolog-3.0.tar.gz | tar xz -C config/synonyms --strip-components=1 prolog/wn_s.pl
-
-# grab extra stop-words files
-RUN curl -Lso config/stop-words/stop-words-czech2.txt https://sites.google.com/site/kevinbouge/stopwords-lists/stopwords_cz.txt
