Skip to content

Releases: Hyuto/indo-nlp

v0.3.4

15 Oct 17:21
Compare
Choose a tag to compare

Changelog

Bug Fixing

Memperbaiki dan mengganti inner pattern pada fungsi replace_word_elongation.

Updates

  1. Menambahkan wkwk pattern pada stopwords.
  2. Menggunakan inner flags untuk insensitive case.

v.0.3.3

29 Sep 17:37
Compare
Choose a tag to compare

Changelog

Bug Fixing

Fixing bog on slang words regex pattern caused by common string from SLANG_DATA.

v0.3.2

04 Sep 15:44
Compare
Choose a tag to compare

Changelog

Update

  • Mengubah return Dataset.read method menjadi dataclass Data.
  • Update docstring pada code.

v.0.3.1

22 Aug 16:08
Compare
Choose a tag to compare

Changelog

Documentation 📝

Make documentation site for indoNLP using mkdocs with mkdocs-material theme and auto generating code references using mkdocstring.

Visit indoNLP website

  • Fixing code and change code docstring to Bahasa
  • Merge docs to master and deploy site using github action

Bug Fixing

  • Fixing top level import at indoNLP/__init__.py
  • Fixing inconsistent return indoNLP.dataset.reader.txt_table_reader

v0.3.0

17 Aug 05:37
Compare
Choose a tag to compare

Changelog

New Feature

Dataset 📖

New module indoNLP.dataset to provide easy way to access Indonesian open dataset for NLP.

v0.2.0

14 Jul 05:21
Compare
Choose a tag to compare

Changelog

Bug Fixing

Fixing bug on preprocessing.replace_word_elongation to only replace repeating characters in the end of words

New Feature

Emoji Supports 🤗

Able to preprocess emoji containing text with functions

  1. emoji_to_words
  2. words_to_emoji

v.0.1.1

30 Jun 14:24
Compare
Choose a tag to compare

Changelog

Change preprocessing.pipline to preprocessing.pipeline

v0.1.0

27 Jun 18:09
Compare
Choose a tag to compare

Initial Release

Create preprocessing module consist of several common utility functions.

  1. preprocessing.remove_html
  2. preprocessing.remove_url
  3. preprocessing.remove_stopwords
  4. preprocessing.replace_slang
  5. preprocessing.replace_word_elongation
  6. preprocessing.pipeline