MirasText
-
Updated
Aug 12, 2020 - Python
MirasText
This is the distribution point for the NUS SMS Corpus as described and updated from This is a corpus of SMS (Short Message Service) messages collected for research at the Department of Computer Science at the National University of Singapore. This dataset consists of 67,093 SMS messages taken from the corpus on Mar 9, 2015. The messages largely …
Statistical text analysis and semantic networks with Python
This is a german text corpus from Wikipedia. It is cleaned, preprocessed and sentence splitted. It's purpose is to train NLP embeddings like fastText or ELMo Deep contextualized word representations.
Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good model.
Expanding sentences in a given text corpus. The code checks for NE in sentences and create new sentences by injecting new NEs from NE list.
AsoSoft Text Corpus is the first large scale text corpus for the Kurdish language.
A text corpus collection for the DroppedText language.
Text corpus the of Tlingit language for linguistic research.
A model was trained using Google handwritten Fonts using a text corpus containing only digits ranging from 0-9. The main aim was to recognize ICR sheets from such trained data. Our model gave an accuracy of 94.6% using Tesseract Version-4.
Corpus de novelas hispanoamericanas del siglo XIX (conha19)
Search a long list of names (patterns) in a large text corpus systematically and quickly
Yeezy Taught Me Text Generation. Training next character predictions RNN LSTM model with user input text corpus
Walk through to convert WikiMedia into a text corpus
Command-line corpus tools
Final project for Natural language processing course in final_project_diary folder
A project that extracts Honkai: Star Rail text corpus
"Text Analyzer" is a web application designed to analyze any given text or script and provide users with useful information about its contents.
Walk through to convert PMC OAS Dataset into a text corpus
Add a description, image, and links to the text-corpus topic page so that developers can more easily learn about it.
To associate your repository with the text-corpus topic, visit your repo's landing page and select "manage topics."