Fuzzy string matching, grouping, and evaluation.
Updated Dec 23, 2024 - Python
Machine learning movie recommendation system
Text2Text Language Modeling Toolkit
A Python Search Engine for Humans 🥸
Text vectorization tool to outperform TFIDF for classification tasks
Several methods for text classification
Implementation, with some extensions, of the paper "Snowball: Extracting Relations from Large Plain-Text Collections" (Agichtein and Gravano, 2000)
Chinese text classification in practice, based on the Sogou news corpus, using traditional machine learning methods as well as pre-trained models
Stringlifier is an open-source ML library for detecting random strings in raw text. It can be used for sanitising logs, detecting accidentally exposed credentials, and as a pre-processing step in unsupervised ML-based analysis of application text data.
Arabic Open Domain Question Answering System using Neural Reading Comprehension
Social analysis based on WhatsApp data
Term frequency–inverse document frequency for Chinese novels and documents, implemented in Python.
Document similarity algorithms experiment - Jaccard, TF-IDF, Doc2vec, USE, and BERT.
A content-based recommender system that uses TF-IDF and cosine similarity to find the N most similar items in a dataset
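Several of the projects listed above combine TF-IDF weighting with cosine similarity to rank similar items. The following is a minimal sketch of that general idea in plain Python, not the code of any listed repository. The function names (`tfidf_vectors`, `cosine`, `most_similar`), the whitespace tokenizer, and the smoothed IDF variant are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document."""
    # Assumption: naive lowercase whitespace tokenization.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            # Assumed smoothed IDF variant: log((1 + N) / (1 + df)) + 1
            term: (count / total) * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(docs, query_index, n=2):
    """Return up to n (index, score) pairs most similar to docs[query_index]."""
    vectors = tfidf_vectors(docs)
    query = vectors[query_index]
    scores = [
        (i, cosine(query, vec))
        for i, vec in enumerate(vectors)
        if i != query_index
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:n]


docs = [
    "space adventure with robots",
    "robots and space battles",
    "romantic comedy in paris",
    "a comedy about love in paris",
]
top = most_similar(docs, query_index=0, n=1)
```

Here `top` holds the single closest neighbour of the first description, which is the second one (index 1), since those two are the only pair sharing the terms "space" and "robots". A production system would more likely rely on an established vectorizer and a proper tokenizer than on this hand-rolled version.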