NLP Comment Filtering

Artificial Intelligence Course 4th Project: Implementing Bigram and Unigram models for filtering comments.
In this group project we (Amirhossein-Rajabpour and arminZolfaghari) implemented Bigram and Unigram models to filter comments.

We trained these models on these positive and negative datasets. We also used smoothing in both models (you can change coefficients). For preprocessing first we removed punctuation marks and we also have a cut_down parameter which specifies that words with equal or less number of repetition to this parameter should be removed. Also there is a cut_above parameter that specifies that how many of most repeated words should be removed.

A sample run:

Check full description here

Project report (in persian): tried different coefficients and tried the models with and without cut_down and cut_above and checked the results here

Check our other AI Course projects:

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
.idea		.idea
dataset		dataset
AI_P4.pdf		AI_P4.pdf
AI_P4_Report.pdf		AI_P4_Report.pdf
BigramModel.py		BigramModel.py
Dataset.py		Dataset.py
Main.py		Main.py
README.md		README.md
UnigramModel.py		UnigramModel.py
sample run.jpg		sample run.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP Comment Filtering

About

Releases

Packages

Contributors 2

Languages

arminZolfaghari/NLP-Comment-Filtering

Folders and files

Latest commit

History

Repository files navigation

NLP Comment Filtering

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages