Skip to content

rmarconzini/amazon-fine-food-reviews-classification-clustering

Repository files navigation

Amazon Fine Food Reviews Classification and Clustering

Requirements

  • python 3.8+
  • Pandas
  • Numpy
  • Sk-Learn
  • gensim
  • nltk
  • textblob
  • yellowbrick

Abstract

The project aims to analyze the Amazon Fine Food Reviews dataset using different Text Mining techniques. Initially, an exploratory analysis of the data is performed, followed by some preprocessing activities. Then, different classification and clustering models are implemented, to classify reviews starting from their text and grouping similar reviews.

Google Drive folder with models estimated and datasets: https://drive.google.com/drive/u/1/folders/1veNClNl7CxCTFHVNY2Fp29hcMoEj-the

Research Question

The project aims to answer the following questions:

  1. Can a review be classified as good or bad from its text?
  2. Is it possible to predict the user’s rating starting from the text of the review?
  3. Is it possible to group similar reviews?

About

Progetto del corso di Text Mining & Search

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published