ADM_HW4_Group3 - Recommendation systems and clustering everywhere

This repository contains code and analysis for the 4th homework assignment for the Algorithmic Methods of Data Mining course.

Usage

The main tasks are implemented in main.ipynb. This covers:

Recommendation system using MinHash and locality-sensitive hashing (LSH)
User clustering with feature engineering, dimensionality reduction, K-means and DBSCAN
The command line question and the algorithmic question
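
As a rough illustration of the first task, the sketch below shows one way MinHash signatures and LSH banding can be combined to find similar users. It is not the code from main.ipynb: the signature length, the band/row split, the blake2b-based hash family, and the toy user-to-genre data are all assumptions chosen for the example.

```python
import hashlib
from collections import defaultdict
from itertools import combinations

NUM_HASHES = 64              # signature length (assumed value)
BANDS = 16                   # number of LSH bands (assumed value)
ROWS = NUM_HASHES // BANDS   # rows per band: 4


def _hash(seed: int, item: str) -> int:
    """Deterministic 64-bit hash of `item`, salted with `seed`."""
    digest = hashlib.blake2b(f"{seed}:{item}".encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def minhash_signature(items):
    """For each seeded hash function, keep the minimum hash over a non-empty set."""
    return [min(_hash(seed, it) for it in items) for seed in range(NUM_HASHES)]


def lsh_candidate_pairs(signatures):
    """Return pairs of keys whose signatures agree on at least one whole band."""
    candidates = set()
    for b in range(BANDS):
        buckets = defaultdict(list)
        for key, sig in signatures.items():
            band = tuple(sig[b * ROWS:(b + 1) * ROWS])
            buckets[band].append(key)
        for members in buckets.values():
            candidates.update(combinations(sorted(members), 2))
    return candidates


def estimated_jaccard(sig_a, sig_b):
    """Fraction of matching signature positions, an estimate of Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


# Hypothetical toy data: user id -> set of genres watched.
users = {
    "u1": {"Drama", "Comedy", "Thriller", "Action"},
    "u2": {"Drama", "Comedy", "Thriller", "Action"},
    "u3": {"Documentary", "Animation"},
}
signatures = {user: minhash_signature(genres) for user, genres in users.items()}
pairs = lsh_candidate_pairs(signatures)
```

Here u1 and u2 have identical genre sets, so their signatures match in every band and the pair is returned as a candidate. More bands with fewer rows each make the filter more permissive; fewer bands with more rows make it stricter.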

The command line question is implemented in CommandLine.sh; its output is shown in SS.png.

The code requires Python 3 and standard data science libraries such as pandas, NumPy, and scikit-learn.

The bash script assumes a Linux/Unix environment with common command line utilities such as grep and wc.

Course project completed as part of the Algorithmic Methods of Data Mining course.
