This repository contains the working and output files for the project "Spotify metadata analysis," completed for the "INF3104: Data science foundations" course.
The repo is structured as:
data
contains the raw data as downloaded from Kaggle.
scripts
contains the script used to read in and clean the data, plus the script used to create classification models for predicting genres based on audio features.
paper
contains the Quarto file, bibliographic file, and PDF of the paper write-up.
outputs
contains the .csv and .png files output by the scripts, for inclusion in the paper.
Aspects of the code were written with the help of ChatGPT (through OpenAI) and Copilot (through Microsoft).