Regional Rhythms: Spotify Audio Feature Analysis

Exploratory data analysis of Spotify audio features, using data from the Spotify Charts Dataset and the Spotify API.

Project Report

About

Our work explores global music preferences using data from the Spotify Charts Dataset and the Spotify API. By examining the top-charting songs across regions over a specified period, we aim to uncover patterns and correlations between regional popularity and intrinsic audio features of the music, such as danceability, energy, and loudness. The dataset includes dates, regions, and rankings, giving a view of how music consumption shifts worldwide. We expect the analysis to be useful to artists, record labels, and marketers who want to tailor their approach to regional preferences and the audio features of popular music. The study also contributes to the academic understanding of cultural diversity in music, since we anticipate a wide variety of musical tastes and characteristics across regions.

Collaborators:

This project is a component of the ML-4102 Data Literacy course, taught by Philipp Hennig in the winter of 2023.

This course provides students with essential concepts and tools for working with large datasets, covering practical experiments and examples to discuss common pitfalls and best practices. It addresses basic statistical notions, bias, testing, and experimental design while employing foundational methods of machine learning and statistical data analysis.

Installation

We used Python 3.10 to obtain our results. The requirements should also work with higher (and some lower) Python versions, but we recommend a Python 3.10 environment for reproducibility.
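
To confirm which interpreter is active before running the notebooks, a small check like the one below may help. It is only a sketch: the helper name `matches_expected` is hypothetical and not part of this repository, and a mismatch does not necessarily mean the code will fail.

```python
import sys

# Version reported in this README; other versions may work but are not guaranteed to.
EXPECTED = (3, 10)


def matches_expected(version_info=sys.version_info, expected=EXPECTED):
    """Return True if the major.minor version equals the expected one."""
    return tuple(version_info[:2]) == tuple(expected)


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    if matches_expected():
        print(f"Python {current}: matches the version used for the reported results.")
    else:
        print(f"Python {current}: differs from 3.10; results may not reproduce exactly.")
```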

Environment creation example with Conda:

$ conda create --name regional_rhythms python=3.10
$ conda activate regional_rhythms

Alternatively, you can create a new environment as a copy of ours (though this may fail if your system is not compatible with all of the versions specified in the file):

$ conda env create -f env/conda_env.yaml

If you create the environment this way, skip to step 4 once it is installed.

Install pip, if you have not already, so that you can install the requirements:

Example installation in a Conda environment:

$ conda install pip

Install the requirements:

$ pip install -r env/requirements.txt

If you use Conda, you may need to install an additional package to use the environment as a kernel in Jupyter Notebook:

$ conda install nb_conda_kernels

Download the required datasets:

If you would like to run the scripts that collect data from the Spotify API, create a .env file in the source/ directory and add your Spotify API credentials to it. Spotify's developer documentation explains how to obtain these credentials.
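
As a rough illustration of how credentials in a .env file can be read, the sketch below parses simple KEY=VALUE lines using only the standard library. The variable names `SPOTIFY_CLIENT_ID` and `SPOTIFY_CLIENT_SECRET` and both helper functions are assumptions made for this example; the scripts in source/ may expect different names and may load the file differently (for instance with a library such as python-dotenv).

```python
import os
from pathlib import Path


def parse_env_text(text):
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and one pair of matching quotes, if present.
        value = value.strip()
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "'\"":
            value = value[1:-1]
        values[key.strip()] = value
    return values


def load_env_file(path):
    """Read a .env file and copy its entries into os.environ without overwriting existing values."""
    values = parse_env_text(Path(path).read_text(encoding="utf-8"))
    for key, value in values.items():
        os.environ.setdefault(key, value)
    return values


# Hypothetical variable names; use whatever names the scripts in source/ expect.
EXAMPLE_ENV = """
# Spotify API credentials
SPOTIFY_CLIENT_ID=your-client-id
SPOTIFY_CLIENT_SECRET="your-client-secret"
"""

credentials = parse_env_text(EXAMPLE_ENV)
print(sorted(credentials))
```

With a real file in place, `load_env_file("source/.env")` would make the values available through `os.environ`. Keep the .env file out of version control, since it holds secrets.
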
Move the related dataset files into the data/ folder, resulting in a file structure as follows:

Data Literacy Project
├── data
│   ├── audio_features.csv
│   ├── charts_preprocessed.csv
│   ├── ...
│   └─────────────────
├── env
│   ├── ...
│   └─────────────────
├── exp
│   ├── data_audio_features.ipynb
│   ├── data_exploration.ipynb
│   ├── data_feature_vectors.ipynb
│   ├── data_rank_analysis.ipynb
│   ├── data_regional_correlation.ipynb
│   ├── data_regional_correlation_plots.ipynb
│   ├── data_regional_correlations.ipynb
│   ├── data_visualization.ipynb
│   └─────────────────
├── source
│   ├── utils
│   ├── .env
│   ├── build_dataset.py
│   └─────────────────
...
│
├── .gitignore
├── requirements.txt
├── README.md
└─────────────────

The .ipynb notebooks under exp/ can now be executed. To ensure that relative paths resolve correctly, launch them from inside the project directory:

$ jupyter-notebook exp/example_script.ipynb
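
If relative paths still cause trouble, one option is to resolve data files against the project root instead of the current working directory. The following is a minimal sketch under that assumption; `find_project_root` and `data_path` are hypothetical helpers and are not part of this repository. The file name audio_features.csv is taken from the file structure shown above.

```python
from pathlib import Path


def find_project_root(start, marker="data"):
    """Walk upward from `start` until a directory containing `marker` is found."""
    start = Path(start).resolve()
    for candidate in (start, *start.parents):
        if (candidate / marker).is_dir():
            return candidate
    raise FileNotFoundError(f"No '{marker}' directory found at or above {start}")


def data_path(filename, start="."):
    """Build an absolute path to a file in the project's data/ folder."""
    return find_project_root(start) / "data" / filename


if __name__ == "__main__":
    try:
        print(data_path("audio_features.csv"))
    except FileNotFoundError as error:
        print(error)
```

The helper only builds the path; it does not check that the file itself exists, so a missing download still surfaces as an error when the file is opened.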

Appendix

The plots we produced are in the figures/ directory. It contains the plots used in the report as well as additional ones, with explanations in a README file.

For convenience, our processed data is provided in the data/ folder and is described further in that folder's README file.

License

Roboto fonts included in this repository are licensed under the Apache License, Version 2.0.
