Watch the video presentation here
1. Core Microservice Implementation
- Technology: Python (Flask framework).
- Endpoints:
  - `/rankings`: Fetch top-ranked tracks by region and date.
  - `/artist`: Retrieve track data for a specific artist.
- Optimizations:
- Cached grouped data for faster query performance.
- Optimized database filtering and ranking logic.
- Tested Functionality:
- Successfully tested all endpoints locally, within a Docker container, and deployed to AWS App Runner.
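The endpoints and the cached grouped lookup described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual `app.py`: the column names (`region`, `date`, `rank`, `artist`, `track`), query parameters, and the tiny in-memory DataFrame standing in for `spotify_top_200.parquet` are all assumptions.

```python
from functools import lru_cache

import pandas as pd
from flask import Flask, jsonify, request

app = Flask(__name__)

# Illustrative in-memory dataset standing in for spotify_top_200.parquet.
df = pd.DataFrame({
    "region": ["us", "us", "gb"],
    "date": ["2021-01-01", "2021-01-01", "2021-01-01"],
    "rank": [1, 2, 1],
    "artist": ["Artist A", "Artist B", "Artist A"],
    "track": ["Track 1", "Track 2", "Track 3"],
})

@lru_cache(maxsize=128)
def rankings_for(region, date):
    # Cached grouped lookup: repeated queries for the same (region, date)
    # pair skip re-filtering and re-sorting the DataFrame.
    subset = df[(df["region"] == region) & (df["date"] == date)]
    return subset.sort_values("rank").to_dict(orient="records")

@app.route("/rankings")
def rankings():
    # Top-ranked tracks for a given region and date.
    return jsonify(rankings_for(request.args.get("region"),
                                request.args.get("date")))

@app.route("/artist")
def artist():
    # All tracks for the requested artist.
    subset = df[df["artist"] == request.args.get("name")]
    return jsonify(subset.to_dict(orient="records"))

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=3000)
```

Both routes can be exercised locally with Flask's built-in test client before moving to Docker or App Runner.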
2. Logging
- Integrated structured logging to track:
- Incoming requests (parameters, endpoints).
- Errors and debug information.
- Logs are written to both the console and a file (`app.log`).
- Verified correctness during various test scenarios.
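The dual console-and-file setup above can be sketched with the standard `logging` module; the logger name and format string here are assumptions, not the project's exact configuration.

```python
import logging

logger = logging.getLogger("spotify_microservice")
logger.setLevel(logging.DEBUG)
fmt = logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")

# One handler writes to app.log, the other to the console.
file_handler = logging.FileHandler("app.log")
file_handler.setFormatter(fmt)
console_handler = logging.StreamHandler()
console_handler.setFormatter(fmt)

logger.addHandler(file_handler)
logger.addHandler(console_handler)

# Example: log an incoming request's endpoint and parameters.
logger.info("GET /rankings region=%s date=%s", "us", "2021-01-01")
```

Errors and debug details go through the same logger, so both sinks stay in sync.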
3. Dockerization
- Created a production-ready Dockerfile:
  - Uses `gunicorn` as the WSGI server for performance.
  - Handles dependency installation (`requirements.txt`).
  - Exposes port 3000 for microservice access.
- Verified the Docker image works as expected locally and pushed the image to AWS ECR.
- If any of the following files are updated:
  - `app.py` (microservice logic).
  - `requirements.txt` (dependencies).
  - `Dockerfile` (image configuration).
  - Dataset file (e.g., `spotify_top_200.parquet`) if it is embedded within the image.
- Rebuild the Docker image with: `docker build -t spotify_microservice .`
- After rebuilding, test the new image by running: `docker run -p 3000:3000 spotify_microservice`
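A Dockerfile along these lines would match the bullets above; the base image tag and worker count are illustrative assumptions, not the project's exact configuration.

```dockerfile
FROM python:3.10-slim

WORKDIR /app

# Install dependencies first so Docker's layer cache skips this step
# when only application code changes.
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Copy the service code and the embedded dataset.
COPY app.py spotify_top_200.parquet ./

EXPOSE 3000

# Serve the Flask app with gunicorn instead of the development server.
CMD ["gunicorn", "--bind", "0.0.0.0:3000", "--workers", "4", "app:app"]
```

Copying `requirements.txt` before the application code keeps rebuilds fast, since only the final layers change when `app.py` is edited.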
4. Deployment to AWS App Runner
- Pushed the Docker image to AWS ECR.
- Deployed the microservice using AWS App Runner:
- Configured the service to pull the image from ECR.
- Verified successful deployment and health checks.
- Public endpoint available for accessing the service.
- Performance under Load: The system handles initial load increases well, but begins to struggle as the number of requests per second and concurrent users grow drastically.
- Potential Issues: The spike in response times and growing failures suggest bottlenecks, possibly in CPU, memory, or I/O capacity.
- Next Steps: Investigate system resource utilization during this test to identify the bottleneck. Consider optimizing code, scaling infrastructure, or adding load balancing to improve performance. Switching to AWS EC2 did not improve the situation. Given the budget limit, we may move our database to PostgreSQL on AWS to enable transactional query optimization.
- `app.py`: The Flask microservice implementation.
- `locustfile.py`: Load-testing script for simulating requests.
- `Dockerfile`: Defines the Docker image for production deployment.
- `requirements.txt`: Python dependencies for the microservice.
- `data.py`: Script for preparing and transforming the dataset.
- `spotify_top_200.parquet`: Preprocessed dataset used by the microservice.
- Run the microservice:
  - Open a terminal and run: `python app.py`
  - This starts the Flask application. Access endpoints at http://127.0.0.1:3000.
- Run load tests:
  - Open a second terminal and run: `locust -f locustfile.py`
  - This starts the Locust load-testing framework. Access the monitoring interface at http://127.0.0.1:8089.
  - Note: Ensure both `app.py` and `locustfile.py` are running simultaneously to avoid interruptions.
- Run the Docker container:
  - Alternatively, test the Dockerized version: `docker run -p 3000:3000 spotify_microservice`
  - Access the application at http://127.0.0.1:3000.
- Rebuild the Docker image (if required):
  - If you modify any code or configuration, rebuild with: `docker build -t spotify_microservice .`
  - Test the updated image using the steps above.
This project uses AWS App Runner for deployment, pulling Docker images from AWS ECR and managing the infrastructure automatically. Health checks and scaling configurations ensure smooth operation under varying loads.