This project is to demonstrate how to perform statistical analysis using pandas and polars. We then compare runtimes of both the approaches. Then for CICD, we perform tests to evaluate compatibility across different python versions.
- A
.ipynb
notebook each for polars and pandas analysis - A
.py
script to calculate the runtimes of each of these notebooks - A
lib
folder withhelper.py
script to host helper function.
src/
: Contains the source code for the project.tests/
: Contains the unit tests for the project.requirements.txt
: Lists the Python dependencies.Makefile
: Defines common tasks like installing dependencies, running tests, linting, and running docker..devcontainer/
: ContainsDockerfile
and VS Code configuration..github/workflows/
: Contians CI/CD workflows for GitHub.
Clone the repository to your local machine:
git clone https://github.com/nogibjj/arko_individual_project_1.git
cd arko_individual_project_1
.venv/bin/python rdu_weather_analytics_pandas.ipynb
.venv/bin/python rdu_weather_analytics_pandas.ipynb
As we can see. polars tends to run quicker than its equivalent implementation in pandas.