Open-source boilerplate code for data engineering with pandas.
- Install dependencies from requirements.txt
- Create a data folder in the project root
- Add your AWS credentials to the configuration/config.ini file
- Put your raw file in either S3 or the data/raw/ folder
- Run the pipeline from src/main.py
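
The setup steps above can be sketched in Python as follows. This is only an illustration, not the code in src/main.py: the helper names, the `[aws]` section, the key names `aws_access_key_id` and `aws_secret_access_key`, and the assumption that the raw file is a CSV are all hypothetical, so adjust them to match your own config.ini and data.

```python
import configparser
from pathlib import Path


def ensure_raw_dir(root: Path) -> Path:
    """Create data/raw/ under the project root if it does not exist yet."""
    raw_dir = root / "data" / "raw"
    raw_dir.mkdir(parents=True, exist_ok=True)
    return raw_dir


def read_aws_credentials(config_path: Path) -> dict:
    """Read AWS credentials from an INI file.

    The [aws] section and both key names are assumptions for illustration;
    the real config.ini layout may differ.
    """
    parser = configparser.ConfigParser()
    # ConfigParser.read returns the list of files it managed to parse,
    # so an empty list means the file is missing or unreadable.
    if not parser.read(config_path):
        raise FileNotFoundError(f"Config file not found: {config_path}")
    section = parser["aws"]
    return {
        "aws_access_key_id": section["aws_access_key_id"],
        "aws_secret_access_key": section["aws_secret_access_key"],
    }


def load_raw_csv(raw_dir: Path, filename: str):
    """Load one raw file into a DataFrame (assumes the raw file is a CSV)."""
    import pandas as pd  # imported lazily so the helpers above work without pandas

    return pd.read_csv(raw_dir / filename)
```

A typical call sequence would be `ensure_raw_dir(Path("."))`, then `read_aws_credentials(Path("configuration/config.ini"))`, run from the project root. Keeping credentials in config.ini means the file should stay out of version control.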