Hello, this is a repo for Project 1 of IDS 706: Data Engineering. Here, I am using Kaggle Dataset on books scraped via the Goodreads API (https://www.kaggle.com/datasets/jealousleopard/goodreadsbooks)
- Build a repo in Github
- Configure “scaffold”: Makefile,requirementsfile, app file (example: streamlit, cli, fastapi), test file
- Test with Github Actions
- To build a very simple microservice system that talks to a Big Data Script using Dask.
- Create a webservice using FastAPI
- Testing in demo (link)