A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousing, containerization, and a dashboard to monitor data pipeline KPIs
python emr docker aws airflow spark cassandra postgresql s3 data-warehouse data-engineering data-lake infrastructure-as-code redshift etl-pipeline infrastructure-setup
-
Updated
Apr 29, 2021 - Python