Contains script in Python for Data Pipeline solution on Azure Databricks
-
Updated
Nov 1, 2022 - Jupyter Notebook
Contains script in Python for Data Pipeline solution on Azure Databricks
In this project, I'll apply what I've learned on data modeling with Postgres and build an ETL pipeline using Python. To complete the project, I will define the fact and dimension tables for a star schema for a particular analytic focus, and write an ETL pipeline that transfers data from files in two local directories into these tables in Postgre…
Udacity Data Engineering project: Data Warehouse
This is an MSSQL Data Warehouse and ETL implementation on specially formatted Water Quality dataset from DEFRA, UK
Builds dimension tables with configurable/variable granularity. Keeps those tables up-to-date. This makes reporting queries performant and doable. I use it with https://grafana.com/
Analyzed Zomato's restaurant data to reveal global insights, highlighting continent-wise restaurant counts, average costs, and ratings. Also, identified country-level cuisine counts to enhance Zomato's data-driven decision-making.
This is an Oracle DB Data Warehouse and ETL implementation on specially formatted Water Quality dataset from DEFRA, UK
Data Warehouse creation using MySQL and R
A simplified introdocutory data analysis report to show how using SQL queries can answer questions from a data warehouse with a star schema. The report answers why server downtime happens and how it can be fixed.
Using ETL process to Extract, Transform, and Load local data into a PostgreSQL database
Cloud Data Warehouse of Sparkify Data using Redshift
Projects from Udacity Data Engineer Nanodegree
Star Schema design and Data Ingestion
This is a business intelligence project on analyzing super market data. Check out the README file for more details.
Add a description, image, and links to the dimension-tables topic page so that developers can more easily learn about it.
To associate your repository with the dimension-tables topic, visit your repo's landing page and select "manage topics."