project_2
View our presentation here: https://docs.google.com/presentation/d/1VOsYs7ST_fb8iY0AjCcxDU_StIpxHrh422JAz6LDbME/edit?usp=sharing
Data sets:
https://www.kaggle.com/nationalparkservice/park-biodiversity
https://data.world/inform8n/us-national-parks-visitation-1904-2016-with-boundaries/discuss/national-parks-totals-by-year/g44tenrr
https://www.kaggle.com/planejane/national-park-trails
https://data.world/ajsanne/north-america-bear-killings
Our team will submit a Final Report that describes the following:
Extract: your original data sources and how the data was formatted (CSV, JSON, pgAdmin 4, etc)
We will read the cvs files into pandas dataframes and filter the data. Then load the dataframes into Postgres database
Transform: what data cleaning or transformation was required.
Select columns via Pandas and filter out relevant park names
Load: the final database, tables/collections, and why this was chosen.