Green New York is a Big Data Management and Analytics project which purpose is to discover potential new Citi bike stations in the city of New York by finding a correlation between the Citi bike and taxi trip data.
- Platform: Python
- Framework: Spark
- Cluster: Hadoop, HDFS (HUE)
- Taxi Trip Data (2009-2014) (csv)
- Citi Bike Stations (json)
- NYC Boroughs (geojson)
- NYC Blocks (geojson)
- [Final Report] (/documents/report.pdf)