In this Kaggle competition, the challenge is to build a model that predicts the total ride duration of taxi trips in New York City. The primary dataset was released by the NYC Taxi and Limousine Commission and includes pickup time, geo-coordinates, number of passengers, and several other variables. This notebook consists of the following sections:
- Data Loading & Basic Consistency Check
- Visualizing Features
- Feature Engineering
- Data Cleaning
The dataset used is from Kaggle and is available here.