Skip to content

BJTU, trainning course, Predicting flight delays with weather data.

License

Notifications You must be signed in to change notification settings

Routhleck/flight-delay-predict

Repository files navigation

img img img img img

All Contributors

flight-delay-predict

English | 中文

Predicting flight delays with weather data.

img

img

img

img

Caution ⚠⚠⚠

Database

Database initialization files are located in the dataset-and-model branch, under the design folder in all.sql.

Note!! Due to an error with the weather prediction website during the presentation, another method was temporarily used. You'll need to add the same attributes from departureweather (year, month, day, normal_prob, mild_prob, moderate_prob, serious_prob) to the arriveweather table and delete the date attribute.

Next, you need to INSERT data into the airline and airport tables, which are located in the delay-master branch's modelTrain/predict/dict_id.csv and the dataset-and-model branch's dataset/airport.csv, respectively.

Finally, don't forget to reconfigure your (cloud) database in API/algorithm.py and API/loginAndRegister.py.

Weather Data Prediction

During the defense, the weather site being scraped was temporarily down, so weather predictions were replaced with data from the same day in previous years. It can now be used again. If you need to reintegrate weather predictions, you can restore the weather prediction function in algorithm.py (if I recall correctly).

Project

A flight delay prediction system based on previous years' flight and weather information.

Data Cleaning Project:

  1. First, manually create an airport-city code reference dictionary based on the original flight information data. Only a selection of airports is chosen.
  2. Fill in the corresponding airport longitude and latitude.
  3. First, clean: remove items where the departure and arrival airports are not in the provided reference airport dictionary.
  4. Second cleaning: remove duplicated items for the same flight route (i.e., where departure and arrival points are the same).
  5. The handling of the most original dataset also includes:
  • Calculate the planned departure, arrival, actual departure, and arrival times from the original timestamp.
  • Calculate the distance between different airports based on their longitude and latitude and integrate it into the corresponding flight information column.
  1. Scrape and fill in weather information using the constructed airport-city reference dictionary.
  • First, construct daily weather information files for different cities in .CSV format.
  • Scrape data through web crawlers and write to the city weather file.
  • Use the city to locate the file to access; the estimated departure date to locate the specific item to fill.
  1. Finally, save the input to get the initially cleaned dataset.

Contributors ✨

Thanks goes to these wonderful people (emoji key):

Project Dependence

npm install echarts@4.9 npm install --save-dev less-loader less

Team Allocation:

Project Manager: 解世超 (Jie Shichao)

Front-end Engineers: 蒋涵 (Jiang Han), 陈泽锋 (Chen Zefeng)

Back-end Engineers: 何毅 (He Yi), 江顺 (Jiang Shun)

Data Engineers: 贺思超 (He Sichao)


Routhleck Ekalesor

🔣 💻

是小柴同学吖

💻 🐛 🚇

Shichao

📖 🤔 📆

chenzefeng33

🎨 📦

hanjiang1073

🎨 📦

1avish

💻

This project follows the all-contributors specification. Contributions of any kind welcome!

About

BJTU, trainning course, Predicting flight delays with weather data.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published