Skip to content

JennyferWAN/IBM_Data_Science_Capstone_Project_Space_X

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ViewCount GitHub top language GitHub language count Stars Badge Forks Badge Pull Requests Badge Total Downloads

➫ IBM_Data_Science_Capstone_Project_Space_X :

In this capstone, i predicted if the Falcon 9 first stage will land successfully. SpaceX advertises Falcon 9 rocket launches on its website, with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage. Therefore if we can determine if the first stage will land, we can determine the cost of a launch. This information can be used if an alternate company wants to bid against SpaceX for a rocket launch.

Several examples of an unsuccessful landing :

landing_1

crash

➫ Table of contents :

1️⃣ Data Collection :

  1. SpaceX Rest API.
  2. Web Scraping from Wikipedia.

2️⃣ Data Wrangling (Preparing data for Machine Learning) :

  1. Data Wrangling.
  2. EDA with SQL : Connect to the database using DB2 from IBM Cloud to write and execute SQL queries
  3. EDA with Matplotlib, Seaborn, and Plotly

3️⃣ Interactive Visual Analytics and Dashboard :

  1. Interactive Visual Analytics with Folium
  2. Build an Interactive Dashboard with Ploty Dash : Perform interactive visual analytics by using VSCode

4️⃣ Predictive Analysis (Classification) :

  1. Machine Learning Prediction : Find best Hyperparameter for Logistic Regression, SVM, Decision Tree and KNN

5️⃣ My Data-Driven Insights :

  1. Final Project : Resume of all our findings and processes in this final project with explanations.
    Final Project with Genially : Project with animations and clickable links to be directed to github pages.

Preview of SpaceX Project Presentation :

SpaceX.mp4

➫ Build With :

  • Python - Open source programming language most used by computer scientists.
  • Anaconda - Distribution of the Python and R programming languages for scientific computing that aims to simplify package management and deployment from local environment.
  • Jupyter Notebooks - Open-source web application that allows data scientists to create and share documents that integrate live code, equations, computational output, visualizations, and other multimedia resources.
  • VSCode - Visual Studio Code is a source-code editor that can be used with a variety of programming languages, including Java, JavaScript, Go, Node.js, Python and C++.
  • IBM Watson Studio - IBM’s software platform for data science.
  • IBM Db2 - Db2 is a family of data management products, including database servers, developed by IBM.
  • IBM Cloud - Next-generation hybrid multicloud platform, advanced data and AI capabilities
  • Plotly - Plotly's Python graphing library makes interactive, publication-quality graphs.
  • PowerPoint - PowerPoint is software for making electronic slide shows that can then be projected onto a screen, allowing for oral presentations.
  • GitHub - Website that provides a cloud service for developers to store and manage their code.

➫ Badges :

Author :

Jennyfer WAN