Capstone project for the IBM Data Science Professional Certificate.
Predicting Falcon 9 rocket landing success using SpaceX launch data, EDA, ML models, and interactive dashboards.
This project explores SpaceX's Falcon 9 launch records to identify key factors that influence whether the rocket’s first stage successfully lands. The goal is to build machine learning models to predict landing success and create interactive tools for exploring the data.
SpaceX aims to reduce launch costs by recovering and reusing rockets, but landing is not always successful.
Can we predict landing success based on parameters like:
- Launch site
- Payload mass
- Orbit type
- Booster version
- Languages: Python, SQL
- Libraries: Pandas, Scikit-learn, Plotly, Folium, Dash
- Tools: Jupyter, Git, REST APIs (SpaceX), Web Scraping (Wikipedia)
- Models: Logistic Regression, SVM, Decision Tree, K-Nearest Neighbors
- Collected data via SpaceX API and Wikipedia.
- Cleaned and filtered for Falcon 9 launches.
- Engineered binary target variable (
landing_class: 0 = fail, 1 = success).
- Used SQL queries and visualizations to explore patterns.
- Key correlations: payload mass, booster version, launch site.
- Visualized trends and outcomes over time.
- Built an interactive dashboard using Plotly Dash:
- Dropdown to select launch site
- Range slider for payload mass
- Pie chart for site-wise success rates
- Scatter plot to analyze payload vs. outcome
- Geospatial maps with Folium to locate and analyze launch sites.
- Trained and evaluated:
- Logistic Regression
- SVM
- Decision Tree
- KNN
- Achieved ~83% accuracy across models.
- Evaluated results with confusion matrices and accuracy scores.