A comprehensive analysis of SpaceX launch data, employing various data science techniques to predict landing outcomes and extract meaningful insights
View the Project on GitHub alxmares/SpaceX-Launch-Analysis-and-Landing-Prediction
In this project, I performed extensive exploratory data analysis to understand the nuances of SpaceX launch outcomes. Below are the key steps taken:
A key SQL query used in the analysis to understand the distribution of landing outcomes:
SELECT landing_outcome, count(*) as outcome_count
FROM SPACEXTABLE
WHERE Date BETWEEN '2010-06-04' AND '2017-03-20'
GROUP BY landing_outcome
ORDER BY outcome_count DESC
Output:
Landing_Outcome | Outcome_Count |
---|---|
No attempt | 10 |
Success (drone ship) | 5 |
Failure (drone ship) | 5 |
Success (ground pad) | 3 |
Controlled (ocean) | 3 |
Uncontrolled (ocean) | 2 |
Failure (parachute) | 2 |
Precluded (drone ship) | 1 |
Using the cleaned data, I analyzed the success rate for each orbit type. The bar chart below illustrates the success rates: This chart shows the success rate of launches to different orbits, providing insights into the reliability of various orbital missions.
The line plot highlights how the success rate has evolved over the years, reflecting SpaceXโs growing proficiency in launch operations.
Created interactive maps to visualize launch sites and their respective landing outcomes, leveraging the power of Folium for geographical representation.
Developed a dynamic dashboard using Dash and Plotly to provide interactive visualizations and predictive analytics for SpaceX launches.
Several machine learning algorithms were utilized to predict the landing outcomes:
GridSearchCV
to achieve optimal performance.GridSearchCV
.Model | Test Accuracy |
---|---|
Logistic Regression | 0.8333 |
Support Vector Machine | 0.8333 |
Decision Trees | 0.7222 |
K-Nearest Neighbors | 0.8333 |
The results indicate that the Logistic Regression, SVM, and KNN models all performed equally well, achieving an accuracy of 83.33% on the test data. The Decision Tree model, while slightly less accurate, still provided valuable insights with an accuracy of 72.22%.