0% found this document useful (0 votes)

291 views47 pages

Ds Capstone Presentation

The document outlines a data science capstone project to predict if SpaceX will reuse the first stage of Falcon 9 rockets using machine learning models. It collects data through SpaceX's API and web scraping Wikipedia, then performs exploratory data analysis with visualization and SQL before building classification models to predict first stage landing outcomes. Key steps include data wrangling, interactive visualizations with Folium and Plotly Dash, and evaluating models to determine the best for binary classification. The goal is to help determine the cost of launches by predicting first stage reuse.

Uploaded by

Danish Nazrin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

291 views47 pages

Ds Capstone Presentation

Uploaded by

Danish Nazrin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 47

Data Science Capstone Project

Evgeny Zorin
29.08.2021

Outline

• Executive Summary
• Introduction
• Methodology
• Results
• Conclusion
• Appendix

Executive Summary
Summary of methodologies
- Data collection
- Data wrangling
- Exploratory Data Analysis with Data Visualization
- Exploratory Data Analysis with SQL
- Building an interactive map with Folium
- Building a Dashboard with Plotly Dash
- Predictive analysis (Classi ication)

Summary of all results

- Exploratory Data Analysis results
- Interactive analytics demo in screenshots
- Predictive analysis results
 

Introduction
Project background and context
SpaceX is the most successful company of the commercial space
age, making space travel affordable. The company advertises Falcon
9 rocket launches on its website, with a cost of 62 million dollars;
other providers cost upward of 165 million dollars each, much of the
savings is because SpaceX can reuse the irst stage. Therefore, if we
can determine if the irst stage will land, we can determine the cost
of a launch. Based on public information and machine learning
models, we are going to predict if SpaceX will reuse the irst stage.

Questions to be answered
- How do variables such as payload mass, launch site, number of
lights, and orbits affect the success of the irst stage landing?
- Does the rate of successful landings increase over the years?
- What is the best algorithm that can be used for binary classi ication
in this case?
f
f

f
f
f
f

Methodology
Data collection methodology:
- Using SpaceX Rest API
- Using Web Scrapping from Wikipedia

Performed data wrangling

- Filtering the data
- Dealing with missing values
- Using One Hot Encoding to prepare the data to a binary classi ication

Performed exploratory data analysis (EDA) using visualization and SQL

Performed interactive visual analytics using Folium and Plotly Dash

Performed predictive analysis using classi ication models

- Building, tuning and evaluation of classi ication models to ensure the best
results

f
f
f

Methodology
Data collection
Data collection process involved a combination of API requests from SpaceX REST
API and Web Scraping data from a table in SpaceX’s Wikipedia entry.
We had to use both of these data collection methods in order to get complete
information about the launches for a more detailed analysis.

Data Columns are obtained by using SpaceX REST API:

FlightNumber, Date, BoosterVersion, PayloadMass, Orbit, LaunchSite,
Outcome, Flights, GridFins, Reused, Legs, LandingPad, Block, ReusedCount,
Serial, Longitude, Latitude

Data Columns are obtained by using Wikipedia Web Scraping:

Flight No., Launch site, Payload, PayloadMass, Orbit, Customer, Launch
outcome, Version Booster, Booster landing, Date, Time

Data collection – SpaceX API

Decoding the Requesting needed
Requesting response content information about
Constructing data
rocket launch using .json() and the launches from
we have obtained
data from turning it into a SpaceX API
into a dictionary
SpaceX API dataframe using by applying
.json_normalize() custom functions

Replacing missing
Filtering the
values of Payload
Exporting the data dataframe to only Creating a dataframe
Mass column with
to CSV include Falcon 9 from the dictionary
calculated .mean()
launches
for this column

GitHub URL: Data Collection API

Data collection – Web scraping

Requesting Creating a Extracting
Falcon 9 launch BeautifulSoup object all column names
data from from the HTML from the HTML table
Wikipedia response header

Collecting the data

by parsing
HTML tables

Constructing data
Exporting the data Creating a dataframe
we have obtained
to CSV from the dictionary
into a dictionary

GitHub URL: Data Collection with Web Scraping

Data wrangling
In the data set, there are several different cases where the Perform exploratory Data Analysis
booster did not land successfully. Sometimes a landing was and determine Training Labels
attempted but failed due to an accident; for example, True
Ocean means the mission outcome was successfully landed
Calculate the number of launches
to a speci ic region of the ocean while False Ocean means on each site
the mission outcome was unsuccessfully landed to a speci ic
region of the ocean. True RTLS means the mission outcome Calculate the number and occurrence
was successfully landed to a ground pad False RTLS means of each orbit

the mission outcome was unsuccessfully landed to a ground Calculate the number and occurrence
pad.True ASDS means the mission outcome was successfully of mission outcome per orbit type
landed on a drone ship False ASDS means the mission
Create a landing outcome label
outcome was unsuccessfully landed on a drone ship.
from Outcome column
We mainly convert those outcomes into Training Labels with
Exporting the data
“1” means the booster successfully landed, “0” means it was to CSV
unsuccessful.

GitHub URL: Data Wrangling

f
EDA with data visualization
Charts were plotted:
Flight Number vs. Payload Mass, Flight Number vs. Launch Site, Payload Mass
vs. Launch Site, Orbit Type vs. Success Rate, Flight Number vs. Orbit Type,
Payload Mass vs Orbit Type and Success Rate Yearly Trend

Scatter plots show the relationship between variables. If a relationship exists,

they could be used in machine learning model.
Bar charts show comparisons among discrete categories. The goal is to show the
relationship between the speci ic categories being compared and a measured
value.
Line charts show trends in data over time (time series).

GitHub URL: EDA with Data Visualization

EDA with SQL

Performed SQL queries:
• Displaying the names of the unique launch sites in the space mission
• Displaying 5 records where launch sites begin with the string ‘CCA'
• Displaying the total payload mass carried by boosters launched by NASA (CRS)
• Displaying average payload mass carried by booster version F9 v1.1
• Listing the date when the irst successful landing outcome in ground pad was achieved
• Listing the names of the boosters which have success in drone ship and have payload mass greater than 4000 but
less than 6000
• Listing the total number of successful and failure mission outcomes
• Listing the names of the booster versions which have carried the maximum payload mass
• Listing the failed landing outcomes in drone ship, their booster versions and launch site names for the months in
year 2015
• Ranking the count of landing outcomes (such as Failure (drone ship) or Success (ground pad)) between the date
2010-06-04 and 2017-03-20 in descending order

GitHub URL: EDA with SQL

Build an interactive map with Folium

Markers of all Launch Sites:
- Added Marker with Circle, Popup Label and Text Label of NASA Johnson Space Center using
its latitude and longitude coordinates as a start location.
- Added Markers with Circle, Popup Label and Text Label of all Launch Sites using their latitude
and longitude coordinates to show their geographical locations and proximity to Equator and
coasts.

Coloured Markers of the launch outcomes for each Launch Site:

- Added coloured Markers of success (Green) and failed (Red) launches using Marker Cluster to
identify which launch sites have relatively high success rates.

Distances between a Launch Site to its proximities:

- Added coloured Lines to show distances between the Launch Site KSC LC-39A (as an
example) and its proximities like Railway, Highway, Coastline and Closest City.

GitHub URL: Interactive Visual Analytics with Folium

Build a Dashboard with Plotly Dash

Launch Sites Dropdown List:
- Added a dropdown list to enable Launch Site selection.

Pie Chart showing Success Launches (All Sites/Certain Site):

- Added a pie chart to show the total successful launches count for all sites and the
Success vs. Failed counts for the site, if a speci ic Launch Site was selected.

Slider of Payload Mass Range:

- Added a slider to select Payload range.

Scatter Chart of Payload Mass vs. Success Rate for the di erent Booster Versions:
- Added a scatter chart to show the correlation between Payload and Launch Success.

GitHub URL: SpaceX Dash App

Predictive analysis (Classi ication)

Standardizing the Splitting the data into
Creating a
Creating a NumPy data with training and testing
GridSearchCV object
array from the column StandardScaler, then sets with
with cv = 10 to ind
“Class” in data itting and train_test_split
the best parameters
transforming it function

Finding the method Calculating the Applying

performs best by Examining the accuracy on the test GridSearchCV
examining the confusion matrix data using the on LogReg, SVM,
Jaccard_score and for all models method .score() Decision Tree, and
F1_score metrics for all models KNN models

GitHub URL: Machine Learning Prediction

f
Results

• Exploratory data analysis results

• Interactive analytics demo in

screenshots
• Predictive analysis results

EDA with Visualization

Flight Number vs. Launch Site

Explanation:
• The earliest lights all failed while the latest lights all succeeded.
• The CCAFS SLC 40 launch site has about a half of all launches.
• VAFB SLC 4E and KSC LC 39A have higher success rates.
• It can be assumed that each new launch has a higher rate of success.
f

Payload vs. Launch Site

Explanation:
• For every launch site the higher the payload mass, the higher the success
rate.
• Most of the launches with payload mass over 7000 kg were successful.
• KSC LC 39A has a 100% success rate for payload mass under 5500 kg too.

Success rate vs. Orbit type

Explanation:
• Orbits with 100% success rate:
- ES-L1, GEO, HEO, SSO
• Orbits with 0% success rate:
- SO
• Orbits with success rate
between 50% and 85%:
- GTO, ISS, LEO, MEO, PO

Flight Number vs. Orbit type

Explanation:
• In the LEO orbit the Success appears related to the number of lights;
on the other hand, there seems to be no relationship between light
number when in GTO orbit.

f
f
Payload Mass vs. Orbit type

Explanation:
• Heavy payloads have a negative in luence on GTO orbits and positive
on GTO and Polar LEO (ISS) orbits.

f
Launch success yearly trend

Explanation:
• The success rate
since 2013 kept
increasing till 2020.

EDA with SQL

All launch site names

Explanation:
• Displaying the names of the unique launch sites in the space mission.

Launch site names begin with `CCA`

Explanation:
• Displaying 5 records where launch sites begin with the string 'CCA'.

Total payload mass

Explanation:
• Displaying the total payload mass carried by boosters launched by
NASA (CRS).

Average payload mass by F9 v1.1

Explanation:
• Displaying average payload mass carried by booster version F9 v1.1.

First successful ground landing date

Explanation:
• Listing the date when the irst successful landing outcome in ground
pad was achieved.

f
Successful drone ship landing with payload
between 4000 and 6000

Explanation:
• Listing the names of the boosters which have success in drone ship
and have payload mass greater than 4000 but less than 6000.

Total number of successful and failure

mission outcomes

Explanation:
• Listing the total number of successful and failure mission outcomes.

Boosters carried maximum payload

Explanation:
• Listing the names of the booster versions which have carried the maximum
payload mass.

2015 launch records

Explanation:
• Listing the failed landing outcomes in drone ship, their booster
versions and launch site names for the months in year 2015.

Rank success count between 2010-06-04 and 2017-03-20

Explanation:
• Ranking the count of landing outcomes (such as Failure (drone ship) or Success
(ground pad)) between the date 2010-06-04 and 2017-03-20 in descending order.

Interactive map with Folium

All launch sites’ location markers on a global map
Explanation:
• Most of Launch sites are in proximity to the
Equator line. The land is moving faster at
the equator than any other place on the
surface of the Earth. Anything on the
surface of the Earth at the equator is
already moving at 1670 km/hour. If a ship is
launched from the equator it goes up into
space, and it is also moving around the
Earth at the same speed it was moving
before launching. This is because of inertia.
This speed will help the spacecraft keep up
a good enough speed to stay in orbit.
• All launch sites are in very close proximity
to the coast, while launching rockets
towards the ocean it minimises the risk of
having any debris dropping or exploding
near people.

Colour-labeled launch records on the map

Explanation:
• From the colour-labeled markers
we should be able to easily
identify which launch sites have
relatively high success rates.
- Green Marker = Successful
Launch
- Red Marker = Failed Launch
• Launch Site KSC LC-39A has a
very high Success Rate.

Build a Dashboard with Plotly

Dash
Launch success count for all sites

Explanation:
• The chart clearly shows that from all the sites, KSC LC-39A has the most
successful launches.

Launch site with highest launch success ratio

Explanation:
• KSC LC-39A has the highest launch success rate (76.9%) with 10 successful and
only 3 failed landings.

Payload Mass vs. Launch Outcome for all sites

Explanation:
• The charts show
that payloads
between 2000
and 5500 kg have
the highest
success rate.

Predictive analysis
(Classi ication)
f
Classi ication Accuracy
Explanation: Scores and Accuracy of the Test Set
• Based on the scores of the Test Set,
we can not con irm which method
performs best.
• Same Test Set scores may be due
to the small test sample size (18
samples). Therefore, we tested all
methods based on the whole
Dataset.
Scores and Accuracy of the Entire Data Set
• The scores of the whole Dataset
con irm that the best model is the
Decision Tree Model. This model
has not only higher scores, but also
the highest accuracy.
f

f
Confusion Matrix
Explanation:
• Examining the confusion matrix, we see
that logistic regression can distinguish
between the different classes. We see
that the major problem is false positives.

Conclusion
• Decision Tree Model is the best algorithm for this dataset.

• Launches with a low payload mass show better results

than launches with a larger payload mass.

• Most of launch sites are in proximity to the Equator line

and all the sites are in very close proximity to the coast.

• The success rate of launches increases over the years.

• KSC LC-39A has the highest success rate of the launches

from all the sites.

• Orbits ES-L1, GEO, HEO and SSO have 100% success rate.

Appendix

Special Thanks to:

Instructors
Coursera
IBM

IBM Data Science Capstone
89% (9)
IBM Data Science Capstone
51 pages
Data Science Specialization Capstone Presentation
No ratings yet
Data Science Specialization Capstone Presentation
46 pages
Ds Capstone Template Coursera
No ratings yet
Ds Capstone Template Coursera
49 pages
IBM Data Science Capstone
No ratings yet
IBM Data Science Capstone
51 pages
Capstone Final
100% (1)
Capstone Final
40 pages
Settings: Settings P63X/Uk St/A54 Micom P631, P632, P633, P634
No ratings yet
Settings: Settings P63X/Uk St/A54 Micom P631, P632, P633, P634
108 pages
00 - SpaceX - Final Presentation - JF
100% (1)
00 - SpaceX - Final Presentation - JF
43 pages
DS Capstone Presentation
No ratings yet
DS Capstone Presentation
46 pages
Applied Data Science Capstone - Spacex
No ratings yet
Applied Data Science Capstone - Spacex
49 pages
DS Capstone Presentation
No ratings yet
DS Capstone Presentation
46 pages
Ds Capstone Template Coursera
No ratings yet
Ds Capstone Template Coursera
50 pages
Data Science Capstone Project
No ratings yet
Data Science Capstone Project
21 pages
Service Manual Xerox Wide Format 8850
No ratings yet
Service Manual Xerox Wide Format 8850
407 pages
All Life Bank - AIML - ML - Project - Low - Code - Notebook
No ratings yet
All Life Bank - AIML - ML - Project - Low - Code - Notebook
78 pages
FRA Project Report - Chilla Nagaraju
100% (1)
FRA Project Report - Chilla Nagaraju
66 pages
00 Final Presentation Echeverria
No ratings yet
00 Final Presentation Echeverria
42 pages
Winning Space Race With Data Science
No ratings yet
Winning Space Race With Data Science
46 pages
Machine Learning Project: Name-Rasmita Mallick Date - 5 September 2021
100% (2)
Machine Learning Project: Name-Rasmita Mallick Date - 5 September 2021
47 pages
Midterm Project Report
No ratings yet
Midterm Project Report
39 pages
Honor 6 Plus - Pe-Tl10 QSG - (01, All, Neu, Si, L)
No ratings yet
Honor 6 Plus - Pe-Tl10 QSG - (01, All, Neu, Si, L)
144 pages
FDTD Getting Started Manual
No ratings yet
FDTD Getting Started Manual
63 pages
DS Capstone Powerpoint
No ratings yet
DS Capstone Powerpoint
46 pages
Book of Gamification
No ratings yet
Book of Gamification
98 pages
Mit Data Science Machine Learning Program Brochure
No ratings yet
Mit Data Science Machine Learning Program Brochure
17 pages
8 Dec DSA Roadmap From Beginner To Advanced With A Focus On
No ratings yet
8 Dec DSA Roadmap From Beginner To Advanced With A Focus On
22 pages
Introduction To Computing Lab
100% (1)
Introduction To Computing Lab
31 pages
Number Sequence Customization in Dynamics 365
No ratings yet
Number Sequence Customization in Dynamics 365
2 pages
PYF Project LearnerNotebook LowCode
No ratings yet
PYF Project LearnerNotebook LowCode
6 pages
Customer Churn Prediction
100% (1)
Customer Churn Prediction
32 pages
T Berd Mts 4000 v2 User Manual Manuals User Guides en
No ratings yet
T Berd Mts 4000 v2 User Manual Manuals User Guides en
212 pages
Sukanya Linear LogisticRegression Report
100% (1)
Sukanya Linear LogisticRegression Report
23 pages
Education and Language Policies
No ratings yet
Education and Language Policies
20 pages
Body Shaming
100% (2)
Body Shaming
16 pages
Machine Learning With Real Life Project: by - Rishabh Gaur
100% (2)
Machine Learning With Real Life Project: by - Rishabh Gaur
26 pages
Ml-1-Guided-Bus Report
No ratings yet
Ml-1-Guided-Bus Report
35 pages
Credit Card Default Prediction: Final Project Report
No ratings yet
Credit Card Default Prediction: Final Project Report
28 pages
Titanic Data Analysis
No ratings yet
Titanic Data Analysis
11 pages
LDA 01 Linear Discriminant Analysis
No ratings yet
LDA 01 Linear Discriminant Analysis
65 pages
LOZ - Tears of The Kingdom Dynamic FPS, Static FPS, and Visual Fixes Patch Collection - GBAtemp - Net - The Independent Video Game Community
No ratings yet
LOZ - Tears of The Kingdom Dynamic FPS, Static FPS, and Visual Fixes Patch Collection - GBAtemp - Net - The Independent Video Game Community
10 pages
Machine Learning: Bilal Khan
100% (2)
Machine Learning: Bilal Khan
20 pages
C-Unit 3
No ratings yet
C-Unit 3
16 pages
Project 3 - Build A Logistic Regression Model To Predict Custo Mer Churn in Telecom IndustryV1.0 PDF
100% (1)
Project 3 - Build A Logistic Regression Model To Predict Custo Mer Churn in Telecom IndustryV1.0 PDF
38 pages
ML Use Cases Ebook
100% (2)
ML Use Cases Ebook
53 pages
Akshaya SMDM Project Report
100% (1)
Akshaya SMDM Project Report
18 pages
MockTest 4 W
No ratings yet
MockTest 4 W
3 pages
Military Canteen Automation System: Project-Report
No ratings yet
Military Canteen Automation System: Project-Report
77 pages
Nagareddy 18-Nov-2023
No ratings yet
Nagareddy 18-Nov-2023
20 pages
ALV in ABAP Using OOPS Concept
No ratings yet
ALV in ABAP Using OOPS Concept
11 pages
KofaxTotalAgilityBestPracticesGuide EN
No ratings yet
KofaxTotalAgilityBestPracticesGuide EN
79 pages
Project Data Mining
No ratings yet
Project Data Mining
55 pages
600 Machine Learning DL NLP CV Projects
100% (2)
600 Machine Learning DL NLP CV Projects
16 pages
IBM Data Science Capstone Report
No ratings yet
IBM Data Science Capstone Report
10 pages
SMDM Guided Project Sample Business Report
No ratings yet
SMDM Guided Project Sample Business Report
17 pages
Data Science For Business 2 PDF
No ratings yet
Data Science For Business 2 PDF
40 pages
Learning Spark
27% (11)
Learning Spark
3 pages
Machine Learning GL
No ratings yet
Machine Learning GL
25 pages
Assignment 2 - Network Connectivity - APPLIED SOCIAL NETWORK
No ratings yet
Assignment 2 - Network Connectivity - APPLIED SOCIAL NETWORK
6 pages
Exploratory Data Analysis of Titanic Survival Prediction Using Machine Learning Techniques
No ratings yet
Exploratory Data Analysis of Titanic Survival Prediction Using Machine Learning Techniques
5 pages
X Education - Lead Scoring Case Study
No ratings yet
X Education - Lead Scoring Case Study
24 pages
Recommender Systems Cookbook
No ratings yet
Recommender Systems Cookbook
11 pages
AB Cheatsheet
No ratings yet
AB Cheatsheet
13 pages
Business Report: Advanced Statistics Module Project I
100% (1)
Business Report: Advanced Statistics Module Project I
5 pages
Chapter 5 - Backtracking PDF
No ratings yet
Chapter 5 - Backtracking PDF
10 pages
Iota Bits Mid-II
No ratings yet
Iota Bits Mid-II
18 pages
Business Report Project - Sheetal - SMDM
100% (1)
Business Report Project - Sheetal - SMDM
20 pages
PG Program Dsba Classroom
No ratings yet
PG Program Dsba Classroom
16 pages
Machine Learning Mini-Project Report
No ratings yet
Machine Learning Mini-Project Report
26 pages
Computer Worksheet Grade VII
No ratings yet
Computer Worksheet Grade VII
2 pages
Reality Show Management - TutorialsDuniya
No ratings yet
Reality Show Management - TutorialsDuniya
19 pages
Sample - Customer Churn Prediction Python Documentation
No ratings yet
Sample - Customer Churn Prediction Python Documentation
33 pages
Data Science & Business Analytics: Post Graduate Program in
No ratings yet
Data Science & Business Analytics: Post Graduate Program in
16 pages
Cci & DF Lab Ex - No 1a - 1b
No ratings yet
Cci & DF Lab Ex - No 1a - 1b
10 pages
Data Science & Business Analytics: Post Graduate Program in
No ratings yet
Data Science & Business Analytics: Post Graduate Program in
16 pages
Cut Command in Linux With Examples - Geeksforgeeks
No ratings yet
Cut Command in Linux With Examples - Geeksforgeeks
4 pages
Capstone Project
0% (1)
Capstone Project
6 pages
Osy Report
No ratings yet
Osy Report
16 pages
PGP Dsba Brochure
No ratings yet
PGP Dsba Brochure
12 pages
Documentaion
No ratings yet
Documentaion
19 pages
2nd Unit - 2.2 - Data Analytics
No ratings yet
2nd Unit - 2.2 - Data Analytics
22 pages
Chapter 6 Synchronisation - p1
No ratings yet
Chapter 6 Synchronisation - p1
30 pages
GDS-VFP Interview Guides July 9 2021
No ratings yet
GDS-VFP Interview Guides July 9 2021
6 pages
Present This About A Good Man
No ratings yet
Present This About A Good Man
5 pages
Career Plans For Next 2 Years
No ratings yet
Career Plans For Next 2 Years
11 pages
Five Chinese Brothers
No ratings yet
Five Chinese Brothers
4 pages
Customer Churn Analysis and Prediction
No ratings yet
Customer Churn Analysis and Prediction
4 pages
CV For Sanni Joseph Adeiza Acted
No ratings yet
CV For Sanni Joseph Adeiza Acted
3 pages
L47GUYKXUHYU
No ratings yet
L47GUYKXUHYU
1 page
Use The Book To Help You Answer These Questions. Remember To Answer The Questions in Sentences
No ratings yet
Use The Book To Help You Answer These Questions. Remember To Answer The Questions in Sentences
1 page
Assignment Nov 2019 Lesson Plan
No ratings yet
Assignment Nov 2019 Lesson Plan
1 page
ODI Open Data Stories 2014-03
No ratings yet
ODI Open Data Stories 2014-03
9 pages
CS178 Homework #1: Problem 0: Getting Connected
No ratings yet
CS178 Homework #1: Problem 0: Getting Connected
4 pages
(Skiena, 2017) - Book - The Data Science Design Manual - 2
No ratings yet
(Skiena, 2017) - Book - The Data Science Design Manual - 2
1 page
Software Requirements Specification
No ratings yet
Software Requirements Specification
5 pages
Structural Shape Optimization Using Moving Mesh Method
No ratings yet
Structural Shape Optimization Using Moving Mesh Method
5 pages
Politecnico Di Torino Repository ISTITUZIONALE: Loop Detection in Robotic Navigation Using MPEG CDVS
No ratings yet
Politecnico Di Torino Repository ISTITUZIONALE: Loop Detection in Robotic Navigation Using MPEG CDVS
7 pages
Seaman Resume 1
No ratings yet
Seaman Resume 1
1 page
Web Developer Interview
No ratings yet
Web Developer Interview
3 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Ds Capstone Presentation

Uploaded by

Ds Capstone Presentation

Uploaded by

Data Science Capstone Project

Summary of all results

Performed data wrangling

Performed exploratory data analysis (EDA) using visualization and SQL

Performed interactive visual analytics using Folium and Plotly Dash

Performed predictive analysis using classi ication models

Data Columns are obtained by using SpaceX REST API:

Data Columns are obtained by using Wikipedia Web Scraping:

Data collection – SpaceX API

GitHub URL: Data Collection API

Data collection – Web scraping

Collecting the data

GitHub URL: Data Collection with Web Scraping

GitHub URL: Data Wrangling

Scatter plots show the relationship between variables. If a relationship exists,

GitHub URL: EDA with Data Visualization

EDA with SQL

GitHub URL: EDA with SQL

Build an interactive map with Folium

Coloured Markers of the launch outcomes for each Launch Site:

Distances between a Launch Site to its proximities:

GitHub URL: Interactive Visual Analytics with Folium

Build a Dashboard with Plotly Dash

Pie Chart showing Success Launches (All Sites/Certain Site):

Slider of Payload Mass Range:

GitHub URL: SpaceX Dash App

Predictive analysis (Classi ication)

Finding the method Calculating the Applying

GitHub URL: Machine Learning Prediction

• Exploratory data analysis results

EDA with Visualization

Payload vs. Launch Site

Success rate vs. Orbit type

Flight Number vs. Orbit type

EDA with SQL

Launch site names begin with `CCA`

Total payload mass

Average payload mass by F9 v1.1

First successful ground landing date

Total number of successful and failure

Boosters carried maximum payload

2015 launch records

Rank success count between 2010-06-04 and 2017-03-20

Interactive map with Folium

Colour-labeled launch records on the map

Distance from the launch site

Build a Dashboard with Plotly

Launch site with highest launch success ratio

Payload Mass vs. Launch Outcome for all sites

• Launches with a low payload mass show better results

• Most of launch sites are in proximity to the Equator line

• The success rate of launches increases over the years.

• KSC LC-39A has the highest success rate of the launches

Special Thanks to:

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.