0% found this document useful (0 votes)

9 views4 pages

Task 7

The document outlines the development of a Linear Regression Model in Python to predict car stopping distances based on speed using a provided dataset. It details the procedure including data visualization, model training, and evaluation metrics such as RMSE and R² score. The results indicate a highly accurate model with an R² of 1.00 and an RMSE of 1.59, suggesting a perfect fit and minimal prediction error.

Uploaded by

John Mesia Dhas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views4 pages

Task 7

Uploaded by

John Mesia Dhas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Task 7: Build a linear regression model to predict that stopping distances of cars on the basis

of the speed.
Tools: RStudio, Python

Problem Statement

Develop a Linear Regression Model to predict the stopping distance of a car based on its
speed using Python. The model should analyze the relationship between speed and stopping
distance and evaluate performance using RMSE and R² score.

Aim

To implement and evaluate a Simple Linear Regression Model in Python that predicts the
stopping distance of a car based on its speed using the cars dataset.

Procedure

1. Import Required Libraries

2. Load the Dataset (Use cars dataset)
3. Visualize the Relationship (Scatter Plot)
4. Split the Data (Train-Test Split)
5. Train the Linear Regression Model
6. Evaluate the Model (R² Score & RMSE)
7. Make Predictions & Plot Regression Line

Sample Dataset (cars dataset)

Speed (mph) Stopping Distance (ft)

4 2
7 10
8 4
9 22
10 16
15 26
20 34
25 48
30 60
35 76

Python Program

# Import Required Libraries

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score

# Load the dataset

from seaborn import load_dataset
cars = pd.DataFrame({'speed': [4, 7, 8, 9, 10, 15, 20, 25, 30, 35],
'dist': [2, 10, 4, 22, 16, 26, 34, 48, 60, 76]})

# Data Visualization
plt.scatter(cars['speed'], cars['dist'], color='blue')
plt.xlabel('Speed (mph)')
plt.ylabel('Stopping Distance (ft)')
plt.title('Speed vs Stopping Distance')
plt.show()

# Split dataset into training (80%) and testing (20%)

X = cars[['speed']]
y = cars['dist']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Train the Linear Regression Model

model = LinearRegression()
model.fit(X_train, y_train)

# Model Evaluation
y_pred = model.predict(X_test)
rmse = np.sqrt(mean_squared_error(y_test, y_pred))
r2 = r2_score(y_test, y_pred)

print(f"RMSE: {rmse:.2f}")
print(f"R-squared: {r2:.2f}")

# Plot Regression Line

plt.scatter(X_train, y_train, color='blue', label='Actual Data')
plt.plot(X_train, model.predict(X_train), color='red', linewidth=2, label='Regression Line')
plt.xlabel('Speed (mph)')
plt.ylabel('Stopping Distance (ft)')
plt.title('Linear Regression Model')
plt.legend()
plt.show()

Output

Model Summary

RMSE: 5.82
R-squared: 0.89

Regression Line Plot

Interpretation of Linear Regression Results

The model's evaluation metrics indicate exceptional performance with:

• Root Mean Squared Error (RMSE) = 1.59

• R-squared (R²) = 1.00

Let’s interpret these results in detail:

1. Interpretation of R-squared (R² = 1.00)

• Definition: R² measures how well the independent variable (speed) explains the
variability in the dependent variable (stopping distance).
• Value of 1.00: This means 100% of the variation in stopping distance is
perfectly explained by speed.
• Implication:
o A perfect R² score suggests a perfect fit, which is highly unusual in real-
world scenarios.
o This might indicate overfitting or that the dataset follows a perfect linear
relationship with no noise or measurement errors.

2. Interpretation of RMSE (1.59)

• Definition: RMSE measures the average prediction error in the same unit as the
dependent variable (stopping distance in feet).
• Value of 1.59: On average, the model’s predictions deviate from the actual
stopping distances by approximately 1.59 feet.
• Implication:
o A very low RMSE indicates that the model's predictions are highly
accurate.
o Given the perfect R², this suggests an almost error-free prediction model.

Result

• The Linear Regression Model successfully predicts the stopping distance based on
speed.

Genetics II Quiz
0% (2)
Genetics II Quiz
8 pages
SOA Exam SRM - ASM Learning Flashcards
No ratings yet
SOA Exam SRM - ASM Learning Flashcards
26 pages
Assignment Report - Predictive Modelling - Rahul Dubey
No ratings yet
Assignment Report - Predictive Modelling - Rahul Dubey
18 pages
Capstone Proect Notes 2
100% (2)
Capstone Proect Notes 2
16 pages
Untitled Document
No ratings yet
Untitled Document
6 pages
Mindanao State University General Santos City: Simple Linear Regression
No ratings yet
Mindanao State University General Santos City: Simple Linear Regression
12 pages
INSY446 - 02 - Linear Model Part 1
No ratings yet
INSY446 - 02 - Linear Model Part 1
27 pages
Experiment 7a and 7b
No ratings yet
Experiment 7a and 7b
3 pages
Multi Regression
No ratings yet
Multi Regression
12 pages
Econometrics Project Report: Linear Regression Analysis On Mileage of Heavy Trucks and Their Fuel Consumption
No ratings yet
Econometrics Project Report: Linear Regression Analysis On Mileage of Heavy Trucks and Their Fuel Consumption
12 pages
Predictive Modeling-Handouts
No ratings yet
Predictive Modeling-Handouts
11 pages
Graded Homework 1 Solutions
No ratings yet
Graded Homework 1 Solutions
19 pages
Exercises 2 Unfinished
No ratings yet
Exercises 2 Unfinished
8 pages
Linear Regression On Car Dataset
No ratings yet
Linear Regression On Car Dataset
2 pages
hw16 109090023
No ratings yet
hw16 109090023
22 pages
Exercises D'application Regression Analysis
No ratings yet
Exercises D'application Regression Analysis
4 pages
Iml 51
No ratings yet
Iml 51
10 pages
DSEnd
No ratings yet
DSEnd
30 pages
Supervised Regression Notes
No ratings yet
Supervised Regression Notes
11 pages
19BCS2059 DL1
No ratings yet
19BCS2059 DL1
4 pages
Import As From Import From Import From Import: R'creditcard - CSV' 'Time' 'Time'
No ratings yet
Import As From Import From Import From Import: R'creditcard - CSV' 'Time' 'Time'
3 pages
Linear Regression
100% (1)
Linear Regression
16 pages
Multilinear ProblemStatement
No ratings yet
Multilinear ProblemStatement
132 pages
Practical # 10
No ratings yet
Practical # 10
5 pages
Simple Linear Regression With Jupyter Notebook: Dr. Alvin Ang
No ratings yet
Simple Linear Regression With Jupyter Notebook: Dr. Alvin Ang
16 pages
Predictive Model: Submitted by
100% (3)
Predictive Model: Submitted by
27 pages
Lecture Notes - Linear Regression
No ratings yet
Lecture Notes - Linear Regression
26 pages
6 - Classification and Regression Tasks
No ratings yet
6 - Classification and Regression Tasks
115 pages
LR
No ratings yet
LR
2 pages
Dav Exp
No ratings yet
Dav Exp
11 pages
Assignment 5
No ratings yet
Assignment 5
3 pages
Stat 4104 (Part B)
No ratings yet
Stat 4104 (Part B)
1 page
Multiple Linear Regression
No ratings yet
Multiple Linear Regression
5 pages
Unit 5
No ratings yet
Unit 5
171 pages
Mohammed Tayab Khan 24 Dec 2021
No ratings yet
Mohammed Tayab Khan 24 Dec 2021
16 pages
20BCE1205 Lab3
No ratings yet
20BCE1205 Lab3
9 pages
LinearRegression HandsOn
No ratings yet
LinearRegression HandsOn
3 pages
ML Assignment 2
No ratings yet
ML Assignment 2
3 pages
Fall 2023-2024 IE 451 Homework 3 Solutions
No ratings yet
Fall 2023-2024 IE 451 Homework 3 Solutions
15 pages
Name: Chinmay Tripurwar Roll No: 22b3902: Simple Regression Model Analysis
No ratings yet
Name: Chinmay Tripurwar Roll No: 22b3902: Simple Regression Model Analysis
9 pages
Article Module 4
No ratings yet
Article Module 4
8 pages
SiddharthShah 1032221195 DivC 50 DL LabAssignment2
No ratings yet
SiddharthShah 1032221195 DivC 50 DL LabAssignment2
7 pages
Lab 6
No ratings yet
Lab 6
2 pages
Lecture 7
No ratings yet
Lecture 7
29 pages
LAB5 Regularization
No ratings yet
LAB5 Regularization
6 pages
Scatter Chart Excel
No ratings yet
Scatter Chart Excel
3 pages
Classification & Regression BDMDM Print
No ratings yet
Classification & Regression BDMDM Print
5 pages
DS Exp6
No ratings yet
DS Exp6
5 pages
Lecture3 Supervised Learning I
No ratings yet
Lecture3 Supervised Learning I
84 pages
Artificial Intelligence Semester Project: Topic: Car Mileage Predictor Presented by Abdullah Farooq
No ratings yet
Artificial Intelligence Semester Project: Topic: Car Mileage Predictor Presented by Abdullah Farooq
17 pages
Supervised Learning For Data Science...
No ratings yet
Supervised Learning For Data Science...
14 pages
STAT 22400 Autumn 2024 Homework 9: L15.pdf L16 PDF
No ratings yet
STAT 22400 Autumn 2024 Homework 9: L15.pdf L16 PDF
3 pages
Regression Questionnaire
No ratings yet
Regression Questionnaire
10 pages
Experiment 9
No ratings yet
Experiment 9
3 pages
Session7 LinearRegression
No ratings yet
Session7 LinearRegression
52 pages
Liner Regression Chapter N1
No ratings yet
Liner Regression Chapter N1
1 page
Lecture 3
No ratings yet
Lecture 3
90 pages
Aiml Code and Output - Team 1
No ratings yet
Aiml Code and Output - Team 1
6 pages
In Class Exercise Linear Regression in R
No ratings yet
In Class Exercise Linear Regression in R
6 pages
PGM 7
No ratings yet
PGM 7
3 pages
Wa0002.
No ratings yet
Wa0002.
5 pages
Projects With Microcontrollers And PICC
From Everand
Projects With Microcontrollers And PICC
Guillermo Perez Guillen
5/5 (1)
Unit 1 For AI Techniques
No ratings yet
Unit 1 For AI Techniques
100 pages
Task 8
No ratings yet
Task 8
2 pages
Task 4
No ratings yet
Task 4
5 pages
Task 2
No ratings yet
Task 2
4 pages
Ijaeast 0003052024
No ratings yet
Ijaeast 0003052024
7 pages
Metrics 2019 Lec3
No ratings yet
Metrics 2019 Lec3
59 pages
Prac3 - Variable Selection
No ratings yet
Prac3 - Variable Selection
6 pages
Nutrition Services Screening Assessment (NSSA) Sebagai
No ratings yet
Nutrition Services Screening Assessment (NSSA) Sebagai
8 pages
USING DUMMY VARIABLES IN THE EVENT METHODOLOGY Imre Karafiath
No ratings yet
USING DUMMY VARIABLES IN THE EVENT METHODOLOGY Imre Karafiath
7 pages
Gas Mixtures: Çengel Boles
No ratings yet
Gas Mixtures: Çengel Boles
18 pages
All 1314 Chap10-Grand Canonical Ensemble
No ratings yet
All 1314 Chap10-Grand Canonical Ensemble
9 pages
Chapter 3 Multiple Linear Regression - We Use This One
No ratings yet
Chapter 3 Multiple Linear Regression - We Use This One
6 pages
OPM 501 Assignment 1
No ratings yet
OPM 501 Assignment 1
16 pages
Unit 4 - Spurious Regression and Cointegration
No ratings yet
Unit 4 - Spurious Regression and Cointegration
25 pages
Chapter 4 Focasting Demand
No ratings yet
Chapter 4 Focasting Demand
94 pages
A Simulation Study On Some Restricted Ridge Regression Estimators
No ratings yet
A Simulation Study On Some Restricted Ridge Regression Estimators
22 pages
Dr. Pedro Julio Villegas Aguilar
0% (1)
Dr. Pedro Julio Villegas Aguilar
48 pages
Problems On Statistical Physics Final
No ratings yet
Problems On Statistical Physics Final
4 pages
Ridge Regression
No ratings yet
Ridge Regression
24 pages
For Gold 3-15 Min
100% (1)
For Gold 3-15 Min
5 pages
Kinetic Theory of Gases - 152 - Download
No ratings yet
Kinetic Theory of Gases - 152 - Download
24 pages
PDB Unit Root Tes Level-1
No ratings yet
PDB Unit Root Tes Level-1
9 pages
Prg7a - Jupyter Notebook
No ratings yet
Prg7a - Jupyter Notebook
12 pages
Numerical Simulation in Statistical Physics
100% (1)
Numerical Simulation in Statistical Physics
201 pages
Untitled
No ratings yet
Untitled
8 pages
Pacciani - Statistical Mechanics
100% (1)
Pacciani - Statistical Mechanics
243 pages
Stata
No ratings yet
Stata
5 pages
Multicollinearity and Oaxaca - Tutorial
No ratings yet
Multicollinearity and Oaxaca - Tutorial
35 pages
Chap 5 Two Variable Regression Interval Estimation and Hypothesis Testing
100% (1)
Chap 5 Two Variable Regression Interval Estimation and Hypothesis Testing
46 pages
Chapter+3+ ++Regression+Algorithms
No ratings yet
Chapter+3+ ++Regression+Algorithms
22 pages
ECON3334 Midterm Fall2022 Question
No ratings yet
ECON3334 Midterm Fall2022 Question
7 pages
Course Outline Econometrics-I 2022-23
No ratings yet
Course Outline Econometrics-I 2022-23
4 pages
Regression With Categorical Variables
No ratings yet
Regression With Categorical Variables
4 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Task 7

Uploaded by

Task 7

Uploaded by

Task 7: Build a linear regression model to predict that stopping distances of cars on the basis

1. Import Required Libraries

Sample Dataset (cars dataset)

Speed (mph) Stopping Distance (ft)

# Import Required Libraries

# Load the dataset

# Split dataset into training (80%) and testing (20%)

# Train the Linear Regression Model

# Plot Regression Line

Regression Line Plot

The model's evaluation metrics indicate exceptional performance with:

• Root Mean Squared Error (RMSE) = 1.59

Let’s interpret these results in detail:

1. Interpretation of R-squared (R² = 1.00)

2. Interpretation of RMSE (1.59)

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.