0% found this document useful (0 votes)

4 views7 pages

Chapter 5 - Regression

Chapter 5 discusses regression analysis, focusing on how one variable can predict or explain another. It introduces key concepts such as response and explanatory variables, the regression line, residuals, and the coefficient of determination. The chapter includes examples and calculations to illustrate these concepts in practical scenarios.

Uploaded by

jaylabee15

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views7 pages

Chapter 5 - Regression

Uploaded by

jaylabee15

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Chapter 5 – Regression

(Copyrighted – may not be repurposed, posted, or disseminated anywhere else.)

Chapter 4 – Scatterplots and Correlation

 Scatterplot – a graph that displays the relationship (or the association) between 2
quantitative variables.
 We describe a scatterplot (or a relationship) by giving the form, direction, strength, and
anything unusual (e.g., outliers).
 𝑟 = correlation coefficient – measures the strength and direction of a linear relationship
between 2 quantitative variables.

Chapter 5 – Regression

Sometimes we want to do more than measure the strength of a relationship; we want to use one
variable to help predict (or explain) the other.

Response variable
 The variable that measures the outcome of a study or the variable that we would like to
explain or predict.
 Plotted on the y axis.

Explanatory variable
 The variable that may help to explain, predict, or influence changes in the response
variable.
 Plotted on the x-axis

Example – For each of the following identify the response and explanatory variables.
(a) Student volunteers at a university drank different numbers of cans of beer. Thirty
minutes later, a police officer measured their blood alcohol content.

% of alcohol in the blood = Number of beers consumed =

(b) The results of the National Student Loan Survey includes data on the amount of debt of
recent graduates, their current income, and how stressed they feel about college debt.

Amount of debt = Current income =

How stressed they feel about their debt = __________

Grade on the final exam = __________

(d) Calories burned per week = __________

Number of hours per week spent exercising = __________

1
Example – Researchers wonder if a person’s waist (which is easy to measure) could be a good
predictor of their body fat percentage (which might be harder to measure). To investigate the
relationship, researchers measure the waists of 10 male subjects and then immerse them in water
to accurately measure their body fat percent. The data is below.

Person 1 2 3 4 5 6 7 8 9 10
Waist (inches) 32 33 33 34 36 38 39 41 41 44
Body Fat (%) 6 6 10 12 16 21 22 27 32 33

(a) Identify the response and explanatory variables.

(b) Construct a scatterplot of the data. Describe the nature of the relationship, i.e., give the
form, direction, and strength of the relationship.

Regression Line
 The line that best ∗ fits the scatterplot (or best models the relationship between 𝑥 and 𝑦.
(* What is meant by best? See the last page of these notes.)

 The equation is 𝑦̂ = 𝑎 + 𝑏𝑥 where

𝑠𝑦
o 𝑏 = =slope of the line = 𝑟 ⋅
𝑠𝑥
o 𝑎 = y-intercept of the line = 𝑦̅ − (𝑏 ⋅ 𝑥̅ )
o 𝑦̂ = predicted y value

2
Example Continued

Mean Std. Dev.

𝑥 = waist 37.1 4.122
𝑦 = body fat 18.5 10.091
𝑟 = correlation 0.982

(d) Graph the regression line on your scatterplot. How well do you think the line fits the data
(or models the relationship between 𝑥 and 𝑦)?

(e) In the context of this problem give an interpretation of the slope.

Explain the meaning of the y-intercept.

(f) Predict the body fat percent for people having waist measurements of 33, 40, and 50
inches.

3
(g) Would you trust the accuracy of all 3 predictions made in Part (f)? Why or why not?

Residual
 The vertical distance that a point is from the regression line.
 A residual represents the prediction error between the observed (or actual) 𝑦 value and
the predicted 𝑦 value.
 When a point is
o above the regression line it has a positive residual. The line under-predicts the
actual y value.
o below the regression line it has a negative residual. The line over-predicts the
actual y value.
 Residual = (Observed 𝑦 value) – (Predicted 𝑦 value) = 𝑦 − 𝑦̂

Example Continued

(h) Calculate the residuals for Person #2 and Person #9. Is the regression line over-
predicting or under-predicting the actual body fat % in these two cases?

4
Coefficient of Determination
Question – Why is there variation among the 𝑦 values, i.e., why do different people have
different body fat %?
There could be many reasons, but we group them into two major categories.
 𝑦 values vary because body fat % is related to waist.
(The larger a person’s waist the larger their body fat % will be.)
 𝑦 values vary because body fat % is related to other factors.
(This could explain why person 2 and person 3 have the same waist but different
body fat %. Similarly for person 8 and person 9.)
Statisticians measure the percent of variation in the 𝑦 values that is due to each category.

 𝑟2 =
o the coefficient of determination
o the percent of the variation in the 𝑦 values that is due to the relationship with 𝑥
o the percent of the variation in the 𝑦 values that can be explained by the
regression model

 1 − 𝑟 2 = the percent of the variation in the 𝑦 values that remains unexplained

Example Continued

(i) What percent of the variation in the body fat % can be explained by the regression
model? What percent remains unexplained?

Notes
 0 ≤ 𝑟2 ≤ 1
 If our model (that uses 𝑥 to predict 𝑦) is a good model, then “most” of the variation in the
𝑦 values should be explained by 𝑥, i.e., 𝑟 2 should be “close” to 1.
 The closer 𝑟 2 is to 1.0, the better the regression line is for modeling the relationship
between 𝑥 and 𝑦 and the better it is for predicting 𝑦 by using 𝑥.
5
Example – As soon as a bottle of soda is opened, it begins to lose its carbonation. Fourteen 12-
ounce bottles of cola were obtained, and each was assigned a randomly selected time period (in
hours). Each bottle was opened and allowed to stand at room temperature. The carbonation (y)
in each bottle was measured after the prescribed time period (x). Summaries of the data appear
below.
Mean Std. Dev.
Time 0.614 0.390
Carbonation 2.671 0.891
Correlation –0.744

(a) Identify the response and explanatory variables.

(b) Calculate the equation of the least squares regression line.

(d) Predict the carbonation after 1 hour and 15 minutes.

(e) After sitting for 2 hours, one particular bottle had an actual carbonation of 0.300.
Calculate the residual for the bottle. Would the regression line under-predict or over-
predict the actual amount of carbonation for the bottle?

(f) What percent of the variation in the y values can be explained by the regression model?

6
Question: How do we determine whether one line fits a scatterplot better than another?

Answer:
 We use the method of least squares to assign a rating (SSE) to each line.

 The line that achieves the best rating (i.e., the smallest SSE) is the regression line.

 Residual = distance (measured

vertically) that a point is from a line

 Points above the line have positive

residuals; points below the line have
negative residuals.

 SSE = sum of squared errors

=  (residual 2 )

 The line achieving the smallest SSE is

the best line. This is the regression line.

Correlation_Linear_Logistic Regression
No ratings yet
Correlation_Linear_Logistic Regression
123 pages
Amazonas Iul Aug 2018
No ratings yet
Amazonas Iul Aug 2018
100 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
20 pages
Share MBBS Lecture 5 (1)-1
No ratings yet
Share MBBS Lecture 5 (1)-1
40 pages
Notes Scatter Plots
No ratings yet
Notes Scatter Plots
39 pages
Predective Analytics or Inferential Statistics
No ratings yet
Predective Analytics or Inferential Statistics
27 pages
Session_19&20
No ratings yet
Session_19&20
54 pages
Simple Linear Regression
100% (1)
Simple Linear Regression
50 pages
Describing Bivariate Numerical Data - Honors 281
No ratings yet
Describing Bivariate Numerical Data - Honors 281
34 pages
Looking at Data: Relationships: Least-Squares Regression
No ratings yet
Looking at Data: Relationships: Least-Squares Regression
23 pages
Practical Biostatistics BMB-308: Torial Port and Presentation
No ratings yet
Practical Biostatistics BMB-308: Torial Port and Presentation
28 pages
Linear Regression and Correlation
No ratings yet
Linear Regression and Correlation
35 pages
Lecture 8 Correlation and Linear Regression
No ratings yet
Lecture 8 Correlation and Linear Regression
66 pages
Stats10_Chapter+4 2
No ratings yet
Stats10_Chapter+4 2
54 pages
Introduction of Regression
No ratings yet
Introduction of Regression
57 pages
R-programming - Unit 5
No ratings yet
R-programming - Unit 5
43 pages
Correlation Regression Tutorial
No ratings yet
Correlation Regression Tutorial
42 pages
Unit 2 - Scatterplots Correlation and Regression Summer 2021
No ratings yet
Unit 2 - Scatterplots Correlation and Regression Summer 2021
43 pages
05 Class RegressionCorrelation
No ratings yet
05 Class RegressionCorrelation
57 pages
Chapter 3 Describing Relationships
No ratings yet
Chapter 3 Describing Relationships
39 pages
Scatter Plot/Diagram Simple Linear Regression Model
No ratings yet
Scatter Plot/Diagram Simple Linear Regression Model
43 pages
Regression2024 MBA
No ratings yet
Regression2024 MBA
25 pages
Chapter 3: Describing Relationships: Section 3.2
No ratings yet
Chapter 3: Describing Relationships: Section 3.2
23 pages
Lecture 4 Linear Regression
No ratings yet
Lecture 4 Linear Regression
75 pages
03 - Simple Linear Regression
No ratings yet
03 - Simple Linear Regression
13 pages
Simple Linear Regression and Correlation
No ratings yet
Simple Linear Regression and Correlation
32 pages
Correlation Simple Regression
No ratings yet
Correlation Simple Regression
26 pages
Part 2 Exploring Relationships Among Variables
No ratings yet
Part 2 Exploring Relationships Among Variables
8 pages
Topics: Regression
No ratings yet
Topics: Regression
26 pages
4_5870483869949498067
No ratings yet
4_5870483869949498067
3 pages
5 Chapter Fi
No ratings yet
5 Chapter Fi
29 pages
LP-III Lab Manual
No ratings yet
LP-III Lab Manual
49 pages
Correlation and Linear Regression
No ratings yet
Correlation and Linear Regression
46 pages
Regression 1.2 Regression Analysis 1.2.1 Introduction To Regression Analysis
No ratings yet
Regression 1.2 Regression Analysis 1.2.1 Introduction To Regression Analysis
9 pages
Regression&Corr&Annova
No ratings yet
Regression&Corr&Annova
71 pages
Tagalog English Dictionary
90% (10)
Tagalog English Dictionary
115 pages
Bal Jagat English Boarding School
No ratings yet
Bal Jagat English Boarding School
31 pages
STAR Rando Questions Stats
No ratings yet
STAR Rando Questions Stats
14 pages
Chapter 5 - Eng
No ratings yet
Chapter 5 - Eng
20 pages
6 Continuous Data Analysis
No ratings yet
6 Continuous Data Analysis
49 pages
OceanofPDF.com Pierogi Over 50 Recipes - Zuza Zak
No ratings yet
OceanofPDF.com Pierogi Over 50 Recipes - Zuza Zak
315 pages
Statistical Analysis: Linear Regression
No ratings yet
Statistical Analysis: Linear Regression
36 pages
PROBLEMS ch05
No ratings yet
PROBLEMS ch05
117 pages
Ch 4- Correlation and Regression YARA&LAMA
No ratings yet
Ch 4- Correlation and Regression YARA&LAMA
27 pages
R R y X y Y: Imple Inear Egression
No ratings yet
R R y X y Y: Imple Inear Egression
1 page
Regression
No ratings yet
Regression
3 pages
Corr_Regression Analysis
No ratings yet
Corr_Regression Analysis
19 pages
@regression
No ratings yet
@regression
33 pages
Lecture8 4
No ratings yet
Lecture8 4
29 pages
MAP 716 Lecture 4 Simple Linear Regression
No ratings yet
MAP 716 Lecture 4 Simple Linear Regression
23 pages
Stats101A - Chapter 1
No ratings yet
Stats101A - Chapter 1
25 pages
Mini Paper Bahasa Inggris
No ratings yet
Mini Paper Bahasa Inggris
13 pages
ML Assignment No. 1: 1.1 Title
No ratings yet
ML Assignment No. 1: 1.1 Title
8 pages
Prediction Is A Key Task of Statistics
No ratings yet
Prediction Is A Key Task of Statistics
18 pages
Chapter 6 Student
No ratings yet
Chapter 6 Student
21 pages
Regression Models - Follow
No ratings yet
Regression Models - Follow
7 pages
Statistics For Business STAT130: Unit 8: Correlation and Regression Analysis
No ratings yet
Statistics For Business STAT130: Unit 8: Correlation and Regression Analysis
56 pages
Burger KIng
No ratings yet
Burger KIng
13 pages
Chapter 4 Regression
No ratings yet
Chapter 4 Regression
38 pages
Statistics: Introduction To Regression
No ratings yet
Statistics: Introduction To Regression
14 pages
WWW Tunwalai Com Chapter 289863-En
No ratings yet
WWW Tunwalai Com Chapter 289863-En
36 pages
Regression
No ratings yet
Regression
24 pages
Ylang Ylang Essential Oil
No ratings yet
Ylang Ylang Essential Oil
3 pages
Chapter 8
No ratings yet
Chapter 8
8 pages
Regression Analysis - VCE Further Mathematics
No ratings yet
Regression Analysis - VCE Further Mathematics
5 pages
Capstone
No ratings yet
Capstone
12 pages
biostat lecture note 3
No ratings yet
biostat lecture note 3
5 pages
Swedish Spices and Herbs Importers
67% (3)
Swedish Spices and Herbs Importers
17 pages
Challenger Eu Tractor Mt845e Mt875e Agcc08 5 e 1001 Operator Manual
No ratings yet
Challenger Eu Tractor Mt845e Mt875e Agcc08 5 e 1001 Operator Manual
23 pages
Few Left Olive A Global History One-Click Download
No ratings yet
Few Left Olive A Global History One-Click Download
17 pages
Ramdan Prayer Time Salafi
No ratings yet
Ramdan Prayer Time Salafi
32 pages
1000 Basic English Words 4 PDF
No ratings yet
1000 Basic English Words 4 PDF
9 pages
Lang Belta Cheat Sheet by Iro
No ratings yet
Lang Belta Cheat Sheet by Iro
2 pages
LKPD English Expressing Tastes Yuni Rugianti, S.pd. 20240816 162144 0000
No ratings yet
LKPD English Expressing Tastes Yuni Rugianti, S.pd. 20240816 162144 0000
7 pages
Amrit Dhara Darlaghat
No ratings yet
Amrit Dhara Darlaghat
9 pages
Why Is The Cloud Kitchen So Successful
No ratings yet
Why Is The Cloud Kitchen So Successful
4 pages
Penicillin: Weird RPG Zine Issue 1 F A L L 2 0 1 9
No ratings yet
Penicillin: Weird RPG Zine Issue 1 F A L L 2 0 1 9
16 pages
1 Appendix 1 /jjj2048
No ratings yet
1 Appendix 1 /jjj2048
16 pages
Vinegar Report 2021
No ratings yet
Vinegar Report 2021
15 pages
1 English Level Test Elementary A1
No ratings yet
1 English Level Test Elementary A1
5 pages
Lesson 1 FREE TIME
No ratings yet
Lesson 1 FREE TIME
4 pages
Parle Esti SalesAnalysis
No ratings yet
Parle Esti SalesAnalysis
3 pages
Fidela Marie Bolongaita Detailed Lesson Plan English 3
No ratings yet
Fidela Marie Bolongaita Detailed Lesson Plan English 3
10 pages
What School Lunch Looks Like Around The World
No ratings yet
What School Lunch Looks Like Around The World
2 pages
V. Action Research Workplan and Timelines
No ratings yet
V. Action Research Workplan and Timelines
2 pages
Unit 5: Exercise 1: Match The Jobs With The Pictures. Use The Words in The Box
No ratings yet
Unit 5: Exercise 1: Match The Jobs With The Pictures. Use The Words in The Box
11 pages
6 Summer Drinks Recipes - Fruit Drinks - Easy Refreshing Drinks - Summer Fruit Juice - Hebbar's Kitchen
No ratings yet
6 Summer Drinks Recipes - Fruit Drinks - Easy Refreshing Drinks - Summer Fruit Juice - Hebbar's Kitchen
3 pages
Subject Object Pronouns: Grammar Worksheet
0% (1)
Subject Object Pronouns: Grammar Worksheet
2 pages
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Painless Pre-Algebra
From Everand
Painless Pre-Algebra
Barron's Educational Series
3/5 (2)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Chapter 5 - Regression

Uploaded by

Chapter 5 - Regression

Uploaded by

Chapter 5 – Regression

(Copyrighted – may not be repurposed, posted, or disseminated anywhere else.)

Chapter 4 – Scatterplots and Correlation

% of alcohol in the blood = Number of beers consumed =

Amount of debt = Current income =

How stressed they feel about their debt = __________

Grade on the final exam = __________

(d) Calories burned per week = __________

Number of hours per week spent exercising = __________

(a) Identify the response and explanatory variables.

 The equation is 𝑦̂ = 𝑎 + 𝑏𝑥 where

Mean Std. Dev.

(e) In the context of this problem give an interpretation of the slope.

 1 − 𝑟 2 = the percent of the variation in the 𝑦 values that remains unexplained

(a) Identify the response and explanatory variables.

(b) Calculate the equation of the least squares regression line.

(d) Predict the carbonation after 1 hour and 15 minutes.

 Residual = distance (measured

 Points above the line have positive

 SSE = sum of squared errors

 The line achieving the smallest SSE is

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Chapter 5 - Regression

Uploaded by

Chapter 5 - Regression

Uploaded by

Chapter 5 – Regression

(Copyrighted – may not be repurposed, posted, or disseminated anywhere else.)

Chapter 4 – Scatterplots and Correlation

% of alcohol in the blood = __________ Number of beers consumed = __________

Amount of debt = __________ Current income = __________

How stressed they feel about their debt = __________

Grade on the final exam = __________

(d) Calories burned per week = __________

Number of hours per week spent exercising = __________

(a) Identify the response and explanatory variables.

 The equation is 𝑦̂ = 𝑎 + 𝑏𝑥 where

Mean Std. Dev.

(e) In the context of this problem give an interpretation of the slope.

 1 − 𝑟 2 = the percent of the variation in the 𝑦 values that remains unexplained

(a) Identify the response and explanatory variables.

(b) Calculate the equation of the least squares regression line.

(d) Predict the carbonation after 1 hour and 15 minutes.

 Residual = distance (measured

 Points above the line have positive

 SSE = sum of squared errors

 The line achieving the smallest SSE is

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

% of alcohol in the blood = Number of beers consumed =

Amount of debt = Current income =