0% found this document useful (0 votes)
14 views35 pages

Ch. 9.1, 9.2

Chapter 9 covers correlation and regression, detailing the relationship between two variables, how to calculate correlation coefficients, and the distinction between correlation and causation. It introduces linear regression, explaining how to find the regression line and predict values based on it. Additionally, the chapter discusses multiple regression and the significance of residuals in regression analysis.

Uploaded by

Nabil Alrefaai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views35 pages

Ch. 9.1, 9.2

Chapter 9 covers correlation and regression, detailing the relationship between two variables, how to calculate correlation coefficients, and the distinction between correlation and causation. It introduces linear regression, explaining how to find the regression line and predict values based on it. Additionally, the chapter discusses multiple regression and the significance of residuals in regression analysis.

Uploaded by

Nabil Alrefaai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 35

Chapter 9

Correlation and
Regression

Copyright 2019, 2015, 2012, Pearson Education, Inc. 1


Chapter Outline
• 9.1 Correlation
• 9.2 Linear Regression
• 9.3 Measures of Regression and Prediction Intervals
• 9.4 Multiple Regression

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 2


Section 9.1

Correlation

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 3


Section 9.1 Objectives
• An introduction to linear correlation, independent and
dependent variables, and the types of correlation
• How to find a correlation coefficient
• How to distinguish between correlation and causation

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 4


Correlation
Correlation
• A relationship between two variables.
• The data can be represented by ordered pairs (x, y)
 x is the independent (or explanatory) variable
 y is the dependent (or response) variable

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 5


Correlation
You try, Identify the explanatory (independent) variable
and the response (dependent) variable.

A teacher wants to determine if the teaching method used


by her students can be used to predict the students' test
scores.

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 6


Correlation
• In a scatter plot, the ordered pairs (x, y) are graphed
as points in a coordinate plane.
• The independent (explanatory) variable x is measured
on the horizontal axis, and the dependent (response)
variable y is measured on the vertical axis.
• A scatter plot can be used to determine whether a
linear (straight line) correlation exists between two
variables.

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 7


Correlation

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 8


Example: Constructing a
Scatter Plot
An economist wants to determine
whether there is a linear relationship
between a country’s gross domestic
product (GDP) and carbon dioxide
(CO2) emissions. The data are
shown in the table. Display the data
in a scatter plot and describe the
type of correlation. (Source: World
Bank and U.S. Energy Information
Administration)

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 9


Solution: Constructing a
Scatter Plot

Appears to be a positive linear correlation. Reading


from left to right, as the gross domestic products
increase, the carbon dioxide emissions tend to increase.
. Copyright 2019, 2015, 2012, Pearson Education, Inc. 10
Example: Constructing a
Scatter Plot
You try, Give an example of 2 variables that are negatively
correlated.

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 11


Example: Constructing a
Scatter Plot
A student conducts a study to determine whether there is
a linear relationship between the number of hours a
student exercises each week and the student’s grade
point average (GPA). The data are shown in the table
below. Display the data in a scatter plot and describe the
type of correlation.

Copyright 2019, 2015, 2012, Pearson Education, Inc. 12


Example: Constructing a
Scatter Plot
Solution:
From the scatter plot, it appears that there is no linear correlation
between the variables.

The number of hours a student exercises each week


does not appear to be related to the student’s
grade point average.
Copyright 2019, 2015, 2012, Pearson Education, Inc. 13
Correlation Coefficient
Correlation coefficient
• A measure of the strength and the direction of a linear
relationship between two variables.
• The symbol r represents the sample correlation
coefficient.
• A formula for r is
n  xy   x y n is the number
r
n  x 2   x
2
n  y 2   y
2 of data pairs

• The population correlation coefficient is represented


by ρ (rho).
. Copyright 2019, 2015, 2012, Pearson Education, Inc. 14
Correlation Coefficient
• The range of the correlation coefficient is –1 to 1.

–1 0 1
If r = –1 there is If r is close to 0 If r = 1 there is a
a perfect there is no linear perfect positive
negative correlation correlation
correlation

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 15


Linear Correlation

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 16


Example: Using Technology to
Find a Correlation Coefficient
Use a technology tool to
calculate the correlation
coefficient for the Old
Faithful data. What can you
conclude? The data can be
found in the StatCrunch data
set “Old_Faithful” from
Chapter 9.

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 17


Solution: Using Technology to
Find a Correlation Coefficient
r 0.979.
This value of r
suggests a strong
positive correlation.

As the durations of
the eruptions increase,
the times until the
next eruption tend to
increase.

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 18


Using Technology to Find a
Correlation Coefficient
You try, An economist wants to
determine whether there is a linear
relationship between a country’s
gross domestic product (GDP) and
carbon dioxide (CO2) emissions.
The data are shown in the table.
Find the correlation coefficient.
(Source: World Bank and U.S. Energy
Information Administration)

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 19


Possible Explanations for a
Correlation
1. The correlation may be a coincidence.

2. Both variables might be directly influenced by some


common underlying cause.
3. One of the correlated variables may actually be a
cause of the other. Note that, even in this case, it may
be only one of several causes.

Copyright 2019, 2015, 2012, Pearson Education, Inc. 20


Example: Explanation for a Correlation
Consider the negative correlation between infant
mortality and life expectancy. Which of the three
possible explanations for a correlation applies?

Copyright 2019, 2015, 2012, Pearson Education, Inc. 21


Example: Explanation for a
Correlation
Consider the negative correlation between infant
mortality and life expectancy. Which of the three
possible explanations for a correlation applies?
The negative correlation is probably due to a
common underlying cause – the quality of health
care. In countries where health care is better in
general, infant mortality is lower and life expectancy
is higher.

Copyright 2019, 2015, 2012, Pearson Education, Inc. 22


Eating Breakfast May Beat
Teen Obesity
In the study, published in Pediatrics, researchers analyzed the dietary and
weight patterns of 2,216 adolescents over a five-year period (1998-1999 to
2003-2004) from public schools in Minneapolis-St. Paul, Minn.

The researchers write that teens who ate breakfast regularly had a lower
percentage of total calories from saturated fat and ate more fiber and
carbohydrates than those who skipped breakfast. In addition, regular
breakfast eaters seemed more physically active than breakfast skippers.

Over time, researchers found teens who regularly ate breakfast tended to
gain less weight and had a lower body mass index than breakfast skippers.

Copyright 2019, 2015, 2012, Pearson Education, Inc. 23


Example: Explanation for a
Correlation
You try, Consider the correlation between breakfast
skipping and adolescent obesity? Which of the three
possible explanations for a correlation applies?

https://www.khanacademy.org/math/probability/xa88
397b6:scatterplots/estimating-trend-lines/v/correlatio
n-and-causality

Copyright 2019, 2015, 2012, Pearson Education, Inc. 24


Section 9.2

Linear Regression

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 25


Section 9.2 Objectives
• How to find the equation of a regression line
• How to predict y-values using a regression equation

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 26


Regression lines
• After verifying that the linear correlation between
two variables is significant, the next step is to
determine the equation of the line that best models
the data.
• This line is called a regression line, and its equation
can be used to predict the value of y for a given value
y
of x.

x
. Copyright 2019, 2015, 2012, Pearson Education, Inc. 27
Residuals
• For each data point, di represents the difference between the
observed y-value and the predicted y-value for a given x-value.
• These differences are called residuals and can be positive,
negative, or zero.

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 28


Regression Line
Regression line (line of best fit)
• The line for which the sum of the squares of the
residuals is a minimum.  di2
• The equation of a regression line for an independent
variable x and a dependent variable y is
ŷ = mx + b
y-intercept
Predicted Slope
y-value for
a given x-
value
. Copyright 2019, 2015, 2012, Pearson Education, Inc. 29
The Equation of a Regression
Line
• The equation of a regression line for an independent
variable x and a dependent variable y is
ŷ = mx + b
where is the predicted y-value for a given x-value.
The slope m and y-intercept b are given by
n  xy   x y y x
m 2 b y  mx   m
n  x 2   x n n
where is the mean of the y-values in the data set, is
the mean of the x-values, and n is the number of pairs
of data.

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 30


Example: Predicting y-Values
Using Regression Equations
The regression equation for the gross domestic products
(in trillions of dollars) and carbon dioxide emissions (in
millions of metric tons) data is ŷ = 199.535x + 56.036.
Use this equation to predict the expected carbon dioxide
emissions for the following gross domestic products.
(Recall from section 9.1 that x and y have a significant
linear correlation.)
1. $1.2 trillion dollars
2. $2.6 trillion dollars

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 31


Solution: Predicting y-Values
Using Regression Equations
1. $1.2 trillion dollars
To predict the expected carbon dioxide emissions,
substitute each gross domestic product for x in the
regression equation. Then calculate .

When the gross domestic product is $1.2 trillion, the


predicted CO2 emissions are 295.478 million metric
tons.
. Copyright 2019, 2015, 2012, Pearson Education, Inc. 32
Solution: Predicting y-Values
Using Regression Equations
2. You try, $2.6 trillion dollars
To predict the expected carbon dioxide emissions,
substitute each gross domestic product for x in the
regression equation. Then calculate .

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 33


Ch. 9.2 HW, #2

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 34


Using Technology to Predict
y-Values
You try, An economist wants to
determine whether there is a linear
relationship between a country’s
gross domestic product (GDP) and
carbon dioxide (CO2) emissions. Use
the data to predict the CO2 emissions
of country whose GDP is 4.3 trillion
dollars. (Source: World Bank and U.S.
Energy Information Administration)

. Copyright 2019, 2015, 2012, Pearson Education, Inc. 35

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy