Topic 11 - Correlation
Topic 11 - Correlation
SOCIAL SCIENCE
SHAR 2053
CORRELATION
CORRELATION
Direction
Form
Degree
Scatterplot
Correlation Coefficient
Pearson Correlation
MEASURING AND DESCRIBING
RELATIONSHIPS BETWEEN VARIABLES
✓ Positive Relationship:
Both variables vary in the same direction - as
one goes up, the other goes up
eg.
salary and years of education are positively correlated because
the people who make the highest salaries tend to be the ones
who have gone to school the longest
✓ Negative Relationship:
Two variables vary in the opposite direction - as one goes up, the
other goes down. e.g:
The number of daily hassles and the amount of immunoglobulin A in a person's
system are negatively correlated because as the number of hassles goes up, the
amount of immunoglobulin A tends to go down.
X X
Y Y
X X
Scatterplot of Family Income vs Student’s Average Grade
COMPUTING CORRELATION
COEFFICIENT r
where
PERFORMING HYPOTHESIS TESTING WITH
PEARSON CORRELATION
The Recommended Steps (manual computation)
Step 4 - Compute
Step 5 - Make decision (compare r-value in step 4 with r-critical value in step 3)
EXAMPLE
Given is a set of scores.
a. Draw a scatter plot
b. Make a preliminary estimation of the correlation.
c. Calculate the Pearson Correlation and test the hypothesis
EXAMPLE
Looking at the
scatter plot, it
appears that there is
a very good (but not
perfect) positive
correlation. You
should expect an
approximate value of
r = +.8 or +.9.
Step 1
Ho : r = 0 (there is no population correlation)
H1 : r 0 (there is a population correlation)
Step 2 - p-value
(0.05), with two tail,
df = 3.
Step 3 - r-critical
value = 0.878
Mx=6 My=4
Step 5
With n=5 pairs of X and Y values the test has df=3. For a two-tailed test with α = .05, the
critical value is 0.878. Because our correlation is smaller than this value, we fail to reject
the null hypothesis and conclude that the correlation is not significant.
TRY IT OUT!
Student X Y
The table shows 8
pairs of data
(measuring X and Y) A 11 14
are taken from a
B 6 7
sample. Draw a
scatter plot and C 16 15
perform
appropriate D 4 7
hypothesis testing
to see if there is E 1 3
significant
F 10 9
relationship
between the two G 5 9
variables.
H 3 8
Student X Y X-Mx Y-My (X-Mx)2 (Y-My)2 (X-Mx)(Y-My)
A 11 14
B 6 7
C 16 15
D 4 7
E 1 3
F 10 9
G 5 9
H 3 8
Mx= My= SSx = SSy = SP =
THE HYPOTHESES:
https://cengage.vitalsource.com/#/books/9781305856424/cfi/593!/4
/4@0.00:0.00