Chapter 3 PDF
Chapter 3 PDF
CHAPTER 3
MEASURES OF DISPERSION AND LOCATION
3.6 Coefficient of Variation - is a measure of spread that describes the amount of variability
relative to the mean. Because the coefficient of variation is unitless, you can use it instead
of the standard deviation to compare the spread of data sets that have different units or
different means.
Note: Small measure of variation indicates
• More homogeneous
• Clustered closely around the mean
• Less variable
• More consistent
• More uniformly distributed
𝑠
Formula: 𝐶𝑉 = (100), where s = standard deviation and 𝑥̄ = mean
𝑥̄
Example1. The average age of the engineers at VSAS Pipeline Corporation is 30 years
with a standard deviation of 3. The average monthly salary of the engineers is
P45,000 with a standard deviation of P3,150. Determine the coefficient of variation
of age and salary.
Solution:
AGE: Given 𝑥̄ 1 = 30 𝑠1 = 3
𝑠 3
𝐶𝑉1 = 𝑥̄1 (100) = (30) (100) = (0.1)(100) = 𝟏𝟎%
1
Example2. Consider the set of contestants. Compare the variation of the two.
Set A 6 7 8 7 9
Set B 6 8 5 9 7
(Set A) x 𝒙 − 𝒙̄ (𝒙 − 𝒙̄ )𝟐
6 -1.40 1.96
7 -0.40 0.16
8 0.60 0.36
7 -0.40 0.16
9 1.60 2.56
∑ 𝑥 = 37 ∑(𝑥 − 𝑥̄ )2 = 5.20
∑ 𝑥 37
𝑥̄1 = = = 𝟕. 𝟒𝟎
𝑛 5
1
PSY 303 – PSYCHOLOGICAL STATISTICS
𝑠 1.14
𝐶𝑉1 = 𝑥̄1 (100) = (7.40) (100) = (0.1541)(100)
1
∑ 𝑥 35
𝑥̄ 2 = = =𝟕
𝑛 5
∑(𝑥−𝑥̄)2 10 10
𝑠2 = √ =√ =√ = 𝟏. 𝟓𝟖
𝑛−1 5−1 4
𝑠 1.58
𝐶𝑉2 = 𝑥̄2 (100) = ( ) (100) = (0.2257)(100)
2 7
𝑪𝑽𝟐 = 𝟐𝟐. 𝟓𝟕%
2
PSY 303 – PSYCHOLOGICAL STATISTICS
AGE f
11 – 20 5
21 – 30 7
31 – 40 12
41 – 50 22
51 – 60 8
61 – 70 4
71 – 80 2
TOTAL
2. In a subdivision with more than a hundred home owners, a distribution of two different
units is given to compare the dispersion of weights with the dispersion of heights. The
mean heights is 5.7ft with s=0.9ft and the mean weight is 72.5kg with s=8.1kg. Compare
the dispersion in the heights and weights.
3
PSY 303 – PSYCHOLOGICAL STATISTICS
3.7 Kurtosis – it is a statistical measure used to describe the distribution of observed data
around the mean. It measures the relative peakedness or flatness of a distribution.
𝑛(𝑛+1) 𝑥−𝑥̄ 4 3(𝑛−1)2
Formula: 𝑘𝑢𝑟𝑡 = {[(𝑛−1)(𝑛−2)(𝑛−3)] [∑ ( ) ]} − (𝑛−2)(𝑛−3)
𝑠
Types of Kurtosis
A. Mesokurtic – are intermediate distribution which are neither too peaked nor too
flat. The values are immediately distributed about the center. Its kurtosis is zero
(kurt=0).
B. Leptokurtic – are distributions where values clustered heavily or pile up in the
center. There is tall distribution with narrow humps and long and high tails. Its
kurtosis is positive (kurt>0) and it denotes a high degree of peakedness.
C. Platykurtic – are flat distributions with values more evenly distributed about the
center with broad humps and short tails. Its kurtosis is negative (kurt<0) and it
denotes a low degree of peakedness.
Recall:
∑ 𝑓𝑥 2,167
𝑥̄ = = = 𝟓𝟒. 𝟏𝟖
𝑛 40
∑ 𝑓 (𝑥−𝑥̄ )2 6,461.7760 6,461.7760
𝑠=√ 𝑛−1
=√ 40−1
=√ 39
= 𝟏𝟐. 𝟖𝟕
4
PSY 303 – PSYCHOLOGICAL STATISTICS
5
PSY 303 – PSYCHOLOGICAL STATISTICS
EXERCISES 3.7
6
PSY 303 – PSYCHOLOGICAL STATISTICS
3.8 Skewness - Degree to which a statistical distribution is not in balance around the mean
(is asymmetrical or lopsided), a perfectly symmetrical distribution having a value of 0.
Distributions with extreme values (outliners) above the mean have positive skew, and
the distributions with outliners below the mean have negative skew.
Coefficient of Skewness – measures the general shape of the distribution or the
lack of symmetry of a distribution.
Formulas:
3(𝑥̄−𝑥̃)
𝑠𝑘 = (Pearson’s Coefficient of Skewness)
𝑠
𝑛 𝑥−𝑥̄ 3
𝑠𝑘 = (𝑛−1)(𝑛−2) [∑ ( ) ] (Software Coefficient of Skewness)
𝑠
Types of Distribution
A. Negatively Skewed Distribution – The mass of the data values fall to the right of
the mean and group at the upper end of the distribution with the tail to the left.
The mean is to the left of the median and the mode is to the right of the median.
B. Symmetrical Distribution – The data values are evenly distributed on both sides
of the mean. The distribution is unimodal and the mean, median and mode are
equal and are at the center of the distribution
C. Positively Skewed Distribution – Most of the values in the data fall to the left of
the mean and group at the lower end of the distribution with the tail to the right.
The mean is to right of the median and the mode is to the left of the median.
7
PSY 303 – PSYCHOLOGICAL STATISTICS
3
𝑥−𝑥 𝑥−𝑥
CI f 𝑥 𝑥−𝑥 ( ) ( )
𝑠 𝑠
18 – 26 2 22 -32.18 -2.5004 -15.6325
27 – 35 1 31 -23.18 -1.8011 -5.8427
36 – 44 5 40 -14.18 -1.1018 -1.3375
45 – 53 8 49 -5.18 -0.4025 -0.0652
54 – 62 15 58 3.82 0.2968 0.0261
63 – 71 6 67 12.82 0.9961 0.9883
72 – 80 3 76 21.82 1.6954 4.8732
𝒙−𝒙 𝟑
TOTAL n = 40 ∑( ) = -16.9903
𝒔
Solution:
Recall:
∑ 𝑓𝑥 2,167
𝑥̄ = = = 𝟓𝟒. 𝟏𝟖
𝑛 40
∑ 𝑓 (𝑥 − 𝑥̄ )2 6,461.7760 6,461.7760
𝑠=√ =√ =√ = 𝟏𝟐. 𝟖𝟕
𝑛−1 40 − 1 39
𝑛
−𝑐𝑓 20−16 4 36
𝑥̃ = 𝐿𝐵 + [ 2 𝑓 ] ⋅ 𝑖 = 53.5 + [ ] (9) = 53.5 + [ ] (9) = 53.5 + = 53.5 + 2.40
15 15 15
̃ = 𝟓𝟓. 𝟗𝟎
𝒙
𝑛 𝑥−𝑥̄ 3 40
b. 𝑠𝑘 = [ ∑( ) ] = [(40−1)(40−2)] [−16.9903]
(𝑛−1)(𝑛−2) 𝑠
40 40
=[ ] [−16.9903] = [ ] [−16.9903]
(39)(38) 1,482
= [0.027][−16.9903] = −0.4587
𝒔𝒌 = −𝟎. 𝟒𝟔 Negatively Skewed Distribution
8
PSY 303 – PSYCHOLOGICAL STATISTICS
EXERCISES 3.8
1. Given the grouped frequency distribution. Find the value of Pearson’s Coefficient of
Skewness
AGE f
11 – 20 5
21 – 30 7
31 – 40 12
41 – 50 22
51 – 60 8
61 – 70 4
71 – 80 2
TOTAL
9
PSY 303 – PSYCHOLOGICAL STATISTICS
3.9 Outliers - A value that "lies outside" (is much smaller or larger than) most of the other
values in a set of data.
- it is an observation that lies an abnormal distance from other values
in a random sample from a population.
- Extremely high or extremely low values in the data set.
Example1: Using the ungrouped data of the Ages of Previous Customers. Check
the data set for outliers.
59 50 52 38 80 62 77 56
60 61 58 62 51 36 54 18
71 54 44 52 26 63 58 56
41 34 61 50 60 53 62 62
53 43 63 71 65 79 45 66
n = 40, even
𝑘𝑛 1
𝑄𝑘 = +
4 2
(1)(40) 1 40 1
𝑄1 = + = + = 10 + 0.5 = 10.5
4 2 4 2
50 + 50 100
𝑄1 = = = 𝟓𝟎
2 2
(3)(40) 1 120 1
𝑄3 = + = + = 30 + 0.5 = 30.5
4 2 4 2
62 + 62 124
𝑄3 = = = 𝟔𝟐
2 2
𝐼𝑄𝑅 = 𝑄3 − 𝑄1 = 62 − 50
𝑰𝑸𝑹 = 𝟏𝟐
𝑎. 𝑄1 − 1.5(𝐼𝑄𝑅 ) = 50 − 1.5(12) = 50 − 18 = 𝟑𝟐
𝑏. 𝑄3 + 1.5(𝐼𝑄𝑅 ) = 62 + 1.5(12) = 62 + 18 = 𝟖𝟎
Example2: The Age of 12 patients that entered the hospitals in the morning of a day are
47, 80, 42, 53, 36, 75, 30, 61, 22, 43, 85, and 72. Check the data set for outliers.
10
PSY 303 – PSYCHOLOGICAL STATISTICS
𝑘 (𝑛 + 1)
𝑄𝑘 =
4
(1)(11 + 1) (1)(12)
𝑄1 = =
4 4
12
=
4
𝑄1 = 3𝑟𝑑
𝑸𝟏 = 𝟑𝟔
(3)(11 + 1) (3)(12)
𝑄3 = =
4 4
36
=
4
𝑄3 = 9𝑡ℎ
𝑸𝟑 = 𝟕𝟐
𝐼𝑄𝑅 = 𝑄3 − 𝑄1 = 72 − 36
𝑰𝑸𝑹 = 𝟑𝟔
NO OUTLIERS
11
PSY 303 – PSYCHOLOGICAL STATISTICS
EXERCISES 3.9
1. During the first 11 weeks of world economic crisis, the weekly average hours worked in
manufacturing jobs were 45, 46, 40, 39, 30, 36, 38, 41, 42, 48, and 35 respectively. Check
for outliers of the given set of data.
12
PSY 303 – PSYCHOLOGICAL STATISTICS
Xlowest Xhighest
Q1 Q2 = 𝑥̃ Q3
Example: Using the ungrouped data of the Ages of Previous Customers. Make a boxplot.
Solution: Given 𝑄1 = 50 𝑄3 = 62 OUTLIERS are 18 and 26
Xlowest = 34 Xhighest = 80
𝑘𝑛 1
𝑄𝑘 = +
4 2
(2)(40) 1 80 1
𝑄2 = + = + = 20 + 0.5 = 20.5
4 2 4 2
56 + 58 114
𝑄2 = = = 𝟓𝟕
2 2
50 57 62
18 26 34 80
Xlowest Xhighest
Q1 Q2 = 𝑥̃ Q3
15 20 25 30 35 40 45 50 55 60 65 70 75 80 85
13
PSY 303 – PSYCHOLOGICAL STATISTICS
EXERCISES 3.10
1. Consider the data of scores 3, 5, 9, 13, 15, 17, 19, 21, 23, and 25. Construct a boxplot.
2. Twelve randomly selected psychology students were asked to state the number of hours
they slept last Sunday. The resulting data are 6, 7, 8, 10, 5, 4, 11, 12, 11, 10, 8 and 7.
Construct a boxplot.
14
PSY 303 – PSYCHOLOGICAL STATISTICS
1. the following data give the weight (in pounds) lost by 8 randomly selected new member
of a psychology club at the end of their first 3 months of membership. Complete the table
and find the following:
x 𝑥 − 𝑥̄ |𝑥 − 𝑥̄ |
13
14
17
20
25
18
16
15
a. Range
b. Average Deviation
c. Standard Deviation using the second formula
AGE f
18 – 20 3
21 – 23 6
24 – 26 8
27 – 29 5
30 – 32 4
33 – 35 4
TOTAL
a. Range
b. Average Deviation
c. Standard Deviation using the first formula
d. Variance using the shortcut formula
e. 𝑃80 , 𝐷5 and 𝑄3
f. Midhinge, IQR and QD
g. Kurtosis
h. Pearson’s Coefficient of Skewness
i. Construct a Box and Whisker plot
3. ASIAN company manufactures computer terminals. The following data are the number of
computer terminals produced at a company for a sample of fifteen days: 7, 20, 15, 20, 22,
17, 22, 18, 19, 12, 15, 16, 17, 13, and 22. Check for outliers of the given set of data and
Construct a boxplot.
15