Measures of Dispersion
Measures of Dispersion
3
Types of Measures of Dispersion
There are two main types of measures of dispersion:
1. Absolute Measure of Dispersion
2. Relative Measure of Dispersion
1. Absolute Measure of Dispersion
The absolute measure of dispersion measures the variation present among
the observations in the unit of the variable or square of the unit of the
variable.
2. Relative Measure of Dispersion
The relative measure of dispersion measures the variation present among
the observations relative to their average. It is expressed in the form of
ratio, or percentage. It is independent of the unit of measurement.
4
Measures of Dispersion
The commonly used measures of absolute dispersion are:
1. Range
2. Quartile Deviation
3. Mean (Average) Deviation
4. Variance and Standard Deviation
5
Measures of Spread
❖Distance Based Measures of Spread
• The range
• The Semi interquartile range
❖Centre Based Measures of Spread
• The mean deviation
• The variance
• The standard deviation
6
Distance Based Measures of Spread
Range
Range. The difference between the largest and the smallest
observations:
X Max = X m , X min = X 0
Range
• Coefficient = Xm − X0
of Range.
Xm − X0
Coefficient of Range =
Xm + X0
7
Range
Example: The following data set shows the weekly TV viewing
times, in hours. Find Range and Coefficient of Range
25 41 27 32 43 66 35 31 15 5
34 26 32 38 16 30 38 30 20 21
X m = 66, X 0 = 5
Range = X m − X 0 = 66 − 5 = 61
X m − X 0 66 − 5
Coefficient of Range = = = 0.86
X m + X 0 66 + 5
8
Disadvantages of the Range
• Ignores the way in which data are distributed
7 8 9 10 11 12 7 8 9 10 11 12
Range = 12 - 7 = 5 Range = 12 - 7 = 5
• Sensitive to outliers
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
Range = 5 - 1 = 4
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 120 - 1 = 119
9
Interquartile Range
10
Semi Interquartile Range or Quartile Deviation
11
Quartile Deviations. Find IQR,Q.D and Coefficient of Q.D
5 15 16 20 21 25 26 27 30 30
31 32 32 34 35 37 38 41 43 66
1(20+1)
LOCATION of Q1 = th observation in the data
4
=5.25th observation
VALUE of Q1 = 5th obs. + 0.25{6th obs.-5th obs.}
= 21 + 0.25{25-21} = 22.0 Hours
12
2(20 + 1)
LOCATION of Q2 = th observation in the data
4
= 10.50th observation
VALUE of Q 2 = 10th obs. + 0.50{11th obs.-10th obs.}
= 30 + 0.50{31- 30} = 30.5 Hours
3(20 + 1)
LOCATION of Q = th observation in the data
3 4
= 15.75th observation
VALUE of Q3 = 15th obs. + 0.75{16th obs. -15th obs.}
= 35 + 0.75{37 - 35} = 36.5 Hours
13
Q1 = 22.0 Hours, Q3 = 36.5 Hours
I.Q.R=Q3 − Q1 = 36.5 − 22.0 = 14.50
Q3 − Q1 36.5 − 22.0
Q.D = = = 7.25
2 2
Q3 − Q1 36.5 − 22.0
Coefficient of Q.D = = = 0.12
Q3 + Q1 36.5 + 22.0
14
Centre Based Measures of Spread
• Several statistics use the centre of the data as a point of reference
and reflect how data is clustered around it
15
Mean Deviation: is defined as the arithmetic mean of
the absolute deviation of observations from
mean/Median/Mode.
M .D from Mean M.D from Median
M .Dmean =
X −X
M .Dmedian =
X − Median
n n
M .D M .D
Coefficient of M .D = Coefficient of M .D =
X Median
M.D from Mode
M .Dmod e =
X − Mode
n
M .D
Coefficient of M .D =
Mode 16
X − X
Find M.D and Coefficient X cm
4 +6
of M.D. From mean
6 +4
9 +1
• The M.D of scores is 12 2
M .D =
X −X
13 3
n
16 6
M .D
Coefficient of M .D = 60 22
X
X=
60
= 10 X X −X
6
M .D =
X − X 22
= = 3.67
n 6
M .D 3.67
Coeff . of M .D = = = 0.367
X 10 17
X − Median
Find M.D and Coefficient X cm
4 +6.5
of M.D. from Median
6 +4.5
n +1
Median = th value = 3.5th value 9 +1.5
2
12 1.5
Median = 3rd value + 0.5(4th value − 3rd value)
13 2.5
median = 9 + 0.5(12 − 9) = 10.5 16 5.5
20
• The M.D of scores is X − Median
M .DMedian =
X − Median
=
20
− = 3.33
n 6
M .D 3.33
Coefficient of M .D = = = 0.317
Median 10.5
18
X − Mode
Find M.D and Coefficient X cm
4 +5
of M.D. from Mode
9 0
Mode = 9 9 0
12 3
• The M.D of scores is 13 4
16 7
19
X − Mode
M .DMode=
X − Mode
=
19
− = 3.17
n 6
M .D 3.17
Coefficient of M .D = = = 0.352
Mode 9
19
The Mean (absolute) Deviation
Problem:-Sum of deviations from X (X − X ) X −X
( )
2
( X − X ) 2
1 X
S =
2
n
or S =
2
n
X −
2
n
(Unbiased Estimate Variance)
( )
2
(X − X ) 2
1 X
S 2
=
n −1
or S =
2
n −1
X −
2
n
21
Q. Find Variance of the X ( X − X )2
given data, s2 4 36
6 16
X 60 9 1
X= = = 10 12 4
n 6
13 9
S =
2 ( X − X ) 102
2
= = 17
16 36
n 6 60 102
X
S =
2 ( X − X ) 2
=
102
= 20.4
( X − X ) 2
n −1 5
22
Q. Find Variance of the
X cm X2
given data, s2
4 16
X 60 6 36
X = = = 10
n 6 9 81
( X)
2
1
= 17 12 144
S =
2
X −
2
n n 13 169
1 (60) 2 16 256
S = 702 −
2
= 17 cm 2
6 6 60 702
1
( X)
2
X X 2
S =
2
n −1
X −
2
n
1 (60) 2
n=6
S = 702 −
2
= 20.4
5 6
23
Standard Deviation: is defined as the Positive square root of
the arithmetic mean of the Squared deviation of
observations from mean it is denoted by
S
( Biased Estimate S .D )
( )
2
( X − X ) 2
1 X
S=
n
or S =
n
X −
2
n
(Unbiased Estimate S .D )
(X − X ) ( )
2
2
1 X
S=
n −1
or S =
n −1
X −
2
n
24
Q. Find S.D of the given X ( X − X )2
data, s 4 36
6 16
X 60
X= = = 10
9 1
n 6 12 4
(X − X )
13 9
2
102
S= = = 4.123 16 36
n 6 60 102
X ( X − X ) 2
S=
(X − X ) 2
=
102
= 4.517
n −1 5
25
Q. Find S.D and Coefficient
X cm X2
Variation of the given data, s2
4 16
X 60
X = = = 10 6 36
n 6
9 81
( X)
2
1
S=
n
X −
2
n
12 144
13 169
16 256
1 (60) 2
S= 702 − = 17 = 4.123cm
6 6 60 702
1
( X)
2
X X 2
S=
n −1 X −
2
n
n=6
1 (60) 2
S= 702 − = 20.4 = 4.517
5 6
26
Coefficient of Variation (CV)
• Always in percentage (%)
• Shows relative variability, that is, variability relative to the
magnitude of the data i.e variation relative to mean
• Can be used to compare two or more sets of data measured
in different units or same units but different average size
S
C.V = .100%
X
27
Coefficient of Variation (CV)
X = 10
S = 4.123
4.123
CV = 100%=41.23%
10
28
Comparing Coefficient of Variation
Mr. Ali
AverageMarks out of 60 = X 1 =53
Standard deviation=S1 = 5
S1 5 Mr. Ali is
CVALI = 100% = 100% = 9.43%
X1 53 more
Consistent in
Mr.Zain performance
AverageMarks out of 80 = X 1 =72
Standard deviation=S1 = 10
S1 10
CVALI = 100% = 100% = 13.88%
X1 72
29
Comparing Coefficient
of Variation
• Stock A:
o Average price last year = $50
o Standard deviation = $5
s $5
C V A = 100% = 100% = 10%
x $50 Both stocks
have the same
• Stock B:
standard
o Average price last year = $100 deviation, but
o Standard deviation = $5 stock B is less
variable relative
to its price
s $5
C VB = 100% = 100% = 5%
x $100
Coefficient of Variation
Summary statistics for WEIGHT and HEIGHT (both ratio variables) of Pakistani adults in different units:
Weight Height Weight Height
Mean 160 pounds 66 inches SD 30 pounds 4 inches
72.6 kilograms 5.5 feet 13.6 kilograms 0.33 feet
0.08 tons 168 centimeters 0.015 tons 10.2 centimeters
Which variable [WEIGHT or HEIGHT] has greater dispersion? [No meaningful answer can be given]
Which variable has greater dispersion relative to its average, e.g., greater Coefficient of Dispersion (SD relative
to mean)?
S1 30 13.6 0.015
CVWeight = 100% = = = = 18.7%
X1 160 72.6 0.08
S1 4 0.33 10.2
CVHeight = 100% = = = = 6.1%
X1 66 5.5 168
Note that the Coefficient of Variation is a pure number, not expressed in any units and is the same whatever
units the variable is measured in.
Choosing Appropriate
Measure of Variability
• If data are symmetric, with no serious outliers, use range and
standard deviation.
• If data are skewed, and/or have serious outliers, use IQR.
• If comparing variation across two data sets, use coefficient of
variation (C.V)
32