Unit III
Unit III
UNIT-III
MEASURE OF DISPERSION
The measures of central tendency describes the central part of values in the data set
appears to concentrate around a central value called average. But these measures do not reveal
how these values are dispersed (spread or scattered) on each side of the central value. Therefore
while describing data set it is equally important to know how for the item in the data are close
around or scattered away from the measures of central tendency.
Definitions
Dispersion or spread is the degree of the scatter or variation of the variable about a central value
– Brooks & Dick
Example
Look at the runs scored by the two cricket players in a test match:
1. Absolute measures
2. Relative measures
1. Absolute Measures
Such measures are not suitable for comparing the variability of the two data sets which
are expressed in different units of measurements.
Range
Raw Data
Range is defined as difference between the largest and smallest observations in the data
set. Range (R) = Largest value in the data set (L) - Smallest value in the data set (S)
R=L–S
Grouped Data
For grouped frequency distribution of values in the data set, the range is the difference
between the upper class limit of the last class interval and the lower class limit of first class
interval.
Coefficient of Range
Coefficient of Range = (L - S) / (L + S)
Example
The following data relates to the heights of 10 students (in cm’s) in a school. Calculated
the range and coefficient of range
158, 164, 168, 170, 142, 160, 154, 174, 159, 146
Solution
L = 174 S = 142
Range = 32
Coefficient of range = (L – S) / (L + S)
Example
Calculate the range and the co-efficient of range for the marks obtained by 100 students
in a school
Range = L – S
= 75 – 60
Range = 15
Coefficient of range = (L – S) / (L + S)
Merits
Limitations
The calculations of range is based on only two values - largest value and smallest value.
It is largely influenced by two extreme values.
It cannot be computed in the case of open-ended frequency distributions
It is not suitable for further mathematical treatment.
Coefficient of Q.D.
Q3 Q1 2 Q3 Q1
Q3 Q1 2 Q3 Q1
Example
Find out the value of quartile deviation and its coefficient from the following data:
Roll No 1 2 3 4 5 6 7
Marks 20 28 40 12 30 15 50
Solution
12 15 20 28 30 40 50
N 1
Q1 Size of th item
4
7 1
Size of th item
4
Size of 2nd item is 15. Thus Q1 = 15
N 1
Q3 Size of 3 th item
4
3 8
Size of th item
4
Size of 6nd item is 40. Thus Q3 = 40
Q3 Q1
Q.D =
2
40 15
2
Q.D 12.5
Q3 Q1
Coefficient of Q.D.
Q3 Q1
40 15 25
40 15 55
Coefficient of Q.D. 0.455
Example
Marks 10 20 30 40 50 60
No. of Students 4 7 15 8 7 2
Solution:
N 1 43 1
Q3 Size of 3 th item 3 33rd item
4 4
Q3 Q1 40 20
Q.D. 10
2 2
Q3 Q1 40 20 20
Coefficient of Q.D. 0.333
Q3 Q1 40 20 60
Example
Calculate quartile deviation and the coefficient of quartile deviation from the following data
Wages in Rupees per hour Less than 35 35-37 38-40 41-43 Over 43
Number of wage earners 14 62 99 18 7
Solution
Wages
f c.f
(Rs. Per hour)
Less than 35 14 14
35-37 62 76
38-40 99 175
41-43 18 193
Over 43 7 200
Q Q1
Q.D = 3
2
N 200
Q1 Size of th item 50th item
4 4
N
c. f .
Q1 L 4 i
f
50 14
Q1 35 2 35 1.16 36.16
62
3N 3 200
Q3 Size of th item 150th item
4 4
3N
c. f .
Q1 L 4 i
f
150 76
Q3 38 2 38 1.49 39.49
99
39.49 36.16
Q.D. 1.67
2
Merits
It is not affected by the extreme (highest and lowest) values in the data set.
It is an appropriate measure of variation for a data set summarized in open-ended class
intervals.
It is a positional measure of variation; therefore it is useful in the cases of erratic or
highly skewed distributions.
Limitations
The QD is based on the middle 50 per cent observed values only and is not based on all
the observations in the data set, therefore it cannot be considered as a good measure of
variation.
It is not suitable for mathematical treatment.
It is affected by sampling fluctuations.
The QD is a positional measure and has no relationship with any average in the data set.
Mean Deviation
The Mean Deviation (MD) is defined as the arithmetic mean of the absolute deviations of
the individual values from a measure of central tendency of the data set. It is also known as the
average deviation.
The measure of central tendency is either mean or median. If the measure of central
tendency is mean (or median), then we get the mean deviation about the mean (or median).
MD (about mean)
D
D xx
N
MD (about median)
D m
Dm x median
N
Example
The following are the weights of 10 children admitted in a hospital on a particular day.
Find the mean deviation about mean, median and their coefficients of mean deviation.
7 4 10 9 15 12 7 9 9 18
Solution
n = 10; Mean : x
x 100 10
n 10
9 9 18
Median 9
2 2
MD (about mean)
D 30 3
N 10
MD (about median)
D m
28
2.8
N 10
Example
x 10 11 12 13 14
f 3 12 18 12 3
Solution
x f c.f D x median f D
10 3 3 2 6
11 12 15 1 12
12 18 33 0 0
13 12 45 1 12
14 14 48 2 6
Total = 60 48 36
MD
f D
N
N 1 48 1
Median Size of th item 24.5th item
2 2
MD
f D
36
0.75
N 48
Example
N
c. f .
Median L 2 i
f
50 37
Median 30 10 30 5.2 35.2
25
MD
D
1314.8
13.148
n 100
Standard Deviation
Definition: Standard deviation is the positive square root of average of the deviations of all the
observation taken from the mean. It is denoted by a Greek letter σ.
Ungrouped data
,d x x
N
d d
2 2
Standard deviation ,d x A
N
2. Assumed mean method:
N
fd fd
2 2
, d x A
N N
fd fd
2
x A
2
N
C, d
N C
Merits
Limitations
Example
Weights of children admitted in a hospital is given below calculate the standard deviation of
weights of children.
13 15 12 19 10.5 11.3 13 15 12 9
Solution
A.M ., x
x
n
13 15 12 19 10.5 11.3 13 15 12 9
10
129.8
10
x 12.98
x d = x – 12.98 d2
13 0.02 0.0004
15 2.02 4.0804
12 -0.98 0.9604
19 6.02 36.2404
10.5 2.48 6.1504
11.3 -1.68 2.8224
13 0.02 0.0004
15 2.02 4.0804
12 -0.98 0.9604
9 -3.98 15.8404
129.8 71.136
Standard deviation
d 2
,d x x
N
71.136
10
2.67
Example
The wholesale price of a commodity for seven consecutive days in a month is as follows:
Days 1 2 3 4 5 6 7
Commodity/price/quintal 240 260 270 245 255 286 264
Calculate the variance and standard deviation.
Solution
Observations (x) d = x - A d2
240 -15 225
260 5 25
270 15 225
245 -10 100
255 0 0
286 31 961
264 9 81
35 1617
d d
2 2
Variance 2
n n
2
1617 35
7 7
231 52 231 25
Variance 2 206
206 14.35
Example
Solution
A = 35, C = 10
Profit Mid-value x A
d f fd fd2
(Rs. In Crore) (x) C
0-10 5 -3 8 -24 72
10-20 15 -2 12 -24 48
20-30 25 -1 20 -20 20
30-40 35 0 30 0 0
40-50 45 1 20 20 20
50-60 55 2 10 20 40
Total 100 -28 400
fd fd
2 2
Standatd deviation C
N N
200 28
2
C
100 100
2 0.078 10
13.863
Coefficient of Variation
The coefficient of variation is obtained by dividing the standard deviation by the mean
and multiplying it by 100. Symbolically,
Coefficient of Variation C.V . 100
x
Merit
The CV is independent of the unit in which the measurement has been taken, but standard
deviation depends on units of measurement. Hence one should use the coefficient of
variation instead of the standard deviation.
Limitations
If the value of mean approaches 0, the coefficient of variation approaches infinity. So the
minute changes in the mean will make major changes.
Example
The scores of two batsmen, A and B, in ten innings during a certain season, are as under:
Solution
Coefficient of Variation C.V . 100
x
5
C.V for batsman A 100 10%
50
25
C.V for batsman B 100 33.33%
75
Example
The weekly sales of two products A and B were recorded as given below
Product A 59 75 27 63 27 28 56
Product B 150 200 125 310 330 250 225
Find out which of the two shows greater fluctuations in sales.
Solution:
For comparing the fluctuations in sales of two products, we will prefer to calculate coefficient of
variation for both the products.
A = 56
Sales (x) Frequency (f) fd fd2
d=x–A
27 2 -29 -58 1682
28 1 -28 -28 784
56A 1 0 0 0
59 1 3 3 9
63 1 7 7 49
75 1 19 19 361
Total 7 -57 2885
x A
fd 56 57 47.86
N 7
fd fd
2
2885 57
2 2
Variance 2
412.14 66.30 345.84
N N 7 7
18.59
C.V .( A) 100 100 38.84%
x 47.86
A = 225
Sales (x) Frequency (f) fd fd2
d=x–A
125 1 -100 -100 10000
150 1 -75 -75 5625
200 1 -25 -25 625
225 1 0 0 0
250 1 25 25 625
310 1 85 85 7225
330 1 105 105 11025
Total 7 15 35125
x A
fd 225 15 227.14
N 7
fd fd 35125 15 2
2 2
Variance 2
5017.85 4.59 5013.26
N N 7 7
70.80
C.V .( B) 100 100 31.17%
x 227.14
Since the coefficient of variation fro product A is more than that of product B,