Shapes
Shapes
Biostatistics
No. Biostat -8
Date:25.01.2009
MEASURES OF DISPERSION,
RELATIVE STANDING AND
SHAPE
A
B
Characteristics of an Ideal Measure of
Dispersion
It should be rigidly defined.
1.
Characteristics of Range
Simplest and most crude measure of
dispersion
It is not based on all the observations.
Unduly affected by the extreme values
and fluctuations of sampling.
The range may increase with the size of
the set of observations though it can
decrease
Gives an idea of the variability very
quickly
Percentiles, Quartiles (Measure of Relative Standing)
and Interquartile Range
Descriptive measures that locate the relative position of an
observation in relation to the other observations are called
measures of relative standing.
They are quartiles, deciles and percentiles
The quartiles & the median divide the array into four equal parts,
deciles into ten equal groups, and percentiles into one hundred
equal groups.
Given a set of n observations X1, X2, …. Xn, the pth percentile ‘P’ is the
value of X such that ‘p’ per cent of the observations are less than
and 100 –p per cent of the observations are greater than P.
25th percentile = 1st Quartile i.e., Q1
50th percentile = 2nd Quartile i.e., Q2
75th percentile = 3rd Quartile i.e., Q3
QL M QU
Figure 8.1 Locating of lower, mid and upper quartiles
Percentiles, Quartiles and Interquartile Range Contd.
n+1
Q1 = th ordered observation
4
Note:
1. MD is based on all values and hence cannot be calculated for open-
ended distributions.
2. It uses average but ignores signs and hence appears unmethodical.
3. MD is calculated from mean as well as from median for both
ungrouped data using direct method and for continuous
distribution using assumed mean method and short-cut-method.
4. The average used is either the arithmetic mean or median
Computation of Mean absolute Deviation
For individual series: X1, X2, ……… Xn
∑ |Xi -X|
M.A.D =
n
For discrete series: X1, X2, ……… Xn & with
corresponding frequency f1, f2, ……… fn
∑ fi |Xi -X|
M.A.D = ∑ fi
X: Mean of the data series.
Computation of Mean absolute Deviation:
For continuous grouped data: m1, m2, …… mn are the
class mid points with corresponding class
frequency f1, f2, ……… fn
∑ fi|mi -X|
M.A.D = ∑fi
X: Mean of the data series.
Coeff. Of MAD: = (MAD /Average)
The average from which the Deviations are
calculated. It is a relative measure of dispersion
and is comparable to similar measure of other
series.
Example:
Find MAD of Confinement after delivery in the
following series.
Days of No. of Total days of Absolute fi|Xi - X|
Confinement patients (f) confinement of each Deviation
( X) group Xf from mean
|X - X |
6 5 30 1.61 8.05
7 4 28 0.61 2.44
8 4 32 1.61 6.44
9 3 27 2.61 7.83
10 2 20 3.61 7.22
Total 18 137 31.98
Σ fi ( xi–x )2
σ= ∑fixi2 ∑fixi 2
------------
Σ fi
σ=
∑fi -( ∑ f )
i
Standard Deviation (σ) Contd.
Σ fi ( mi–x )2
σ= ∑fimi2 ∑fimi 2
√ ------------
Σ fi
σ=
∑ fi -( ∑ f )
i
• For any data set with the mean ‘µ’ and the
standard deviation ‘s’ at least 75% of the
values will fall within the 2σ interval and at
least 89% of the values will fall within the 3σ
interval of the mean
TABLE: Calculation of the standard deviation (σ)
Weights of 265 male students at the university of Washington
Class-Interval f d fd fd2
(Σƒd2) (Σfd)2
(Weight)
σ= - ×(i)
n n2
90-99 1 -5 -5 25
100-109 1 -4 -4 16 931 (99)2
110-119 9 -3 -27 81 = - ×(10)
265 265
120-129 30 -2 -60 120
130-139 42 -1 -42 42
140-149 66 0 0 0 (3.5132 – 0.1396) (×10)
=
150-159 47 1 47 47
160-169 39 2 78 156 = (1.8367) (10)
170-179 15 3 45 135
180-189 11 4 44 176
= 18.37 or 18.4
d = (Xi –A)/i n = Σfi
190-199 1 5 5 25
200-209 3 6 18 108 . A = 144.5, i = 10
n =265 Σƒd= 99 Σƒd2 = 931
Means, standard deviation, and coefficients of variation of the age
distributions of four groups of mothers who gave birth to one or
more children in the city of minneapol in: 1931 to 1935. Interprete
the data
CLASSIFICATION X σ CV
Resident married 28.2 6.0 21.3
Non-resident married 29.5 6.0 20.3
Resident unmarried 23.4 5.8 24.8
3
n n
xi − x
Skewness = ∑
(n − 1)(n − 2) i =1 s
n(n + 1) n
xi − x
4
3(n − 1) 2
Kurtosis = ∑ −
(n − 1)(n − 2)(n − 3) i =1 s (n − 2)(n − 3)