0% found this document useful (0 votes)

96 views36 pages

Shapes

The document discusses measures of dispersion and relative standing used in biostatistics. It defines measures of dispersion as quantifying how scattered data values are around the average. Several measures are described, including range, interquartile range, mean deviation, and standard deviation. Relative standing is measured using percentiles and quartiles to describe an observation's position compared to others. The purpose of these measures is to assess variability, compare distributions, and describe data shape. Worked examples are provided to demonstrate calculating different dispersion statistics.

Uploaded by

aue.ponytoy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

96 views36 pages

Shapes

Uploaded by

aue.ponytoy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 36

Lecture Series on

Biostatistics

No. Biostat -8
Date:25.01.2009

MEASURES OF DISPERSION,
RELATIVE STANDING AND
SHAPE

Dr. Bijaya Bhusan Nanda,

M. Sc (Gold Medalist) Ph. D. (Stat.)
Topper Orissa Statistics & Economics Services, 1988
bijayabnanda@yahoo.com
CONTENTS
 What is measures of dispersion?
 Why measures of dispersion?
 How measures of dispersions are calculated?
Range
Quartile deviation or semi inter-quartile range,
Mean deviation and
Standard deviation.
Methods for detecting outlier
 Measure of Relative Standing
 Measure of shape
LEARNING OBJECTIVE

 They will be able to:

describe the homogeneity or heterogeneity
of the distribution,
understand the reliability of the mean,
compare the distributions as regards the
variability.
describe the relative standing of the data
and also shape of the distribution.
What is measures of dispersion?
(Definition)
Central tendency measures do not
reveal the variability present in the
data.
Dispersion is the scattered ness of
the data series around it average.
Dispersion is the extent to which
values in a distribution differ from the
average of the distribution.
Why measures of dispersion?
(Significance)
 Determine the reliability of an
average
 Serve as a basis for the control
of the variability
 To compare the variability of
two or more series and
 Facilitate the use of other
statistical measures.
Dispersion Example

 Number of minutes 20  X:Mean Time – 14.6

clients waited to see a
consulting doctor minutes
Consultant Doctor  Y:Mean waiting time
X Y 14.6 minutes
05 15 15 16  What is the difference
12 03 12 18 in the two series?
04 19 15 14
37 11 13 17
06 34 11 15

X: High variability, Less consistency.

Y: Low variability, More Consistency
Frequency curve of distribution of
three sets of data

A
B
Characteristics of an Ideal Measure of
Dispersion
It should be rigidly defined.
1.

It should be easy to understand and easy to calculate.

It should be based on all the observations of the data.

It should be easily subjected to further mathematical

It should be least affected by the sampling fluctuation .

It should not be unduly affected by the extreme values.

6.
How dispersions are measured?
 Measure of dispersion:
 Absolute: Measure the dispersion in the
original unit of the data.
 Variability in 2 or more distrn can be
compared provided they are given in the
same unit and have the same average.
 Relative: Measure of dispersion is free from
unit of measurement of data.
 It is the ratio of a measaure of absolute
dispersion to the average, from which
absolute deviations are measured.
 It is called as co-efficient of dispersion.
How dispersions are measured? Contd.
 The following measures of
dispersion are used to study the
variation:
 The range
 The inter quartile range and
quartile deviation
 The mean deviation or average
deviation
 The standard deviation
How dispersions are measured? Contd.
Range:
The difference between the values of the two
extreme items of a series.
Example:
Age of a sample of 10 subjects from a population
of 169subjects are:
X1 X2 X3 X4 X5 X6 X7 X8 X9 X10
42 28 28 61 31 23 50 34 32 37
The youngest subject in the sample is
23years old and the oldest is 61 years, The
range: R=XL – Xs
= 61-23 =38
Co-efficient of Range:
R = (XL - XS) / (XL + XS)
= (61 -23) / (61 + 23) =38 /84 = 0.452

Characteristics of Range
 Simplest and most crude measure of
dispersion
 It is not based on all the observations.
 Unduly affected by the extreme values
and fluctuations of sampling.
 The range may increase with the size of
the set of observations though it can
decrease
 Gives an idea of the variability very
quickly
Percentiles, Quartiles (Measure of Relative Standing)
and Interquartile Range
 Descriptive measures that locate the relative position of an
observation in relation to the other observations are called
measures of relative standing.
 They are quartiles, deciles and percentiles
 The quartiles & the median divide the array into four equal parts,
deciles into ten equal groups, and percentiles into one hundred
equal groups.
 Given a set of n observations X1, X2, …. Xn, the pth percentile ‘P’ is the
value of X such that ‘p’ per cent of the observations are less than
and 100 –p per cent of the observations are greater than P.
 25th percentile = 1st Quartile i.e., Q1
 50th percentile = 2nd Quartile i.e., Q2
 75th percentile = 3rd Quartile i.e., Q3
QL M QU
Figure 8.1 Locating of lower, mid and upper quartiles
Percentiles, Quartiles and Interquartile Range Contd.
n+1
Q1 = th ordered observation
4

Q2 = 2(n+1) th ordered observation

4
Q3 = 3(n+1) th ordered observation
4
Interquartile Range (IQR): The difference
between the 3rd and 1st quartile.
IQR = Q3 – Q1
Semi Interquartile Range:= (Q3 – Q1)/ 2
Coefficient of quartile deviation:
(Q3 – Q1)/(Q3 + Q1)
Interquartile Range
Merits:
 It is superior to range as a measure of dispersion.
 A special utility in measuring variation in case of open end
distribution or one which the data may be ranked but measured
quantitatively.
 Useful in erratic or badly skewed distribution.
 The Quartile deviation is not affected by the presence of
extreme values.
Limitations:
 As the value of quartile deviation dose not depend upon every
item of the series it can’t be regarded as a good method of
measuring dispersion.
 It is not capable of mathematical manipulation.
 Its value is very much affected by sampling fluctuation.
 Another measure of relative standing is the z-score
for an observation (or standard score).
 It describes how far individual item in a distribution
departs from the mean of the distribution.
 Standard score gives us the number of standard
deviations, a particular observation lies below or above
the mean.
Standard xscore (or z -score) is defined as follows:
z-score=
For a population: X-µ
σ
where X =the observation from the population
µ the population mean, σ = the population s.d
For a sample z-score= X-X
s
where X =the observation from the sample
X the sample mean, s = the sample s.d
Mean Absolute Deviation (MAD) or Mean
Deviation (MD)
 The average of difference of the values of items from some average
of the series (ignoring negative sign), i.e. the arithmetic mean of the
absolute differences of the values from their average .

Note:
1. MD is based on all values and hence cannot be calculated for open-
ended distributions.
2. It uses average but ignores signs and hence appears unmethodical.
3. MD is calculated from mean as well as from median for both
ungrouped data using direct method and for continuous
distribution using assumed mean method and short-cut-method.
4. The average used is either the arithmetic mean or median
Computation of Mean absolute Deviation
For individual series: X1, X2, ……… Xn
∑ |Xi -X|
M.A.D =
n
For discrete series: X1, X2, ……… Xn & with
corresponding frequency f1, f2, ……… fn
∑ fi |Xi -X|
M.A.D = ∑ fi
X: Mean of the data series.
Computation of Mean absolute Deviation:
For continuous grouped data: m1, m2, …… mn are the
class mid points with corresponding class
frequency f1, f2, ……… fn
∑ fi|mi -X|
M.A.D = ∑fi
X: Mean of the data series.
Coeff. Of MAD: = (MAD /Average)
The average from which the Deviations are
calculated. It is a relative measure of dispersion
and is comparable to similar measure of other
series.
Example:
Find MAD of Confinement after delivery in the
following series.
Days of No. of Total days of Absolute fi|Xi - X|
Confinement patients (f) confinement of each Deviation
( X) group Xf from mean
|X - X |
6 5 30 1.61 8.05
7 4 28 0.61 2.44
8 4 32 1.61 6.44
9 3 27 2.61 7.83
10 2 20 3.61 7.22
Total 18 137 31.98

X = Mean days of confinement = 137 / 18 = 7.61

MAD=31.98 / 18=1.78, Coeff.of MAD= 1.78/7.61=0.233
Problem:
Find the MAD of weight and coefficient of MAD of
470 infants born in a hospital in one year from following
table.

Weight 2.0-2.4 2.5-2.9 3.0-3.4 3.5-3.9 4.0-4.4 4.5+

in Kg
No. of 17 97 187 135 28 6
infant
Merits and Limitations of MAD
 Simple to understand and easy to compute.
 Based on all observations.
 MAD is less affected by the extreme items than
the Standard deviation.
 Greatest draw back is that the algebraic signs
are ignored.
 Not amenable to further mathematical
treatment.
 MAD gives us best result when deviation is
taken from median. But median is not
satisfactory for large variability in the data. If
MAD is computed from mode, the value of the
mode can not be determined always.
Standard Deviation (σ)
It is the positive square root of the average of squares
of deviations of the observations from the mean. This is
also called root mean squared deviation (σ) .
For individual series: x1, x2, ……… xn
Σ ( xi–x )2 ∑xi2 ∑xi 2
σ=
√ ------------
n
σ=
√ n -( n )
For discrete series: X1, X2, ……… Xn & with
corresponding frequency f1, f2, ……… fn

Σ fi ( xi–x )2
σ= ∑fixi2 ∑fixi 2
------------
Σ fi
σ=
∑fi -( ∑ f )
i
Standard Deviation (σ) Contd.

For continuous grouped series with class

midpoints : m1, m2, ……… mn & with
corresponding frequency f1, f2, ……… fn

Σ fi ( mi–x )2
σ= ∑fimi2 ∑fimi 2
√ ------------
Σ fi
σ=
∑ fi -( ∑ f )
i

Variance: It is the square of the s.d

Coefficient of Variation (CV): Corresponding
Relative measure of dispersion.
σ
CV = ------- × 100
X
Characteristics of Standard Deviation:
 SD is very satisfactory and most widely used
measure of dispersion
 Amenable for mathematical manipulation
 It is independent of origin, but not of scale
 If SD is small, there is a high probability for
getting a value close to the mean and if it is large,
the value is father away from the mean
 Does not ignore the algebraic signs and it is less
affected by fluctuations of sampling
 SD can be calculated by :
• Direct method
• Assumed mean method.
• Step deviation method.
 It is the average of the distances of the observed
values from the mean value for a set of data
 Basic rule --More spread will yield a larger SD

Uses of the standard deviation

 The standard deviation enables us to determine,

with a great deal of accuracy, where the values

of a frequency distribution are located in relation
to the mean.
 Chebyshev’s Theorem

• For any data set with the mean ‘µ’ and the
standard deviation ‘s’ at least 75% of the
values will fall within the 2σ interval and at
least 89% of the values will fall within the 3σ
interval of the mean
TABLE: Calculation of the standard deviation (σ)
Weights of 265 male students at the university of Washington
Class-Interval f d fd fd2
(Σƒd2) (Σfd)2
(Weight)
σ= - ×(i)
n n2
90-99 1 -5 -5 25
100-109 1 -4 -4 16 931 (99)2
110-119 9 -3 -27 81 = - ×(10)
265 265
120-129 30 -2 -60 120
130-139 42 -1 -42 42
140-149 66 0 0 0 (3.5132 – 0.1396) (×10)
=
150-159 47 1 47 47
160-169 39 2 78 156 = (1.8367) (10)
170-179 15 3 45 135
180-189 11 4 44 176
= 18.37 or 18.4
d = (Xi –A)/i n = Σfi
190-199 1 5 5 25
200-209 3 6 18 108 . A = 144.5, i = 10
n =265 Σƒd= 99 Σƒd2 = 931
 Means, standard deviation, and coefficients of variation of the age
distributions of four groups of mothers who gave birth to one or
more children in the city of minneapol in: 1931 to 1935. Interprete
the data
CLASSIFICATION X σ CV
Resident married 28.2 6.0 21.3
Non-resident married 29.5 6.0 20.3
Resident unmarried 23.4 5.8 24.8

Non-resident unmarried 21.7 3.7 17.1

Example: Suppose that each day laboratory technician A completes

40 analyses with a standard deviation of 5. Technician B completes
160 analyses per day with a standard deviation of 15. Which
employee shows less variability?
Uses of Standard deviation
 Uses of the standard deviation
• The standard deviation enables us to
determine, with a great deal of accuracy,
where the values of a frequency
distribution are located in relation to the
mean. We can do this according to a
theorem devised by the Russian
mathematician P.L. Chebyshev (1821-
1894).
Measure of Shape
The fourth important numerical characteristic of a
data set is its shape: Skewness and kurtosis.
 Skewness
• Skewness characterizes the degree of
asymmetry of a distribution around its
mean. For a sample data, the
skewness is defined by the formula:

3
n n
 xi − x 
Skewness = ∑ 
(n − 1)(n − 2) i =1  s 


where n = the number of observations in the sample,

xi= ith observation in the sample, s= standard deviation of
the sample, x = sample mean
Measure of Shape

Figure 8.2 +ve or Right-skewed

distribution
Kurtosis:
Kurtosis characterizes the relative peakedness or flatness of
a distribution compared with the bell-shaped distribution
(normal distribution).
Kurtosis of a sample data set is calculated by the formula:

 n(n + 1) n
 xi − x  
4
3(n − 1) 2
Kurtosis =  ∑   −
 (n − 1)(n − 2)(n − 3) i =1  s   (n − 2)(n − 3)

Positive kurtosis indicates a relatively peaked distribution.

Negative kurtosis indicates a relatively flat distribution.
The distributions with positive and negative kurtosis
are depicted in Figure 8.4 , where the distribution with
null kurtosis is normal distribution.
REFERENCE
1. Mathematical Statistics- S.P Gupta
2. Statistics for management- Richard I.
Levin, David S. Rubin
3. Biostatistics A foundation for Analysis
in the Health Sciences.
THANK YOU

AWWA M22 Sizing Water Service Lines and Meters 3rd Ed 2014 PDF
100% (3)
AWWA M22 Sizing Water Service Lines and Meters 3rd Ed 2014 PDF
133 pages
02-1 Sampling Challenges in Laterite - Waheed Ahmad
100% (1)
02-1 Sampling Challenges in Laterite - Waheed Ahmad
31 pages
Dispersion 1
No ratings yet
Dispersion 1
32 pages
Measures of Dispersion, Relative Standing and Shape: No. Biostat - 8 Date:25.01.2009
No ratings yet
Measures of Dispersion, Relative Standing and Shape: No. Biostat - 8 Date:25.01.2009
49 pages
Bast 503 Lect 5
No ratings yet
Bast 503 Lect 5
53 pages
MBA Quantitative Techniques and Analytics 03
No ratings yet
MBA Quantitative Techniques and Analytics 03
14 pages
Dispersion
No ratings yet
Dispersion
26 pages
Final Dispersion2025
No ratings yet
Final Dispersion2025
61 pages
Dispersion: (Measures of Variability)
No ratings yet
Dispersion: (Measures of Variability)
93 pages
Final Measures of Dispersion DR Lotfi
No ratings yet
Final Measures of Dispersion DR Lotfi
54 pages
Dispersion
50% (2)
Dispersion
58 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
7 pages
Chapter 3 Measure of Variation Dhiraj (Becon 2025)
No ratings yet
Chapter 3 Measure of Variation Dhiraj (Becon 2025)
83 pages
Lecture 5 Notes
No ratings yet
Lecture 5 Notes
23 pages
Measures of Dispersion-April19
No ratings yet
Measures of Dispersion-April19
26 pages
Data Organization Method
No ratings yet
Data Organization Method
65 pages
MCT and MD For Pharmacy Students
No ratings yet
MCT and MD For Pharmacy Students
58 pages
Dispersion
No ratings yet
Dispersion
31 pages
Stat-L CHAPTER 4
No ratings yet
Stat-L CHAPTER 4
18 pages
Dispersion 26-11-2023
No ratings yet
Dispersion 26-11-2023
41 pages
Variability
No ratings yet
Variability
26 pages
Measures of Dispersion and Shape - ALAS DOS NA POTA
No ratings yet
Measures of Dispersion and Shape - ALAS DOS NA POTA
12 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
8 pages
1final Measures of Dispersion
No ratings yet
1final Measures of Dispersion
48 pages
Lecture No. 6 Measures of Variability
No ratings yet
Lecture No. 6 Measures of Variability
25 pages
8614.educational Statitics Unit 4
No ratings yet
8614.educational Statitics Unit 4
34 pages
BS Unit 3
No ratings yet
BS Unit 3
22 pages
Arm & Sa Spring 13
No ratings yet
Arm & Sa Spring 13
64 pages
Unit 2 Measures of Dispersion: Structure
No ratings yet
Unit 2 Measures of Dispersion: Structure
16 pages
QTT201 Ca-2
No ratings yet
QTT201 Ca-2
14 pages
Business Statistics: Dr. Basheer Ahmad Samim
No ratings yet
Business Statistics: Dr. Basheer Ahmad Samim
70 pages
Chapter Four: Measures of Dispersion (Variation) : Abebuabebaw
No ratings yet
Chapter Four: Measures of Dispersion (Variation) : Abebuabebaw
21 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
26 pages
Imp - MEASURES OF DISPERSION
No ratings yet
Imp - MEASURES OF DISPERSION
5 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
14 pages
Lesson 1
No ratings yet
Lesson 1
37 pages
COM 201 LessonNote
No ratings yet
COM 201 LessonNote
8 pages
Measures of Dispersion (Autosaved)
No ratings yet
Measures of Dispersion (Autosaved)
64 pages
Chapter 5
No ratings yet
Chapter 5
6 pages
Topic 3
No ratings yet
Topic 3
49 pages
Measures of Dispersion OR Measures of Variations
No ratings yet
Measures of Dispersion OR Measures of Variations
7 pages
Descriptive Statistics 1
No ratings yet
Descriptive Statistics 1
63 pages
Module 1 Statistical Inference
No ratings yet
Module 1 Statistical Inference
67 pages
Day 05
No ratings yet
Day 05
22 pages
Chapter 5 Statistics and Data
No ratings yet
Chapter 5 Statistics and Data
25 pages
Descriptive Lec
No ratings yet
Descriptive Lec
8 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
31 pages
Module 1 Statistical Inference
No ratings yet
Module 1 Statistical Inference
67 pages
Measures of Variability
No ratings yet
Measures of Variability
49 pages
Handnote Chapter 4 Measures of Dispersio
No ratings yet
Handnote Chapter 4 Measures of Dispersio
45 pages
Statistical Machine Learning
100% (1)
Statistical Machine Learning
12 pages
Measure of Variation
No ratings yet
Measure of Variation
16 pages
Chapter 04
No ratings yet
Chapter 04
18 pages
Agenda: Review Homework Lecture/discussion Week 9 Assignment
No ratings yet
Agenda: Review Homework Lecture/discussion Week 9 Assignment
30 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
32 pages
Intro To Statistics - Descriptive Statistics and NPC - 20250225 - 171911 - 0000
No ratings yet
Intro To Statistics - Descriptive Statistics and NPC - 20250225 - 171911 - 0000
23 pages
Week 3 Notes
No ratings yet
Week 3 Notes
119 pages
Tech Seminar Sem 2
No ratings yet
Tech Seminar Sem 2
35 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
32 pages
Dispersion
No ratings yet
Dispersion
74 pages
Measure of Dispersion
No ratings yet
Measure of Dispersion
64 pages
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
From Everand
De-Mystifying Math and Stats for Machine Learning: Mastering the Fundamentals of Mathematics and Statistics for Machine Learning
Seaport AI Madhavan
No ratings yet
Monitoring of Stack Emissions To Air (Environment Agency)
No ratings yet
Monitoring of Stack Emissions To Air (Environment Agency)
86 pages
Characteristics of An Effective Research Problem: What Is Research Methodology? (Why It's Important and Types)
No ratings yet
Characteristics of An Effective Research Problem: What Is Research Methodology? (Why It's Important and Types)
10 pages
Dealer's Satisfaction Maihar Cement
No ratings yet
Dealer's Satisfaction Maihar Cement
71 pages
Module 4
No ratings yet
Module 4
2 pages
All Chapters-2
No ratings yet
All Chapters-2
83 pages
Tvs
No ratings yet
Tvs
67 pages
BRM 2003 New
No ratings yet
BRM 2003 New
23 pages
Contributing Factors On The Prevalent Deviant Behavior of Senior High School Students of La Immaculada Concepcion School
No ratings yet
Contributing Factors On The Prevalent Deviant Behavior of Senior High School Students of La Immaculada Concepcion School
98 pages
Biostat 1
No ratings yet
Biostat 1
24 pages
Statistical Characteristics of Numerical Data
No ratings yet
Statistical Characteristics of Numerical Data
9 pages
President Ramon Magsaysay State University
No ratings yet
President Ramon Magsaysay State University
1 page
Chapter 1 Tut Exc
No ratings yet
Chapter 1 Tut Exc
8 pages
1 s2.0 S0378426623001966 Main
No ratings yet
1 s2.0 S0378426623001966 Main
17 pages
Quartile & Deviation
No ratings yet
Quartile & Deviation
31 pages
Quantitative Methods Module
No ratings yet
Quantitative Methods Module
28 pages
Soe Thiha Maung, OMBA-200218
No ratings yet
Soe Thiha Maung, OMBA-200218
66 pages
Welfare Final Project
100% (1)
Welfare Final Project
89 pages
Discussion Questions
No ratings yet
Discussion Questions
8 pages
2011 AP Statistics Free-Response Questions
No ratings yet
2011 AP Statistics Free-Response Questions
7 pages
NEW PDF Factors Affecting Women Entrepreneurs in Micro Business in East Shewa Zone
100% (1)
NEW PDF Factors Affecting Women Entrepreneurs in Micro Business in East Shewa Zone
16 pages
Integrating Mathematics With Yamuna River Project
No ratings yet
Integrating Mathematics With Yamuna River Project
24 pages
Statistics and Probability
No ratings yet
Statistics and Probability
95 pages
Melkamu Zewdu
No ratings yet
Melkamu Zewdu
91 pages
Synopsis - ITC Limited (FBD)
No ratings yet
Synopsis - ITC Limited (FBD)
3 pages
Test 2 Math & Stats 200561
No ratings yet
Test 2 Math & Stats 200561
7 pages
MR Chapter 3,4 & 5
No ratings yet
MR Chapter 3,4 & 5
16 pages
Chapter 1-5 NCAE RESULTS
25% (4)
Chapter 1-5 NCAE RESULTS
56 pages
Daniel STR
No ratings yet
Daniel STR
57 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Shapes

Uploaded by

Shapes

Uploaded by

Lecture Series on

Dr. Bijaya Bhusan Nanda,

 They will be able to:

 Number of minutes 20  X:Mean Time – 14.6

X: High variability, Less consistency.

It should be easy to understand and easy to calculate.

It should be based on all the observations of the data.

It should be easily subjected to further mathematical

It should be least affected by the sampling fluctuation .

It should not be unduly affected by the extreme values.

Q2 = 2(n+1) th ordered observation

X = Mean days of confinement = 137 / 18 = 7.61

Weight 2.0-2.4 2.5-2.9 3.0-3.4 3.5-3.9 4.0-4.4 4.5+

For continuous grouped series with class

Variance: It is the square of the s.d

Uses of the standard deviation

with a great deal of accuracy, where the values

Non-resident unmarried 21.7 3.7 17.1

Example: Suppose that each day laboratory technician A completes

where n = the number of observations in the sample,

Figure 8.2 +ve or Right-skewed

Positive kurtosis indicates a relatively peaked distribution.

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.