Intro. SWE and Agro Eco
Intro. SWE and Agro Eco
In the plural sense:- statistics is defined as the collection of numerical facts or figures (or the
raw data themselves).
Eg. 1. Vital statistics (numerical data on marriage, births, deaths, etc).
2. The average mark of statistics course for students is 70% would be considered as a
statistics whereas Abebe has got 90% in statistics course is not statistics.
Remark: statistics are aggregate of facts. Single and isolated figures are not statistics as they
cannot be compared and are unrelated.
In its singular sense:- the word Statistics is the subject that deals with the methods of
collecting, organizing, presenting, analyzing and interpreting statistical data.
Classification of Statistics
Statistics is broadly divided into two categories based on how the collected data are used.
Descriptive Statistics:- deals with describing the data collected without going further
conclusion.
Example 1.1: Suppose that the mark of 6 students in Statistics course for Mathematics is given
as 40, 45, 50, 60, 70 and 80. The average mark of the 6 students is 57.5 and it is considered
as descriptive statistics.
Inferential Statistics:- It deals with making inferences and/or conclusions about a population
based on data obtained from a sample of observations. It consists of performing hypothesis
testing, determining relationships among variables and making predictions.
Example 1.2: In the above example, if we say that the average mark in Statistics course for
Mathematics students is 57.5, then we talk about inferential statistics (draw conclusion based
on the sample observation).
1
1.2 Stages of Statistical Investigation
The area of statistics points out the following five stages. These are collection, organization,
presentation, analysis and interpretation of data.
Collection of data: This is the process of obtaining measurements or counts or obtaining raw
data.
Data can be collected in a variety of ways; one of the most common methods is through the
use of sample or census survey. Survey can also be done in different methods, three of the
most common methods are:
• Telephone survey
• Mailed questionnaire
• Personal interview.
Organization of data: - Data collected from published sources are generally in organized form.
However if an investigator has collected data through a survey, it is necessary to edit these
data in order to correct any apparent inconsistencies, ambiguities, and recording errors.
This phase also includes correcting the data for errors, grouping data into classes and
tabulating.
Presentation of data:- After the data have been collected and organized they can be presented
in the form of tables, charts, diagrams and graphs. This presentation in an orderly manner
facilitates the understanding as well as analysis of data.
Analysis of data: - the basic purpose of data analysis is to dig out useful information for
decision making. This analysis may simply be a critical observation of data to draw some
meaningful conclusions about it or it may involve highly complex and sophisticated
mathematical techniques.
Interpretation of data: - Interpretation means drawing conclusions from the data collected and
analyzed. Correct interpretation will lead to a valid conclusion of the study & thus can aid in
decision making.
1.3 Definition of some statistical terms
Population: - It is the totality of objects under study. The population represents the target of an
investigation, and the objective of the investigation is to draw conclusions about the population
hence we sometimes call it target population. The word population doesn’t necessarily refer to
people. Examples:
• All clients of Telephone Company
• All students of Mekdela Amba University (MAU)
• Population of families, etc.
2
The population could be finite or infinite (an imaginary collection of units).
Sample: - is part or subset of population under study.
Sampling frame:- is the list of all possible units of the population that the sample can be drawn
from it. Eg. List of all students of MAU, List of all residential houses in Gimba town, etc
Quantitative variables: - are variables which assume numerical values. eg. Age, weight, etc.
3
b) Economics: Statistics are widely used in economics study and research.
• To measure and forecast Gross National Product (GNP)
• Statistical analyses of population growth, inflation rate, poverty, unemployment figures,
rural or urban population shifts and so on influence much of the economic policy making.
• Financial statistics are necessary in the fields of money and banking including consumer
savings and credit availability.
c) Statistics and research: there is hardly any advanced research going on without the use of
statistics in one form or another. Statistics are used extensively in medical, pharmaceutical and
agricultural research.
Function/Uses of Statistics
Today the field of statistics is recognized as a highly useful tool to making decision process by
managers of modern business, industry, frequently changing technology. It has a lot of functions
in everyday activities. The following are some uses of statistics:
• It condenses and summarizes a mass of data: the original set of data (raw data) is normally
voluminous and disorganized unless it is summarized and expressed in few presentable,
understandable & precise figures.
• Statistics facilitates comparison of data: measures obtained from different set of data can
be compared to draw conclusion about those sets. Statistical values such as averages,
percentages, ratios, rates, coefficients, etc, are the tools that can be used for the purpose of
comparing sets of data.
• Statistics helps to predict future trends: statistics is very useful for analyzing the past and
present data and forecasting future events.
• Statistics helps to formulate & review policies: Statistics provide the basic material for
framing suitable policies. Statistical study results in the areas of taxation, on unemployment
rate, on inflation, on the performance of every sort of military equipment, etc, may convince a
government to review its policies and plans with the view to meet national needs and
aspirations.
• Formulating and testing hypothesis: Statistical methods are extremely useful in formulating
and testing hypothesis and to develop new theories.
Limitations of Statistics
The field of statistics, though widely used in all areas of human knowledge and widely applied
in a variety of disciplines such as engineering, economics and research, has its own limitations.
Some of these limitations are:
4
a) It does not deal with individual values: as discussed earlier, statistics deals with aggregate
of facts. For example, wage earned by an individual worker at any one time, taken by itself is
not a statistics.
b) It does not deal with qualitative characteristics directly: statistics is not applicable to
qualitative characteristics such as beauty, honesty, poverty, standard of living and so on since
these cannot be expressed in quantitative terms. These characteristics, however, can be
statistically dealt with if some quantitative values can be assigned to these with logical criterion.
For example, intelligence may be compared to some degree by comparing IQs or some other
scores in certain intelligence tests.
c) Statistical conclusions are not universally true: since statistics is not an exact science,
as is the case with natural sciences, the statistical conclusions are true only under certain
assumptions.
d) It can be misused: statistics cannot be used to full advantage in the absence of proper
understanding of the subject matter.
1.5 Levels of Measurement
Proper knowledge about the nature and type of data to be dealt with is essential in order to
specify and apply the proper statistical method for their analysis and inferences.
Scale Types
Measurement is the assignment of values to objects or events in a systematic fashion. Four
levels of measurement scales are commonly distinguished: nominal, ordinal, interval, and ratio
and each possessed different properties of measurement systems. The first two are qualitative
while the last two are quantitative.
Nominal scale: The values of a nominal attribute are just different names, i.e., nominal
attributes provide only enough information to distinguish one object from another. Qualities with
no ranking or ordering, no numerical or quantitative value. These types of data are consists of
names, labels and categories. This is a scale for grouping individuals into different categories.
Example 1.3: Eye color: brown, black, etc, sex: male, female.
• In this scale, one is different from the other
• Arithmetic operations (+, -, *, ÷) are not applicable, comparison (<, >, ≠, etc) is
impossible
Ordinal scale: - defined as nominal data that can be ordered or ranked.
• Can be arranged in some order, but the differences between the data values are
meaningless.
5
• Data consisting of an ordering of ranking of measurements are said to be on an ordinal
scale of measurements. That is, the values of an ordinal scale provide enough
information to order objects.
• One is different from and greater /better/ less than the other
• Arithmetic operations (+, -, *, ÷) are impossible, comparison (<, >, ≠, etc) is possible.
Example 1.4 -Letter grading (A, B, C, D, F), -Rating scales (excellent, very good, good, fair,
poor), military status (general, colonel, lieutenant, etc).
Interval Level: data are defined as ordinal data and the differences between data values are
meaningful. However, there is no true zero, or starting point, and the ratio of data values are
meaningless. Note: Celsius & Fahrenheit temperature readings have no meaningful zero and
ratios are meaningless.
In this measurement scale:-
• One is different, better/greater and by a certain amount of difference than another.
• Possible to add and subtract. For example; 800c – 500c = 300c, 700c – 400c = 300c.
• Multiplication and division are not possible. For example; 60 0c = 3(200c). But this does
not imply that an object which is 600c is three times as hot as an object which is 200c.
Most common examples are: IQ, temperature.
Ratio scale: Similar to interval, except there is a true zero (absolute absence), or starting point,
and the ratios of data values have meaning.
• Arithmetic operations (+, -, *, ÷) are applicable. For ratio variables, both differences and
ratios are meaningful.
• One is different/larger /taller/ better/ less by a certain amount of difference and so much
times than the other.
• This measurement scale provides better information than interval scale of
measurement.
Example 1.5: weight, age, number of students.
6
CHAPTER TWO: METHODS OF DATA COLLECTION AND PRESENTATION
This unit will deal how to collect and present the data you have collected so that they can be
of use. Thus, the collected data also known as raw data are always in an unorganized form
and need to be organized and presented in a meaningful and readily comprehensible form in
order to facilitate further statistical analysis.
Objectives: At the end of this chapter students will be able to:
• Arrange raw data in an array and then classified data to construct a frequency table and
a cumulative frequency table.
• To organize data using frequency distribution.
• To present data using suitable graphs or diagrams.
2.1 Methods of Data Collection
Data:- is the raw material of statistics. It can be obtained either by measurement or counting.
When we determine that the appropriate approach to seeking an answer to a question will
require the use of statistics, we begin to search for suitable data to serve as the raw material
for our investigation.
Sources of data
The statistical data may be classified under two categories depending up on the sources.
1. Primary data: - Data collected by the investigator himself for the purpose of a specific
inquiry or study. Such data are original in character & are mostly generated by surveys
conducted by individuals or research institutions.
It is more reliable & accurate since the investigator can extract the correct information by
removing doubts, if any, in the minds of the respondents regarding certain questions.
2. Secondary data: - When an investigator uses data, which have already been collected by
others, such data are called secondary data. Such data are primary data for the agency
that collected them, and become secondary for some one else who uses these data for his
own purposes. Example of secondary data: books, reports, magazines, etc.
When our source is secondary data check that:
✓ The type and objective of the situations.
✓ The purpose for which the data are collected and compatible with the
present problem.
✓ The nature and classification of data is appropriate to our problem.
✓ There are no biases and misreporting in the published data.
Note: Data which are primary for one may be secondary for the other.
7
2.2 Methods of Data Presentation
Having collected and edited the data, the next important step is to organize it. That is to present
it in a readily comprehensible condensed form that aids in order to draw inferences from it. It is
also necessary that the like be separated from the unlike ones.
The presentation of data is broadly classified in to the following two categories:
✓ Tabular presentation
✓ Diagrammatic and Graphic presentation.
The process of arranging data in to classes or categories according to similarities technically is
called classification. It eliminates inconsistency and also brings out the points of similarity and/or
dissimilarity of collected items/data.
Classification is necessary because it would not be possible to draw inferences and conclusions
if we have a large set of collected [raw] data.
2.2.1 Frequency distribution
Frequency: - is the number of times a certain value or class of values occurs.
Frequency distribution (FD):- is the organization of raw data in table from using classes and
frequency.
There are three types of FD and there are specific procedures for constructing each type.
The three types are:-
I. Categorical FD
II. Ungrouped FD and
III. Grouped FD
I. Categorical FD: Used for data that can be placed in specific categories; such as nominal,
ordinal level of data.
Example 2.1: Twenty five patients were given a blood test to determine their blood type. The
data is as shown below: A B B AB A O O B AB B B B O A O O O AB AB A O O B A.
Solution: since the data are categorical by taking the four blood types as classes we can
construct a FD as shown below.
Step 1: Make a table as shown below
Is often constructed for small set of data or data once discrete variable?
✓ First find the smallest and largest raw score in the collected data.
✓ Arrange the data in order of magnitude and count the frequency.
✓ To facilitate counting one may include a column of tallies.
80 76 90 85 80
70 60 62 70 85
65 60 63 74 75
76 70 70 80 85
Construct a frequency distribution, which is ungrouped.
Solution:
Step 1: Find the range, Range=Max-Min=90-60=30.
Step 2: Make a table as shown
Step 3: Tally the data.
Step 4: Compute the frequency.
9
Mark Tally Frequency
60 // 2
62 / 1
63 / 1
65 / 1
70 //// 4
74 / 1
75 // 2
76 / 1
80 /// 3
85 /// 3
90 / 1
-Each individual value is presented separately, that is why it is named ungrouped frequency
distribution.
3. Grouped Frequency Distribution (GFD).
When the range of the data is large the data must be grouped in to classes that are more than
one unit in width.
Definition of some basic terms
• Grouped frequency distribution: is a FD when several numbers are grouped into one
class.
• Class limits (CL): It separates one class from another. The limits could actually appear in
the data and have gaps between the upper limits of one class and the lower limit of the
next class.
• Unit of measure (U): This is the possible difference between successive values. E.g. 1,
0.1, 0.01, 0.001……
• Class boundaries: Separate one class in a grouped frequency distribution from the other.
The boundary has one more decimal place than the raw data. There is no gap between
the upper boundaries of one class and the lower boundaries of the succeeding class.
Lower class boundary is found by subtracting half of the unit of measure from the lower
class limit and upper class boundary is found by adding half unit measure to the upper
class limit.
• Class width (W): The difference between the upper and lower boundaries of any
consecutive class. The class width is also the difference between the lower limit or upper
limits of two consecutive classes.
10
• Class mark (Mid point): It is found by adding the lower and upper class limit (Boundaries)
and divided the sum by two.
• Cumulative frequency (CF): It is the number of observation less than the upper class
boundary or greater than the lower class boundary of class.
• CF (Less than type): it is the number of values less than the upper class boundary of a
given class.
• CF (Greater than type): it is the number of values greater than the lower class boundary
of a given class.
• Relative frequency (Rf ):The frequency divided by the total frequency. This gives the
percent of values falling in that class.
Rfi = fi/n= fi/∑fi
• Relative cumulative frequency (RCf): The running total of the relative frequencies or
the cumulative frequency divided by the total frequency gives the percent of the values
which are less than the upper class boundary or the reverse.
CRfi = Cfi/n= Cfi/∑fi
12
Class Class Class Tally Frequency CF(<) CF(>) RF RCF(>)
limit boundary Mark
6 – 11 5.5 – 11.5 8.5 // 2 2 20 2/20=0.1 1
12 – 17 11.5 – 17.5 14.5 // 2 4 18 2/20=0.1 0.9
18 – 23 17.5 – 23.5 20.5 ///// // 7 11 16 7/20=0.35 0.8
24 – 29 23.5 – 29.5 26.5 //// 4 15 9 4/20=0.2 0.45
30 – 35 29.5 – 35.5 32.5 /// 3 18 5 3/20=0.15 0.25
36 – 41 35.5 – 41.5 38.5 // 2 20 2 2/20=0.1 0.10
➢ Pie chart
➢ Bar chart
➢ Pictogram
A) Pie chart
A pie chart is a circle that is divided in to sections or wedges according to the percentage of
frequencies in each category of the distribution. The angle of the sector is obtained using:
Example 2.4: Draw a suitable diagram to represent the following population in a town.
13
Solutions:
Step 3: Using a protractor and compass, graph each section and write its name with
corresponding percentage.
Boys Men
15% 25%
Girls Women
40% 20%
B) Bar Charts
✓ Used to represent & compare the frequency distribution of discrete variables and
attributes or categorical series.
✓ Bars can be drawn either vertically or horizontally.
✓ All bars must have equal width and the distance between bars must be equal.
✓ The height or length of each bar indicates the size (frequency) of the figure represented.
There are different types of bar charts. The most common being:
14
I. Simple bar chart
✓ Are used to display data on one variable.
✓ They are thick lines (narrow rectangles) having the same breadth. The magnitude of a
quantity is represented by the height /length of the bar.
Example 2.5: Number of students in the four department of Science College given as follows:
800 600
Frequency
Solution:
Example 2.6: Draw a component (sub-divided) bar chart of the number of students by
department is given in the example 2.5.
Solution:
800
600 Female
Frequency 400 Male
200
0
Phys Maths Chem Bio
Department
15
III. Multiple Bar charts
✓ These are used to display data on more than one variable.
✓ They are used for comparing different variables at the same time.
Example 2.7: The following data represent sales by product, 1957- 1959 of a given company
for three products A, B, C.
C) Pictograph
In this diagram, we represent data by means of some picture symbols. We decide about a
suitable picture to represent a definite number of units in which the variable is measured.
The histogram, frequency polygon and cumulative frequency graph or ogive is most commonly
applied graphical representation for continuous data.
16
➢ Represent the class boundaries for the histogram or ogive or the mid points for the frequency
polygon on the X axis.
➢ Plot the points.
➢ Draw the bars or lines to connect the points.
Histogram
A graph which displays the data by using vertical connected bars of various heights to represent
frequencies. Class boundaries are placed along the horizontal axis. Class marks and class
limits are some times used as quantity on the X axis.
Example 2.8: Construct a histogram to represent the following data.
Histogram
Frequency
20
15
15 12
10
10
4 4
5 3 2
0
Class boundaries
Frequency polygon
If we join the mid-points of the tops of the adjacent rectangles of the histogram with line
segments a frequency polygon is obtained. When the polygon is continued to the x-axis just
outside the range of the lengths the total area under the polygon will be equal to the total area
under the histogram.
Example 2.9: Construct a frequency polygon to represent the previous data in example 2.8.
Solution:
17
Class Frequency Class Class R.F. % R.F. Less than More than
limits marks boundaries C.F. C. F.
(percent)
Adding two class marks with f i = 0 , we have 9.5 at the beginning, and 89.5 at the end, the
Frequency Polygon
20
F
r
10
e
q
u
y 0
e 9.5 19.5 29.5 39.5 49.5 59.5 69.5 79.5 89.5
n
c Class mark
An Ogive (pronounced as “oh-jive”) is a line that depicts cumulative frequencies, just as the cumulative
frequency distribution lists cumulative frequencies. Note that the Ogive uses class boundaries along the
horizontal scale, and graph begins with the lower boundary of the first class and ends with the upper
boundary of the last class. Ogive is useful for determining the number of values below or above some
particular value. There are two type of Ogive namely less than Ogive and more than Ogive. The
difference is that less than Ogive uses less than cumulative frequency and more than Ogive uses more
than cumulative frequency on y axis.
18
Example 2.10: Draw a both types of ogives for the F.D. of Example 2.8.
Solutions:
40 40
30 30
20 20
10 10
0 0
14.5 24.5 34.5 44.5 54.5 64.5 74.5 84.5
14.5 24.5 34.5 44.5 54.5 64.5 74.5 84.5
Class Boundaries
Class Boundaries
Note: For both ogives, one class with frequency zero is added for similar reason with the frequency
polygon.
Objectives
At the end of this chapter students will be able to:
• Identify measure of central tendency.
• Understand properties of arithmetic mean.
• Summarize an aggregate of statistical data by using single measure.
• Define and calculate the mean, mode and median.
• Measure the position of data using quartiles, deciles and percentiles with their
interpretation.
3.1 The Summation Notation ()
Statistical Symbols: Let a data set consists of a number of observations, represents by 𝑥1 , 𝑥2
, … , 𝑥𝑛 where n (the last subscript) denotes the number of observations in the data and 𝑥𝑖 is the
ith observation. Then the sum of all numbers (𝑥𝑖 ′𝑠) where i goes from 1 up to n is symbolically
given by ∑𝑛𝑖=1 𝑥𝑖 𝑜𝑟 ∑ 𝑥𝑖 𝑜𝑟 ∑ 𝑥 that is,
∑ 𝑥𝑖 = 𝑥1 + 𝑥2 + … + 𝑥𝑛
x - whole set of numbers
𝑥𝑖 - specific score in a set of numbers
n - total number of observations
19
For instance a data set consisting of six measurements 2, 3, 9, 10, 8 and -2 is represented by
𝑥1 , 𝑥2 , … , 𝑥6 where 𝑥1 = 2, 𝑥2 =3, 𝑥3 =9, 𝑥4 = 10, 𝑥5 = 8 and 𝑥6 =-2 Their sum becomes
∑6𝑖=1 𝑥𝑖 = 𝑥1 + 𝑥2 + … + 𝑥6 = 2+3+9+10+8+ (-2) = 30
Some Properties of the Summation Notation
1. ∑𝑛𝑖=1 𝑐 = n.c, where c is a constant number.
3. Easily understood.
4. Simple to compute.
20
The following are types of Central Tendency which are suitable for a particular type of data.
These are
• Arithmetic Mean
- Weighted Arithmetic Mean
- Combined mean
• Geometric Mean
• Harmonic Mean
• Median
• Mode or modal value
3.3.1 Arithmetic Mean: - Arithmetic mean is defined as the sum of the measurements of the
items divided by the total number of items. It is usually denoted by 𝑥̅ .
Arithmetic Mean for individual series
Suppose 𝑥1 , 𝑥2 , … , 𝑥𝑛 are observed values in a sample of size n from a population of size N,
n<N then the arithmetic mean of the sample, denoted by 𝑥̅ is given by
𝑥1 + 𝑥2+ … +𝑥𝑛 ∑𝑛
𝑖=1 𝑥𝑖
𝑥̅ = =
𝑛 𝑛
21
When the numbers 𝑥1 , 𝑥2 , … , 𝑥𝑘 occur with frequencies 𝑓1 , 𝑓2 , … , 𝑓𝑘 , respectively, then the
mean can be expressed in a more compact form as:
𝑥1 𝑓1 +𝑥2 𝑓2 + …+𝑥𝑘 𝑓𝑘 ∑𝑘
𝑖=1 𝑥𝑖 𝑓𝑖
𝑥̅ = = ∑𝑘
𝑓1 +𝑓2 + …+ 𝑓𝑘 𝑖=1 𝑓𝑖
Example 3.3: Calculate the arithmetic mean of the sample of numbers of students in 10
classes:
50 42 48 60 58 54 50 42 50 42
∑𝑛
𝑖=1 𝑥𝑖 50+42+48+60+58+54+50+42+50+42 496
𝑥̅ = = = = = 49.6 ≈ 50
𝑛 10 10
In this case there are three 42’s, one 48, three 50’s, one 54, one 58 and one 60. The number
of times each number occurs is called its frequency and the frequency is usually denoted by f.
The information in the sentence above can be written in a table, as follows.
Value, xi 42 48 50 54 58 60
Frequency, fi 3 1 3 1 1 1
The formula for the arithmetic mean for data of this type is
𝑥1 𝑓1 +𝑥2 𝑓2 + …+𝑥𝑘 𝑓𝑘 ∑𝑘
𝑖=1 𝑥𝑖 𝑓𝑖
𝑥̅ = = ∑𝑘
𝑓1 +𝑓2 + …+ 𝑓𝑘 𝑖=1 𝑓𝑖
22
Calculate the mean
Solution:
The formula to be used for the mean is as follows:
∑𝑘
𝑖=1 𝑥𝑖 𝑓𝑖
𝑥̅ = ∑𝑘
𝑖=1 𝑓𝑖
Let us calculate these values and make a table for these values for the sake of convenience.
Class Interval (CI) 60-62 62-64 64-66 66-68 68-70 70-72 Total
Mid-Point (𝑥𝑖 ) 61 63 65 67 69 71
• The sum of squares of deviations from the mean is the least comparing to other measure of
n
central tendencies. That is, ( x − A)
i =1
i
2
is minimum when A = x .
w1 x1 + w2 x2 + + wn xn wi xi
xw = =
w1 + w2 + + wn wi
Example 3.5: A student’s final mark in Mathematics, Physics, Chemistry and Biology are
respectively A, B, D and C. If the respective credits (weight) received for these courses are 4,
4, 3 and 2, determine the average grade the student has got for the course.
Solution
We use a weighted arithmetic mean, weight associated with each course being taken as the
number of credits received for the corresponding course.
𝑥𝑖 4 3 1 2 Total
𝑤𝑖 4 4 3 2 13
𝑥𝑖 𝑤𝑖 16 12 3 4 35
w1 x1 + w2 x2 + + wn xn wi xi
xw = =
w1 + w2 + + wn wi
16+12+3+4 35
= = = 2.69
13 13
Combined mean: When a set of observations is divided into k groups and 𝑥̅1 is the mean of n1
observations of group 1, 𝑥̅2 is the mean of n2 observations of group2, …, 𝑥̅𝑘 is the mean of nk
24
observations of group k, then the combined mean, denoted by 𝑥̅ 𝑐 , of all observations taken
together is given by
This is a special case of the weighted mean. In this case the sample sizes are the weights.
Example 3.6: In the Previous year there were two sections taking Statistics course. At the end
of the semester, the two sections got average marks of 70 & 78. There were 45 and 50 students
in each section respectively. Find the mean mark for the entire students.
Solution:
Geometric mean for individual series: The geometric mean, G.M. of an individual series of
positive numbers (> 0) 𝑥1 , 𝑥2 , … , 𝑥𝑛 is defined as the nth root of their product.
1
G.M . = n x1f1 .x2f2 ..xmf m = antilog ( ∑ 𝑓𝑖 𝑙𝑜𝑔𝑥𝑖 ) where n is sum of 𝑓𝑖 for all i.
𝑛
25
Example 3.8: Compute the geometric mean of the following values: 3, 3, 4, 4, 4, 5, 6 and 6.
Solution
Values 3 4 5 6
Frequency 2 3 1 2
8
G.M. = √32 𝑋43 𝑋51 𝑋62 = 4.236
Geometric mean for continuous grouped FD:- The above formula can also be used
whenever the frequency distribution is grouped continuous, class marks of the class intervals
are considered as xi.
Example 3.9 A car travels 25 miles at 25 mph, 25 miles at 50 mph, and 25 miles at 75 mph.
Find average mean ( the harmonic mean) of the three velocities.
Solution
3
H .M =
n = 1 1 1 = 40.9
+ +
1 1 1 25 50 75
+ ++
x1 x2 xn
Harmonic mean for discrete data arranged in FD: If the data is arranged in the form of
frequency distribution
n
H .M = , where n = f k
m
f1 f 2 f
+ ++ m k =1
x1 x 2 xm
Harmonic mean for continuous grouped FD: Whenever the frequency distribution are
grouped continuous, class marks of the class intervals are considered as 𝑥𝑖 and the above
formula can be used as
𝑛 m
H.M. = 𝑓𝑖 where n = f k
∑𝑛
𝑖=1 k =1
𝑥𝑖
Solution;
i. The data in ascending order is given by:
-5 0 1 2 4 5 6 8 10 15
n=10 ➔n is even. The two middle values are 5th and 6th observations. So the median is,
10 10
( )𝑡ℎ +( +1)𝑡ℎ 5𝑡ℎ +6𝑡ℎ 4+5
2 2
𝑥̃ = value = = = 4.5
2 2 2
27
Median for Discrete data arranged in a frequency distribution:- In this case also, the
median is obtained by the above formula. After arranging the values in an increasing order find
the smallest CF greater than or equal to the rank/position of the median value (i.e., that value
obtained by a & b above formula) and the corresponding value is the median.
Median for grouped continuous data:-For continuous data, the median is obtained by the
following formula.
w n
Median = L + − CF = ~
x
f med 2
f med = the frequency of the median class; and CF = the cum. freq. corresponding to the
class preceding the median class. That is, the sums of the frequencies of all classes lower than the
median class. Where the median class is the class which contains the (n/2)th observation whether n
is odd or even, since the items have already lost their originality once they are grouped in to
continuous classes.
Example 3.11: Calculate the median for the following frequency distribution.
C.I 1 - 5 6 - 10 11 – 15 16 – 20 21 - 25 26 - 30 31 – 35 Total
Freq. 4 8 12 6 3 4 3 40
C.I 1 - 5 6 - 10 11 – 15 16 – 20 21 - 25 26 - 30 31 – 35 Total
Freq. 4 8 12 6 3 4 3 40
Cuml. Freq. 4 12 24 30 33 37 40
Since n = 40, 40/2 = 20, and the smallest CF greater than or equal to 20 is 24; thus, the median
class
is the third class. And for this class, L = 10.5, w = 5, f med =12, CF = 12. Then applying the formula,
we get:
~
x =10.5+(20-12)*5/12=13.8
28
Merits of median
• It is less affected by extreme values.
• Median can be calculated even in case of open-ended intervals.
• It can be computed for ratio, interval, and ordinal level of data.
Demerits of median
• Its value is not determined by each & every observation.
• It is not a good representative of the data if the number of items (data) is small.
• The arrangement of items in order of magnitude is sometimes very tedious process if
the number of items is very large.
3.3.5 The Mode or modal value
The mode or the modal value is the value with the highest frequency and denoted by 𝑥̂. A data
set may not have a mode or may have more than one mode. A distribution is called a bimodal
distribution if it has two data values that appear with the greatest frequency. If a distribution has
more than two modes, then the distribution is multimodal. If a distribution has no modes, then
the distribution is no modal.
Mode of individual series:- The mode or the modal value of individual series (raw data) is simply
obtained by locating the observation with the maximum frequency.
Mode for discrete data arranged in a frequency distribution:-In the case of discrete grouped
data, the mode is determined just by looking to that value (s) having the highest frequency.
29
In such cases, one can only determine the modal class easily: the class with the highest frequency.
1
Mode = L + w , where L = the lower class boundary of the modal class; 1 = f mod − f 1 ,
1 + 2
preceding the modal class; f 2 = frequency of the class immediately succeeding the modal class;
and fmode = frequency of the modal class.
Example 3.13: Calculate the mode for the frequency distribution of data of example 3.11.
Solution: By inspection, the mode lies in the third class, where L =10.5, fmod = 12, f1=8, f2=6, w =
5
1
Mode = L + w = 10.5 + (12-8)*5/(12-8)+(12-5) = 12.5
1 + 2
Merits of mode
30
• The median is resistant to the influence of extreme data values or outliers and its value
does not respond strongly to the changes of a few extreme data values regardless of
how large the change may be.
• The mode has an advantage over both the mean and the median when the data is
categorical since it is not possible to calculate the mean or median for this type of data.
Also, the mode usually indicates the location within a large distribution where the data
values are concentrated. However, the mode can not always be calculated because if a
distribution has all different data values, then the distribution is non modal.
• In the case of symmetrical distribution; mean, median and mode coincide. That is
mean=median = mode. However, for a moderately asymmetrical (non symmetrical)
distribution, mean and mode lie on the two ends and median lies between them and they
have the following important empirical relationship, which is
Mean – Mode = 3(Mean - Median)
Example 3.14: In a moderately asymmetrical distribution, the mean and the mode are 30 and
42 respectively. What is the median of the distribution?
Solution:
Median = (2mean + Mode)/2 = (2*30 + 42)/3 = 34
Hence the median of the distribution is 34.
Which of the Three Measures is the Best?
At this stage, one may ask as to which of these three measure of central tendency is the best.
There is no simple answer to this question. It is because these three measures are based upon
different concepts. The arithmetic mean is the sum of the values divided by the total number of
observations in the series. The median is the value of the middle observations tend to
concentrate. As such; the use of a particular measure will largely depend on the purpose of the
study and the nature of the data. For example, when we are interested in knowing the
consumers’ preferences for different brands of television sets or kinds of advertising, the choice
should go in favor of mode. The use of mean and median would not be proper. However, the
median can sometimes be used in the case of qualitative data when such data can be arranged
in an ascending or descending order. Let us take another example. Suppose we invite
applications for a certain vacancy in our company. A large number of candidates apply for that
post. We are now interested to know as to which age or age group has the largest concentration
of applicants. Here, obviously the mode will be the most appropriate choice. The arithmetic
mean may not be appropriate as it may be influenced by some extreme values.
31
3.5 Measures of Non-central Locations
Median is the value of the middle item which divides the data in to two equal parts and found
by arranging the data in an increasing or decreasing order of magnitude, where as quintiles are
measures which divides a given set of data in to approximately equal subdivision and are
obtained by the same procedure to that of median. They are averages of position (non-central
tendency). Some of these are quartiles, deciles and percentiles.
Quartiles: are values which divide the data set in to approximately four equal parts, denoted
by 𝑄1 , 𝑄2 𝑎𝑛𝑑 𝑄3 . The first quartile (𝑄1) is also called the lower quartile and the third quartile
(𝑄3 ) is the upper quartile. The second quartile ( 𝑄2 ) is the median.
• Quartiles for Individual series:
Let x1 , x 2 , , x n be n ordered observations. The ith quartile (Qi ) is the value of the item
corresponding
That is, after arranging the data in ascending order, Q 1, Q2, & Q3 are, obtained by:
• Quartiles in continuous data:- For continuous data, use the following formula:
w in
Qi = L + − CF
f Qi 4
Where i = 1,2, 3, and L, w ,fQi and CF are defined in the same way as the median.
𝑤 𝑛 𝑤 2𝑛 𝑤 3𝑛
i.e. Q1 = L +𝑓 (4 − 𝐶𝐹) , Q2 = L + 𝑓 ( 4 − 𝐶𝐹) 𝑎𝑛𝑑 Q3 = L + 𝑓 ( 4 − 𝐶𝐹)
𝑄1 𝑄2 𝑄3
The class under question is the one including (ixn/4)th value. That is, the class with the minimum
frequency greater than or equal to (ixn/4) th is the class of the ith quartile.
Deciles: are values dividing the data approximately in to ten equal parts, denoted by 𝐷1 , 𝐷2,…, 𝐷9 .
• Deciles for Individual Series:
32
Let x1 , x 2 , , x n be n ordered observations. The ith deciles (𝐷𝑖 ) is the value of the item
corresponding
That is, after arranging the data in ascending order, D1, D2, . . . & D9 are, obtained by:
distribution this case also, we will follow the same procedure as the median. That is, we
construct the less than cumulative frequency distribution and apply the formula of deciles for
individual series.
• Deciles for continuous data: Apply the following formula and follow the procedures of
quartile for continuous data.
𝑤 𝑖𝑛
𝐷𝑖 = 𝐿 + (10 − 𝐶𝐹) ,i = 1, 2,...,9 . Then
𝑓𝐷𝑖
Define the symbols in similar ways as we did in the case of quartiles for continuous data.
Percentiles: are values which divide the data approximately in to one hundred equal parts,
and
denoted by 𝑃1 , 𝑃2,…, 𝑃99 .
• Percentiles for Individual Series:
Let x1 , x 2 , , x n be n ordered observations. The ith percentile (𝑃𝑖 ) is the value of the item
That is, after arranging the data in ascending order, P 1, P2, . . . & P99 are, obtained by:
33
Define the symbols similar ways as we did in the case of quartiles or deciles for continuous
data.
Interpretations
1. 𝑄𝑖 is the value below which ( i × 25) percent of the observations in the series are found
(where i = 1, 2,3). For instance 𝑄3 means the value below which 75 percent of observations
in the given series are found.
2. 𝐷𝑖 is the value below which ( i ×10) percent of the observations in the series are found
(where i = 1, 2,...,9 ). For instance 𝐷4 is the value below which 40 percent of the values are
found in the series.
3. 𝑃𝑖 is the value below which i percent of the total observations are found (where i = 1,
2,3,...,99 ). For example 60 percent of the observations in a given series are below 𝑃60 .
Example 3.15: Calculate 𝑄1 , 𝑄2 , 𝑄3, 𝐷4, 𝐷9, 𝑃40 & 𝑃90 for the following data given on the table
below.
X 10 11 12 13 14 15 16 17 18
F 2 8 25 48 65 40 20 9 2
Solution: The data is arranged in an increasing order. So we need to construct only the
cumulative frequency table before calculating the required values.
X 10 11 12 13 14 15 16 17 18
F 2 8 25 48 65 40 20 9 2
The total number of observations is 219 which is odd. Clearly then the median is 14. i.e.
𝑛+1 𝑡ℎ 219+1 𝑡ℎ
𝑥̃ = ( ) =( ) value = 110th value = 14
2 2
1(𝑛+1) 𝑡ℎ 1(219+1) 𝑡ℎ
𝑄1 = ( ) 𝑣𝑎𝑙𝑢𝑒 = ( ) 𝑣𝑎𝑙𝑢𝑒 = 55th value = 13
4 4
2(𝑛+1) 𝑡ℎ 2(219+1) 𝑡ℎ
𝑄2 = ( ) 𝑣𝑎𝑙𝑢𝑒 = ( ) 𝑣𝑎𝑙𝑢𝑒 = 110th value = 14 = 𝑥̃
4 4
3(𝑛+1) 𝑡ℎ 3(219+1) 𝑡ℎ
𝑄3 = ( ) 𝑣𝑎𝑙𝑢𝑒 = ( ) 𝑣𝑎𝑙𝑢𝑒 = 165th value = 15
4 4
4(𝑛+1) 𝑡ℎ 4(219+1) 𝑡ℎ
𝐷4 = ( ) 𝑣𝑎𝑙𝑢𝑒 = ( ) 𝑣𝑎𝑙𝑢𝑒 = 88th value = 14
10 10
34
9(𝑛+1) 𝑡ℎ 9(219+1) 𝑡ℎ
𝐷9 = ( ) 𝑣𝑎𝑙𝑢𝑒 = ( ) 𝑣𝑎𝑙𝑢𝑒 = 198th value = 16
10 10
40(𝑛+1) 𝑡ℎ 40(219+1) 𝑡ℎ
𝑃40 = ( ) 𝑣𝑎𝑙𝑢𝑒 = ( ) 𝑣𝑎𝑙𝑢𝑒 = 88th value = 14
100 100
90(𝑛+1) 𝑡ℎ 90(219+1) 𝑡ℎ
𝑃90 = ( ) 𝑣𝑎𝑙𝑢𝑒 = ( ) 𝑣𝑎𝑙𝑢𝑒 = 198th value = 16
100 100
Example 3.16: Marks of 50 students out of 85 is given below. Based on the data find 𝑄1,
𝐷4 𝑎𝑛𝑑 𝑃7.
Marks 46-50 51-55 56-60 61-65 66-70 71-75 76-80
fi 4 8 15 5 9 5 4
Solution:- first find the class boundaries and cumulative frequency distributions.
Marks 46-50 51-55 56-60 61-65 66-70 71-75 76-80
fi 4 8 15 5 9 5 4
Cum. 4 12 27 32 41 46 50
Frequency
Q1 Measure of (n/4)th value = 12.5th value which lies in group 55.5 – 60.5
𝑤 𝑛 5
Q1 = L +𝑓 (4 − 𝐶𝐹) = 55.5 +15 (12.5 − 12) = 55.7
𝑄1
D4 Measure of (4n/10)th value = 20th value which lies in group 55.5 – 60.5.
𝑤 4𝑛 5
D4 = L +𝑓 (10 − 𝐶𝐹) = 55.5 +15 (20 − 12) = 58.2
𝐷4
P7 Measure of (7n/100)th value = 3.5th value which lies in group 45.5 – 50.5
𝑤 7𝑛 5
P7 = L +𝑓 (100 − 𝐶𝐹) = 45.5 +4 (3.5 − 0) = 49.875.
𝑃7
35
CHAPTER FOUR: MEASURES OF DISPERSION (VARIATION)
4.1 Introduction
Just as central tendency can be measured by a number in the form of an average, the amount
of variation (dispersion, spread, or scatter) among the values in the data set can also be
measured. The measures of central tendency describe that the major part of values in the data
set appears to concentrate around a central value called average with the remaining values
scattered (distributed) on either sides of that value. But these measures do not reveal how these
values are dispersed (spread or scatter) on each side of the central value. The dispersion of
values is indicated by the extent to which these values tend to spread over an interval rather
than cluster closely around an average.
The term dispersion is generally used in two senses. Firstly, dispersion refers to the variations
of the items among themselves. If the value of all the items of a series is the same, there will
be no variation among different items of a series. Secondly, dispersion refers to the variation
of the items around an average. If the difference between the value of items and the average
is large, the dispersion will be high and on the other hand if the difference between the value of
the items and averaging is small, the dispersion will be low. Thus, dispersion is defined as
scatteredness or spreadness of the individual items in a given series.
36
These measures are not suitable for comparing the variability in two distributions
having variables expressed in different units.
Relative measure
of dispersion
𝑅 =𝐿−𝑆
Where R=Range, L= Largest value in a given set of data, S= smallest value in a given set of
data.
37
• The difference between upper class limit of the last class and the lower class limit of the
first class, or
• The difference between the largest class mark and the smallest class mark, or
• The difference between the upper class boundary of the last class and the lower class
boundary of the first class.
The range is used in describing like the maximum change in daily temperature, rainfall, etc.
When the sample size is small, it can be an adequate measure of variation. It is commonly used
in quality control.
L−S
𝑅𝑒𝑙𝑎𝑡𝑖𝑣𝑒 𝑅𝑎𝑛𝑔𝑒(𝑅𝑅) =
L+S
Example 4.1: Five students obtained the following marks in statistics: 20, 35, 25, 30, 15. Find the
range and relative range
𝑅𝑎𝑛𝑔𝑒 = 𝐿 − 𝑆 = 35 − 15 = 20
L−S 35 − 15
𝑅𝑅 = = = 0.4
L+S 35 + 15
Example 4.2: Find out range and relative range of the following given data.
Frequency 4 9 15 30 40
Solution: Here,
30 − 5
Range = 30 – 5 = 25, 𝑅𝑅 = = 0.7143 .
30 + 5
Inter-quartile range and quartile deviation are other measures of dispersion. The difference
between the upper quartile (𝑄3 ) and lower quartile (𝑄1 ) is called inter-quartile range.
Symbolically,
The inter-quartile ranges covers dispersion of middle 50% of the items of the series. Quartile
deviation, also called semi-inter-quartile range, is half of the difference between the upper and
lower quartile. That is, half of the inter-quartile range. Its formula is
𝑄3 − 𝑄1
𝑄𝑢𝑎𝑟𝑡𝑖𝑙𝑒 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 (𝑄𝐷) =
2
The relative measure of quartile deviation also called the coefficient of quartile deviation (CQD)
𝑄 −𝑄
is defined as: 𝐶𝑄𝐷 = 𝑄3+𝑄1
3 1
Example 4.3: Find inter-quartile range, quartile deviation and coefficient of quartile deviation
from the following data.
Solution: First arrange the data in ascending order. 15, 18, 20, 24, 27, 28, 30
𝑛 + 1 𝑡ℎ 7 + 1 𝑡ℎ
𝑄1 = 𝑠𝑖𝑧𝑒 𝑜𝑓 ( ) 𝑖𝑡𝑒𝑚 = 𝑠𝑖𝑧𝑒 𝑜𝑓 ( ) 𝑖𝑡𝑒𝑚
4 4
𝑛 + 1 𝑡ℎ 7 + 1 𝑡ℎ
𝑄3 = 𝑠𝑖𝑧𝑒 𝑜𝑓 3 ( ) 𝑖𝑡𝑒𝑚 𝑠𝑖𝑧𝑒 𝑜𝑓 3 ( ) 𝑖𝑡𝑒𝑚
4 4
39
𝐼𝑄𝑅 = 𝑄3 − 𝑄1 = 28 − 18 = 10
𝑄3 − 𝑄1 28 − 18
𝑄𝐷 = = =5
2 2
𝑄3 − 𝑄1 28 − 18
𝐶𝑄𝐷 = = = 0.217
𝑄3 + 𝑄1 28 + 18
Example 4.4: Find inter-quartile range, quartile deviation and coefficient of quartile deviation
from the following data
Marks 2 3 4 5 6 7 8 9
No. Of students 10 11 12 13 5 12 7 5
Solution:
Marks 2 3 4 5 6 7 8 9
No. of students 10 11 12 13 5 12 7 5
CF 10 21 33 46 51 63 70 75=N
𝑁+1 75 + 1
𝑄1 = ( )= = 19𝑡ℎ 𝑖𝑡𝑒𝑚 = 3
4 4
𝑁+1 75+1
𝑄3 = 3 ( ) = 3( ) = 57th item = 7
4 4
𝐼𝑄𝑅 = 𝑄3 − 𝑄1 = 7 − 3 = 4
𝑄3 − 𝑄1 7 − 3
𝑄𝐷 = = =2
2 2
𝑄3 − 𝑄1 7 − 3
𝐶𝑄𝐷 = = = 0.4
𝑄3 + 𝑄1 7 + 3
Remark: Q.D or CQD includes only the middle 50% of the observation.
Merits of QD
40
Demerits of QD
➢ It is not based on all the items (it ignores 50% items, i.e., the first 25% and the last
25%).
➢ It is greatly influenced by sampling fluctuations.
➢ It is not amenable to algebraic manipulations.
class and n = ∑ 𝑓𝑖 .
1. The mean deviation about the arithmetic mean is, therefore, given by
∑|𝑋 −𝑋| ̅
𝑀𝐷(𝑋̅) = 𝑛𝑖 … for ungrouped data (individual series).
∑ 𝑓𝑖 |𝑋𝑖 −𝑋̅|
𝑀𝐷 (𝑋̅) = 𝑛
. . . for discrete data arranged in FD and a grouped continuous frequency distribution;
where 𝑋𝑖 is the value for discrete data arranged in FD and class mark of the ith class for continuous
grouped data, 𝑓𝑖 is the frequency of the ith class and n = ∑ 𝑓𝑖 .
41
2. The mean deviation about the median is also given by
∑|𝑋 −x̃|
𝑀𝐷(𝑋̃) = 𝑛𝑖 … for ungrouped data (individual series).
∑ 𝑓 |𝑋 −x̃|
𝑀𝐷(𝑋̃) = 𝑖 𝑛 𝑖 . . . for discrete data arranged in FD and a grouped continuous frequency distribution;
where 𝑋𝑖 is the value for discrete data arranged in FD and class mark of the ith class for continuous
grouped data , 𝑓𝑖 is the frequency of the ith class and n = ∑ 𝑓𝑖 .
∑ 𝑓𝑖 |𝑋𝑖 −x̂|
𝑀𝐷(x̂) = 𝑛
. . for discrete data arranged in FD and a grouped continuous frequency distribution;
where 𝑋𝑖 is the value for discrete data arranged in FD and class mark of the ith class for continuous
grouped data, 𝑓𝑖 is the frequency of the ith class and n = ∑ 𝑓𝑖 .
|𝑋𝑖 − x̃| 1.5 1.5 0.5 0.5 0.5 0.5 1.5 1.5 2.5 3.5 14
|𝑋𝑖 − 𝑋̂| 1 1 0 0 0 1 2 2 3 4 14
42
Since the distribution is ungrouped the mean deviation about mean, median and mode:
∑|𝑋𝑖 − 𝑋̅| 14
𝑀𝐷(𝑋̅) = = = 1.4
𝑛 10
∑|𝑋𝑖 − x̃| 14
𝑀𝐷(𝑋̃) = = = 1.4
𝑛 10
∑|𝑋𝑖 −x̂| 14
𝑀𝐷(x̂) = = 10 = 1.4
𝑛
Merits of 𝑴𝑫
Demerit of 𝑴𝑫
➢ It does not take in to account the signs of the deviations of items from the average.
Remark: Of all the mean deviations taken about different averages or any arbitrary value, the
mean deviation about the median has the smallest value.
The relative measure of mean deviation, also called the coefficient of mean deviation is obtained
by dividing mean deviation by the particular average used in computing mean deviation. Thus,
𝑀𝐷(𝑋 ) ̅
𝐶𝑀𝐷(𝑋̅) = 𝑋̅ where MD is the mean deviation calculated about the arithmetic mean.
➢ CMD about the median is given by:
𝑀𝐷(𝑋 ) ̃
𝐶𝑀𝐷(𝑋̃) = 𝑋̃ in which case MD is calculated about the median of the observations.
Example 4.6: Calculate the coefficient of mean deviation about the mean, median and mode
for the data in Example 4.5 above.
Solution:
𝑀𝐷(𝑋̅) 1.4
𝐶𝑀𝐷(𝑋̅) = = = 0.23
𝑋̅ 6
43
𝑀𝐷(𝑋̃) 1.4
𝐶𝑀𝐷(𝑋̃) = = = 0.25
𝑋̃ 5.5
𝑀𝐷(x̂) 1.4
𝐶𝑀𝐷(x̂) = = = 0.28
x̂ 5
4.3.4 The Variance, Standard Deviation and Coefficient of Variation
Like the mean deviation, the variance is also based on all observations in a set of data.
But the variance is the average of squared deviations from the mean. Recall that the sum of
squared deviations is minimum only when taken from the mean. Squared deviations are
mathematically manipulated than absolute deviations. Thus, if we averaged the squared
deviations from the mean and take the square root of the result (to compensate for the fact that
the deviations were squared), we obtain the standard deviation. This overcomes the limitation
of the mean deviation.
mark of the ith class, fi is the frequency of the ithclass and N=∑ fi
Sample Variance (𝑺𝟐 )
One would expect the sample variance to simply be the population variance with the population
mean replaced by the sample mean. However, one of the major uses of statistics is to estimate
the corresponding parameter. This formula has the problem that the estimated value isn't the
same as the parameter. To offset this, the sum of the squares of the deviations is divided by
one less than the sample size.
• For ungrouped data
∑𝑛
𝑖=1(𝑥𝑖 −𝑥̅ )
2 1
𝑆2 = = 𝑛−1 [∑ni=1 xi 2 − 𝑛𝑥̅ 2 ] where 𝒙
̅ is the sample arithmetic mean and n is the
𝑛−1
44
• For discrete data arranged in FD
If the values xi have frequencies fi (i=1, 2,…,m), then the sample variance is given by:
1 m
or S = fi ( xi − x )
∑ 𝑓𝑖 (𝑥𝑖 −𝑥̅ )2 1 2 2
2 2 2
𝑆 = = 𝑛−1 [∑ fi xi − 𝑛𝑥̅ ]
𝑛−1 n − 1 i =1
of the ith class, fi is the frequency of the ith class and n=∑ fi .
The Standard Deviation
There is a problem with variances. Recall that the deviations were squared. That means that
the units were also squared. To get the units back the same as the original data values, the
square root must be taken.
➢ Population Standard Deviation (s )
𝜎 = √𝝈𝟐 where 𝜎 2 is the population variance.
➢ Sample Standard Deviation ( S )
𝑆 = √𝑆 2 where 𝑆 2 is the sample variance.
Example 4.7: Find the sample variance and standard deviation of:
xi 2 4 5 6 8
fi 2 2 3 1 2
1 1 49 1
𝑆 2 = 𝑛−1 [∑ fi xi 2 − 𝑛𝑥̅ 2 ] = 9 [279 − 10(10)2 ] = 9 (38.9) = 4.32, 𝑎𝑛𝑑 𝑆 = √4.32 = 2.08.
45
Example 4.8: Find the sample variance and standard deviation for the distribution:
Freq. 4 1 2 3
Solution: In a continuous F.D., xi is the class mark representing the ith class.
C.I xi fi f i xi f i xi
2
1-5 3 4 12 36
6-10 8 1 8 64
11-15 13 2 26 338
16.20 18 3 54 972
1 1
𝑆 2 = 𝑛−1 [∑ fi xi 2 − 𝑛𝑥̅ 2 ] = 9 [1410 − 10(10)2 ]
410
= = 45.56,
9
𝑆 = √45.56 = 6.75.
1. If a constant is added to (or subtracted from) all the values, the variance remains the
same; i.e., for any constant k, V ( xi k ) = V ( xi ) .
Example 4.9 Consider the 6 sample values xi: 54,52,53,50,51, and 52.
The sample variance is 2 = V (xi ) . Now, subtract 50 from each value to get:
2. If each and every value is multiplied by a non-zero constant (k), the standard deviation
is multiplied by |𝑘| and the variance is multiplied by k2; i.e., V (kxi ) = k 2V ( xi ) .
3. Both the variance and the standard deviation give more weight to extreme values and
less to those which are near to the mean.
46
Coefficient of Variation
The standard deviation is an absolute measure of dispersion. The corresponding relative
measure is known as the coefficient of variation (CV).
Of course, standard deviation is an absolute measure of dispersion that expresses the variation
in the same unit as the original data but it cannot be the sole basis for comparing two
distributions. For instance, if we have a standard deviation of 10 and a mean of 5, the values
vary by an amount twice as large as the mean itself. If, on the other hand, we have a standard
deviation of 10 and a mean of 5000, the variation relative to the mean is significant. Therefore,
we cannot know the dispersion of a set of data until we know the standard deviation, the mean,
and how the standard deviation compares with the mean.
Coefficient of variation is used in such problems where we want to compare the variability of
two or more different series. Coefficient of variation is the ratio of the standard deviation to the
arithmetic mean, usually expressed in percent.
𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
CV = × 100%
𝑚𝑒𝑎𝑛
Mean score 85 65
Standard deviation 25 12
Compare the relative dispersions of the two departments’ scores using the appropriate way.
Solution:
47
Mathematics Departments Chemistry Departments
𝑆 𝑆
CV = ̅ × 100 CV = ̅ × 100
x x
25 12
= 85 × 100 = 65 × 100
= 29.41% = 18.46%
Interpretation: Since the CV of Mathematics Department students is greater than that of
Chemistry Department students, we can say that there is more dispersion relative to the mean
in the distribution of Mathematics students’ scores compared with that of Chemistry students.
4.4 Standard Scores (Z-Scores)
A standard score for sample value in a data set is obtained by subtracting the mean of the data
set from the value and dividing the result by the standard deviation of the data set. Basically,
the standard score (z-score) tells us how many standard deviations a specific value is above or
below the mean value of the data set. That is, the z-score is the number of standard deviations
the data value falls above (positive z-score) or below (negative z-score) the mean for the data
set.
𝑋−𝜇
𝑍 𝑠𝑐𝑜𝑟𝑒 =
𝜎
Z-score computed from the sample
𝑋 − 𝑋̅
𝑍 𝑠𝑐𝑜𝑟𝑒 =
𝑆
Example 4.11: What is the Z-score for the value of 14 in the following sample data set?
3 8 6 14 4 12 7 10
Solution:
14−8
𝑋̅ = 8, SD = 3.8173 thus, Z =3.8173 ≈ 1.57.
The data value of 14 is located 1.57 standard deviations above the mean 8 because the z-
score is positive.
Example 4.12: Suppose that a student scored 66 in Statistics and 80 in Mathematics. The
score of the summary of the courses is given below.
Course Average score Standard deviation of the score
Statistics 51 12
Mathematics 72 16
48
In which course did the student scored better as compared to his classmates?
Solution:
𝑋−𝜇 66−51 15
Z-score of student in Statistics: 𝑍 = = = 12 = 1.25
𝜎 12
𝑋−𝜇 80−72 8
Z-score of student in Mathematics: 𝑍 = = = 16 = 0.5
𝜎 16
From these two standard scores, we can conclude that the student has scored better in
Statistics course relative to his classmates than in Mathematics course.
The measures of central tendency and variation discussed in previous one do not reveal the
entire story about a frequency distribution. Two distributions may have the same mean and
standard deviation but may differ in their shape of the distribution. Further description of their
characteristics is necessary that is provided by measures of skewness and kurtosis.
4.5.1 Moments
Moments are statistical tools used in statistical investigation. The moments of a distribution are
the arithmetic mean of the various powers of the deviations of items from some number. In our
course, we shall use it in the study of Skewness and Kurtosis of statistical distribution.
∑ 𝑋𝑖 𝑟
𝑀𝑟 = , Where 𝑟 = 0, 1, 2, 3, …
𝑛
Moments about the origin for grouped frequency distribution and for ungrouped frequency
∑ 𝑓 𝑖 𝑋𝑖 𝑟
distribution is 𝑀𝑟 = 𝑛
Where 𝑓𝑖 is the frequency of 𝑋𝑖 . 𝑋𝑖 is the midpoint in the case of grouped frequency distribution
or class value in the case of ungrouped frequency distribution.
∑(𝑋𝑖 − 𝑋̅)𝑟
𝑀𝑟′ =
𝑛
49
Moments about the mean for grouped frequency distribution and for ungrouped frequency
∑ 𝑓𝑖 (𝑋𝑖 −𝑋̅ )𝑟
distribution. 𝑀𝑟′ =
𝑛
Where 𝑓𝑖 is the frequency of 𝑋𝑖 . 𝑋𝑖 is the midpoint in the case of grouped frequency distribution
or class value in the case of ungrouped frequency distribution.
∑(𝑋𝑖 − 𝐴)𝑟
𝑀𝑟′ =
𝑛
Moments about any arbitrary constant 𝐴 for grouped frequency distribution and for ungrouped
∑ 𝑓𝑖 (𝑋𝑖 −𝐴)𝑟
frequency distribution 𝑀𝑟′ = .
𝑛
Example 4.13: Find the first four moments about the mean for the following individual
series 𝑋𝑖 : 3 6 8 10 18
Solution: n=5,
S.No 𝑿𝒊 ̅)
(𝑿𝒊 − 𝑿 ̅ )𝟐
(𝑿𝒊 − 𝑿 ̅ )𝟑
(𝑿𝒊 − 𝑿 ̅ )𝟒
(𝑿𝒊 − 𝑿
1 3 -6 36 -216 1296
2 6 -3 9 -27 81
3 8 -1 1 -1 1
4 10 1 1 1 1
5 18 9 81 729 6561
Thus,
51
How to check the presence of skewness in a distribution?
A measure of skewness gives a numerical expression for and the direction of asymmetry in a
distribution. It gives information about the shape of the distribution and the degree of variation
on either side of the central value. The three most commonly used measures of skewness are
Pearson’s coefficient of skewness, Bowley’s coefficient of skewness and coefficient of
skewness based on moments.
𝛼3 > 0,➔ the distribution is positively skewed/skewed to the right, i.e mode < median <mean
➔smaller observations are less frequent than larger observations. i.e., the majority of
4.5.3 Kurtosis
53
𝑀′4 5.8
b/ 𝛼4 = = 1.62 = 2.26 < 3, ➔the curve is platykurtic.
𝑀′22
Example 4.14: Find the coefficient of skewness and the coefficient of kurtosis for the
above example 4.13.
Solution:
𝑀′3 97.2 97.2
i) 𝛼3 = 3/2 = 3 = 129.527 = 0.75
𝑀′2 (25.6)2
Example 5.1: In an experiment of rolling a fair die, S = {1, 2, 3, 4, 5, 6}, each sample point is
an equally likely outcome. It is possible to define many events on this sample space as follows:
54
C = {1, 3, 5} - the event of getting an odd number.
Example 5.2
If we toss a coin the sample space (S) of this experiment S = {head, tail} where head and tail
are two faces of a coin. If we are interested the outcome of head will turn up then the event E=
{head}.
Example 5.3: Find the sample space of tossing a coin three times.
S= {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT}
• Mutually exclusive event: - two events A and B are said to be mutually exclusive if there
is no sample point which is common to A and B. i.e. A ∩ B = ∅
• Independent event: two or more events are said to be independent if the occurrence or
non-occurrence of an event does not affect the occurrence or non-occurrence of the other.
• Dependent Events: Two events are dependent if the first event affects the outcome or
occurrence of the second event in a way the probability is changed.
• Complement of an Event: the complement of an event A means nonoccurrence of A and
is denoted by A', or Ac contains those points of the sample space which don’t belong to A.
• Equally likely outcomes: if each outcome in a sample space has the same chance to be
occurred.
Example 5.4: Casting a fair die all possible outcomes are equally likely.
5.2 Counting rules: addition, multiplication, Permutation & Combination rule
In order to calculate probabilities, we have to know
• The number of elements of an event.
• The number of elements of the sample space.
That is in order to judge what is probable, we have to know what is possible.
In order to determine the number of out comes one can use several rules of counting:
1. The addition rule
2. The multiplication rule
3. Permutation rule
4. Combination rule
1. The addition Rule
Suppose that a procedure, designated by 1, can be done in n 1 ways. Assume that a second
procedure designated by 2, can be done in n2 ways. Suppose furthermore, that it is not possible
that both 1 and 2 done together. Then, the number of ways in which we can do1 or 2 is 𝑛1 + 𝑛2
ways.
55
Example 5.5: suppose we are planning a trip to some place. If there are 3 bus routes & two
train routs that we can take, then there are 3+2=5 different routs that we can take.
2. Multiplication rule: If an operation consists of k steps and the 1st step can be done in n1
ways, the 2nd step can be done in n2 ways (regardless of how the 1st step was performed), the
kth step can be done in nk ways, (regardless of how the preceding steps were performed), then
the entire operation can be performed in n1 · n2 ·… · nk ways.
Example 5.6: Suppose that a person has 2 different pairs of trousers and 3 shirts. In how many
ways can he wear his trousers and shirts?
Solution: He can choose the trousers in n1 = 2 ways, and shirts in n2 = 3 ways. Therefore, he
Permutation Rule:
a) The number of permutations of n objects taken all together is n!
n! n!
i.e. n!= n*(n-1)*(n-2)*…*3*2*1 = Pn = = = n!
(n − n)! 0!
n
Note: By definition 0! = 1
b) The arrangement of n distinct objects in a specific order using r objects at a time is called the
permutation of n objects taken r objects at a time. It is written as nPr and the formula is
n!
Pr =
(n − r )!
n
c) The number of distinct permutation of n objects in which n 1 are alike, n2 are alike, ..., nk are
alike is
n! for n = n1 + n2 + + nk
n1 !.n 2 !. .n k !
Example 5.7: Find number of permutations of the letters in the word ‘‘statistics’’.
Solution:
There are 3 s’s, 3t’s, 1a’s, 2i’s and 1c’s. i.e. 𝑛1 = 3, 𝑛2 = 3, 𝑛3 = 1, 𝑛4 = 2 𝑎𝑛𝑑 𝑛5 = 1
Therefore 10! = 50,400.
3!.3!.1!.2!1!
Example 5.8: A photographer wants to arrange 3 persons in a row for photograph. How many
different types of photographs are possible?
56
Solution:
Assume 3 persons Aster (A), lemma (L), Yared (Y) and n=3
Since n! =3! = 3*2! = 6, there are 6 possible arrangement ALY, AYL, LAY, LYA,YLA and YAL.
Example 5.9: Suppose we have a letters A,B, C, D & E
a) How many permutations are there taking all the four?
b) How many permutations are there taking two letters at a time?
Solution:
a) Here n = 5, there are four distinct object.
There are 5! = 120 permutations.
b) Here n = 5, r = 2
There are 5P2 = 5!/(5-2)! = 120/6 = 20 permutations.
Example 5.10: Fifteen Ethiopian athletes were entered to the race. In how many different ways
could prizes for the first, the second and the third place be awarded?
Solution
15 objects taken 3 at a time 15P3=15!/(15-3)! = 2730 ways.
4. Combination-A selection of objects considered without regard to order in which they occur
is called Combination. The number of combination of n different objects taking r of them at a
n n!
time is C r = = , for r = 0,1,2,, n .
r r!(n − r )!
n
Example 5.11: Given the letters A, B, C, and D list the permutation and combination for
selecting two letters.
Solution:
Permutation Combination
AB BA CA DA AB BC
AC BC CB DB AC BD
AD BD CD DC AD DC
Note that in permutation AB is different from BA but in combination AB is the same as BA.
Example 5.12: In a club containing 7 members a committee of 3 people is to be formed. In how
many ways can the committee be formed?
n n! 7 7!
Solution: 7C3 = n C r = = ➔ 7 C3 = = = 35
r r!(n − r )! 3 3!(7 − 3)!
Example 5.13: How many four-digit numbers can be formed with the 10 digits 0,1,2, . . ,9 if
a/ repetitions are allowed
b/ repetitions are allowed, and
57
c/ the last digit must be zero & repetitions are not allowed.
Solution:
a/ the first digit can be any one of 9 (since 0 is not allowed). The second, third and fourth digits
can be any one of 10. Then 9.10.10.10=9000 numbers can be formed.
b/ the first digit can be any one of 9 & the remaining three can be chosen in 9 P3 ways. Thus 9.
c/ the first digit can be chosen in 9 ways & the next two digits in 9 P2 ways. Thus 9. 8 P2 = 504
1. Classical approach:
Definition: If there are n equally likely outcomes of an experiment, and out of the n outcomes
event A occur only k times the probability of the event A is denoted by P (A) is defined as
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠 𝑓𝑎𝑣𝑜𝑟𝑎𝑏𝑙𝑒 𝑡𝑜 𝑒𝑣𝑒𝑛𝑡 𝐴 𝑛(𝐴) 𝑘
p(A) = = 𝑛(𝑆) = 𝑛
𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
Note: Classical approach of measuring probability fails to answer for the following conditions:
• If total number of outcomes is infinite or if it is not possible to enumerate all elements of the
sample space.
58
• If each outcome is not equally likely.
Example 5.14: Compute a/ the probability of having two boys & one girl is a three child family
using the classical method, assuming boys & girls are equally likely.
b/ using (a) compute the probability of having three boys in a three-child family.
c/ using (a) compute the probability of having three girls in a three –child family.
d/ using (a) compute the probability of having two girls & one boy in three child family.
Solution
59
b) Let A be the event that 6 will be non defective.
30 50
Total way in which A occur = * = NA=n (A)
4 6
𝑛(𝐴) 30 50 80
P (A) ) =𝑛(𝑆) = * / = 0.265
4 6 10
c) Let A be the event that all will be non defective.
30 50
Total way in which A occur = * = NA=n (A)
0 10
𝑛(𝐴) 30 50 80
P (A) =𝑛(𝑆) = * / = 0.00624.
0 10 10
2. The Frequentist Approach (Empirical Probability): This approach to probability is based
on relative frequencies.
Definition: Suppose we do again and again a certain experiment n times and let A be an event
of the experiment and let k be the number of times that event A occurs. Therefore the probability
of the event A happening in the long run is given by:
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑡𝑖𝑚𝑒𝑠 𝑒𝑣𝑒𝑛𝑡 𝐴 ℎ𝑎𝑠 𝑜𝑐𝑐𝑢𝑟𝑒𝑑 𝑘
P(A) = =
𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛𝑠 𝑛
In other words given a frequency distribution, the probability of an event (A) being
𝐹𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑜𝑓 𝑐𝑙𝑎𝑠𝑠 𝐴
in a given class is P(A) =
𝑇𝑜𝑡𝑎𝑙 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑖𝑛 𝑡ℎ𝑒 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑖𝑜𝑛
Example 5.16: The national center for health statistics reported that of every 539 deaths in
recent years, 24 resulted that from automobile accident, 182 from cancer, and 353 from other
disease. What is the probability that particular death is due to an automobile accident?
Solution
P (automobile) = death due to automobile /total death =24/539 = 0.445
The probability that particular death is due to an automobile accident is 0.445.
3. The axiomatic approach.
Let E be a random experiment and S be a sample space associated with E. With each event A
a real number called the probability of A satisfies the following properties called axioms of
probability or postulates of probability.
1. 0≤ P (A) ≤ 1
2. P(S) =1, S is the sure/certain event.
3. If A1 and A2 are mutually exclusive events, the probability that one or the other occur equals
the sum of the two probabilities. i. e. P(A1∪A2)=P(A1)+P(A2)
Similarly P(A1∪A2∪ . . . An) = P(A1)+P(A2) +. . . +P(An) = ∑𝑛𝑖=1 𝐴𝑖
4. P (A') =1-P (A)
60
5. P (ø) =0, ø is the impossible event.
5.4 Some probability rules
Rule l: let A be an event and A' be the complement of A with respect to a given sample space
of an experiment, then P(A')=1-P(A)
Proof: let S be a sample space S=AUA' and, A and A' are mutually exclusive
A∩A' = ø
P(S) = P (AUA') = P (A') + P (A) and P(S) = 1
1= P (A') + P (A) => P (A') = 1-P (A)
Rule 2: let A and B are events of a sample space S, then
P (A' ∩ B) = P (B) - P (A ∩ B)
Proof: B =S ∩ B = (AUA') ∩ B = (A∩ B) U (A'∩ B)
If A∩B ≠ ø , then P(B) =P (A∩ B) +P (A' ∩ B)
P (A' ∩ B) = P(B) – P(A ∩ B).
Rule 3: Suppose A and B are two events of a sample space, then
P(AUB) = P(A) + P(B) – P(A ∩ B)
Proof:
(AUB) = AU(A' ∩ B), A and A' ∩ B are disjoint sets
∴ P(AU B) = p(A) + p(A' ∩ B) . . . .*
But we have already proved that P (A’ n B) = P (B) – P (A ∩ B)
Put this in equation *
P(A U B) = P(A) + P (B) – P (A ∩ B)
Example 5.17: A fair die is thrown twice. Calculate the probability that the sum of spots on the
face of the die that turn up is divisible by 2 or 3.
Solution
S={(1,1),(1,2),(1,3),(1,4),(1,5),(1,6),(2,1),(2,2),(2,3),(2,4),(2,5),(2,6),(3,1),(3,2),(3,3),(3,4),(3,5),
(3,6),(4,1),(4,2),(4,3),(4,4),(4,5),(4,6),(5,1),(5,2),(5,3),(5,4),(5,4),(5,5),(5,6),(6,1),(6,2),(6,3),(6,4
),(6,5),(6,6)}
This sample space has 6*6 =36 elements let A be the event that the sum of the spots on the
die is divisible by 2 and B be the event that the sum of the spots on the die is divisible by three,
then
A = {(1,1), (1,3), (1,5), (2,2), (2,4), (2,6), (3,1), (3,3), (3,5), (4,2), (4,4), (4,6), (5,1), (5,3), (5,5),
(6,2), (6,4), (6,6)}
B = {(1,2), (1,5), (2,1), (2,4), (3,3), (3,6), (4,2), (4,5), (5,1), (5,4), (6,3), (6,6)}
A∩B = {(1, 5), (2,4), (3,3), (4,2), (5,1), (6,6)}
61
P (A or B) = P (A U B)
= P (A) +P (B) – P (A∩B)
= 18/36 + 12/36 -6/36 = 24/36 = 2/3
5.5 Conditional Probability and Independence
5.5.1 Conditional Probability
If A and B are events. Conditional probability of A given B means the probability of occurrence
of A when the event B has already happened.
It is denoted by P (A/B) and is defined by
P (A/B) = P(A ∩ B)/P (B), if P (B)≠0
Conditional probability of B given A means the probability of occurrence of B when the event A
has already happened. It is denoted by P (B/A) and is defined
P (B/A) = P(A ∩ B)/P (A), if P (A)≠0
P (A ∩ B) = P (A) P (B/A) = P (B) P (A/B).
5.5.2 Multiplication Law of Probability
If A and B are events in a sample space S, then
P (A ∩ B) = P (A) P (B/A), P (A) ≠ 0
P (A ∩ B) = P (B) P (A/B), P (B) ≠ 0
Where P (B/A) represents the conditional probability of B given A and P (A/B) represents the
conditional probability of A given B.
Note: Extension of multiplication law of probability for ‘n’ events A1, A2, …, An we have
P (A1 ∩ A2 ∩ …∩An) = P (A1) P (A2/A1) p (A3/A1 ∩ A2)…P(An/A1∩ A2 ∩ …∩An-1)
Example 5.18: A coin is tossed twice. If it is already known that the first coin has thrown a head,
what is the probability of getting two heads?
Solution:
S = {HH, HT, TH, TT}, A = the first shows a head = {HH, HT}, B= two heads occur ={HH}
P (B/A) = P(A ∩ B)/ P(A)
But A ∩ B ={HH}, P(A ∩ B) =1/4, P(A)=1/2, therefore, P (B/A) = P(A ∩ B)/ P(A) = 1/2
Example 5.19: Let A and B are events such that P (A U B) = ¾, P (A ∩ B) = ¼ and P(A' ) = 2/3.
Find P (A'/B)
Solution:
P(A') = 2/3 ➔ P (A) = 1- P(A') = 1-2/3 = 1/3
Now, P (A U B) = P (A) + P (B) - P (A ∩ B)
➔3/4 = 1/3 + P (B) – ¼
➔P(B) = 3/4 - 1/3 + ¼ = 2/3
62
Therefore, P (A/B) = P (A ∩ B)/P(B) = 3/8 ➔ P(A'/B) =1-P (A/B) = 1-3/8 =5/8.
5.5.3 Probability of Independent Event
Two events A and B are said to be independent if the occurrence of A has no bearing on
occurrence of B. That means knowledge of A has occurred given no information about the
occurrence of B. Two events, A and B, are said to be independent if P(A∩B) = P(A)P(B).
Suppose A and B are independent events with 0<P (A) <1 and 0<P (B) <1. The following
statements true:
i. A' and B' are independent, ii. A and B' are independent, iii. A' and B are independent
iv. P(B|A) = P(B), v. P(B|A') = P(B)
Example 5.20: A box contains four black and six white balls. What is the probability of getting
two black balls in drawing one after the other under the following conditions?
a. The first ball drawn is not replaced
b. The first ball drawn is replaced
Solution: Let A= first drawn ball is black
B= second drawn is black
Required P (A n B)
a. P (A ∩ B) = P (B/A) P(A) = (4/10) (3/9) = 2/15
b. P (A ∩ B) = P (A) P (B) = (4/10) (4/10) = 16/100 = 4/25.
5.6 Total probability and Bayes’ Theorem
Total probability:-If events B1, B2, …,& Bk constitute a partition of the sample space S &
Example 5.21: In a factory, machines A1, A2, A3 manufactures 25%, 35%, 40% of the total
output respectively. Out of their products 5%, 4% & 2% are, respectively defective. An item is
drawn at random from the products is found to be defective. What is the probability that
defective item is produced by all machines?
So/n: p(A1)=0.25, p(A2) = 0.35, p(A3) = 0.40, P(D/A1)= 0.05, P(D/A2) = 0.04, P(D/A3) =0.02
63
Bayes’ Theorem:- If B1, B2, …,& Bk are events which make an exhaustive partition of the
sample space S, if A is any event in S, then the conditional probability of Bi given that A has
P( Bi ) P( A / Bi )
already occurred is: P( Bi / A) = k
P( B ) P( A / B )
i
i i
Example 5.22: Based on the above example, what is the probability that it was manufactured
by machine A1?
P( A1 ) P( D / A1 )
Sol/n:- P( A1 / D) = k
= (0.25)(0.05)/0.0345 = 0.3623
P( A ) P( D / A )
i
i i
64