Measures of Central Tendency
Measures of Central Tendency
Specific Objectives
At the end of this topic the trainee should be able to:
➢ Define the measures of central tendency;
➢ State the properties of the measure of central tendency;
➢ Determine the measures of central tendency.
Introduction
Measures of central tendency show the tendency of some central value
around which data tends to cluster. These are statistical values which tend
to occur at the centre of any well ordered set of data. Whenever these
measures occur they do not indicate the centre of that data
Objectives of averaging
➢ To get one single value that describes the characteristics of the
entire data.
➢ To facilitate comparison.
i) Class limits
These are numerical values which limits the extend of a given class i.e. all
the observations in a given class are expected to fall within the interval
which is bounded by the class limits e.g. 15 & 19 are class limits as in the
table of the example above.
49
ii) Class boundaries
These are statistical boundaries, which separate one class from the other.
They are usually determined by adding the lower class limit to the next
upper class limit and dividing by 2 e.g. in the above table the class
19 + 20
boundary between 19 and 20 is 19.5 which is = .
2
Arithmetic Mean
The most popular and widely used measure for representing the entire data
by one value is an average. Its value is contained by adding together all the
observation and dividing this total by the number of observation
These is commonly known as average or mean it is obtained by first of all
summing up the values given and by dividing the total value by the total no.
of observations.
X
I.e. mean =
n
Where x = no. of values
∑ = summation
n = no of observations
Example
The mean of 60, 80, 90, 120
50
60 + 80 + 90 +120
4
350
=
4
= 87.5
The arithmetic mean is very useful because it represents the values of most
observations in the population.
The mean therefore describes the population quite well in terms of the
magnitudes attained by most of the members of the population
= 2245.72 hours
51
The following data was obtained from students who were registered in a
certain college.
The table shows the age distribution
Required calculate the mean age of the students using the coded method
= 29.36 years
Limitation
• It is difficult to compute mean without making assumption regarding
the size of the class- interval of the open end classer.
52
• If the data is not drawn from a ‘normal’ population, then the a.m.
may give a wrong impression about the population.
• In some situations, the a.m. may give unrealistic values especially
when dealing with discrete variables e.g. when working out the
average no. of children in a no. of families. It may be found that the
average is 4.4 which is unrealistic in human beings.
The Median
This is a statistical value which is normally located at the center of a given
set of data which has been organized in the order of magnitude or size e.g.
consider the set 14, 17, 9, 8, 20, 32, 18, 14.5, 13. When the data is ordered
it will be 8, 9, 13, 14, 14.5, 17, 18, 20, and 32. The middle number/median
is 14.5. The importance of the median lies in the fact that it divides the
data into 2 equal halves. The no. of observations below and above the
median is equal. In order to determine the value of the median from
grouped data, the data is grouped the median may be determined by using
the following methods
i. Graphical method using the cumulative frequency curve
(ogive).
ii. The formula
Example
Referring to the table in 105, determine the median using the methods
above
Xv
160
140
53
120
100
80
60
40
20
20 40 60 80 100 120 140 160
Media= L +i⁄f{m-c}
73.5 - 56
= 60 + × 20
48
= 60 + 7.29
= 67.29
Merits of median
• It is not influenced by the presence extreme values
• The sum of the deviation of observation from median is minimum
• It shows the centre of a given set of data
• Knowledge of the determination of the median may be extended to
determine the quartiles
• The median can easily be defined
• It can be obtained easily from the cumulative frequency curve
54
• It can be used in determining the degrees of skewness
Limitation
• Its value is not determined by each and every observation.
• Median is not capable of algebraic treatment.
• Median is less reliable.
• It is affected by sampling fluctuation.
• In some situations where the no. of observations is even, the value of
the median obtained is usually imaginary.
• The computation of the median using the formulas is not well
understood by most businessmen.
• In business environment the median has got very few applications.
Q1 = L + i/f (m–c)
Decile:
D2 = L +i/f (m-c)
Percentile:
Example
The profit earned by 100 companies during 2003-04 are given bellow
Profits (Kshs.) Number of companies
20-30 4
30-40 8
55
40-50 18
50-60 30
60-70 15
70-80 10
80-90 8
90-100 7
Solution
Calculation of Q1, Q2, Q4, and P80
Profits (Kshs.) f cf
20-30 4 4
30-40 8 12
40-50 18 30
50-60 30 60
60-70 15 75
70-80 10 85
80-90 8 93
90-100 7 100
= 40 +10/18(25-12)
= 40 + 7.22 = 47.22
= 47.22
D4 = L+ i/f (m-c)
= 50 +10/30 (40-30)
= 50 + 3.33
= 53.33
56
= 75
This means that 80 percent of the companies earn an annual profit of
kshs.75 or less
20 percent of the companies earn an annual profit of more than kshs. 75
Mode
This is one of the measures of central tendency. The mode is defined as a
value within a frequency distribution which has the highest frequency.
Sometimes a single value may not exist as such in which case we may refer
to the class with the highest frequency. Such a class is known as a modal
class
The mode is a very important statistical value in business activities quite
often business firms tend to stock specific items which are heavily on
demand e.g. footwear, clothes, construction materials (beams, wires, iron
sheets e.t.c.
The mode can easily be determined form ungrouped data by arranging the
figures given and determining the one with the highest frequency.
When determining the values of the mode from the grouped data we may
use the following methods;-
i. The graphical method which involves use of the histogram
ii. The computation method which involves use of formula
Example
In a social survey in which the main purpose was to establish the
intelligence quotient (IQ) of resident in a given area, the following results
were obtained as tabulated below:
Required
Calculate the modal value of the IQ’s tabulated above using
i. The graphical method and
ii. Formula
57
Graphical method
50
40
30
20
10
= 69.14
Merits of mode
• Mode is not affected by extreme values.
• At can be easily used to decide qualitative phenomenon.
• Mode happens to be meaningful as an average.
• It can be determined from incomplete data provided the
observations with the highest frequency are already known.
• The mode has several applications in business.
58
• The mode can be easily defined.
• It can be determined easily from a graph.
Limitation of mode
• Mode cannot always be computed
• If the data is quite large and ungrouped, determination of the mode
can be quite cumbersome
• Use of the formula to calculate the mode is unfamiliar to most
business people
• The mode may sometimes be non existent or there may be two
modes for a given set of data. In such a case therefore a single mode
may not exist
Geometric mean
This is a measure of central tendency normally used to measure industrial
growth rates. It is defined as the nth root of the product of ‘n’ observations
or values
- i.e. GM = n x1 × x 2 ×... × x n
Example
In 1995 five firms registered the following economic growth rates; 26%. 32%
41% 18% and 36%.
Required
Calculate the GM for the above values
GM = 5 26 32 41 18 36
No. Log
26 1.4150
32 1.5052
41 1.6128
18 1.2533
36 1.5563
7.3446
Therefore Log of GM = 1/5 x 7.3446 = 1.46892
So GM = Antilog of 1.46892
= 29.43
59
Merits
i. It makes use of all the values given (except when x = 0 or
negative)
ii. It is the best measure for industrial growth rates
Demerits
i. The determination of the GM by using logarithms is not familiar
process to all those expected to use it e.g. managers
ii. If the data contains zeros or –ve values, the GM ceases to exist
Harmonic mean
This is a measure of central tendency which is used to determine the
average growth rates for natural economies. It is defined as the reciprocal
of the average of the reciprocals of all the values given by HM.
1
HM =
1 ( 1 + 1 + ... 1 )
n x1 x2 x3
Example
The economic growth rates of five countries were given as 20%, 15%, 25%,
18% and 5%
Calculate the harmonic mean
1
The HM =
1 (1 +1 +1 +1 +1
5 20 15 25 10 5
1
=
0.2(0.05 + 0.07 + 0.04 + 0.10 + 0.2)
1
=
0.092
10.86%
Weighted mean
60
- This is the mean which uses arbitrarily given weights
- It is a useful measure especially where assessment is being done yet the
conditions prevailing are not the same. This is particularly true when
assessment of students is being done given that the subjects being taken
have different levels of difficulties.
Examples
The following table shows that marks scored by a student doing section 3
and 4 of CPA
Weighted mean
Ewx
Ew
18005
=
285
= 63.17%
61