Measures of Central Tendency
Measures of Central Tendency
M SRINIVASAN
TRAINING ASSOCIATE (MATH)
KVS, ZIET MUMBAI
Statistics is a branch of mathematics dealing with
the collection, organization, analysis, interpretation
and presentation of data
Two main statistical methods are used in data
analysis:
Descriptive statistics, which summarize data from a
sample using indexes such as the mean or standard
deviation
Inferential statistics, which draw conclusions from
data that are subject to random variation (e.g.,
observational errors, sampling variation)
MEASURES OF
CENTRAL TENDENCY
Statistical tools are required for
appropriate analysis of the given data
Measures of Central Tendency is a
statistical tool which gives us a single
value in the series which represents
the whose series
MEASURES OF
CENTRAL TENDENCY
Averages are frequently called
measures of central tendency as they
reflect the spirit of the data
An average is sometimes called a
measure of central tendency as
individual values of variables cluster
around it
KINDS OF STATISTICAL AVERAGE
𝑑𝑥
𝑋=𝐴+
𝑁
𝑿 = 𝑨𝒓𝒊𝒕𝒉𝒎𝒆𝒕𝒊𝒄 𝑴𝒆𝒂𝒏
𝑨 = 𝑨𝒔𝒔𝒖𝒎𝒆𝒅 𝑨𝒗𝒆𝒓𝒂𝒈𝒆 𝒐𝒓 𝑴𝒆𝒂𝒏
dx = Deviations from assumed Mean (𝑥 – A)
N = Number of items
Find out the arithmetic average of marks obtained by 10
students in Mathematics
48,32,42,39,36,48,41,44,37,48
Marks Obtained (X) Deviations Calculation
dx = X – 48
48
32
42
39
36
48
41
44
37
48
N = 10
Marks Obtained (X) Deviations Calculation
dx = X – 48
48 0
32 -16
42 -6 𝑑𝑥
𝑋=𝐴+
39 -9 𝑁
−𝟔𝟓
36 -12 = 48 + 𝟏𝟎
48 0 = 41.5
41 -7
44 -4
37 -11
48 0
N = 10 𝒅𝒙 = −𝟔𝟓
DIRECT METHOD
𝑓𝑥 𝑓𝑥
𝑋= =
𝑓 𝑁
𝑿 = 𝑨𝒓𝒊𝒕𝒉𝒎𝒆𝒕𝒊𝒄 𝑴𝒆𝒂𝒏
𝒇 = 𝑭𝒓𝒆𝒒𝒖𝒆𝒏𝒄𝒚 𝒐𝒇 𝒕𝒉𝒆 𝒗𝒂𝒓𝒊𝒂𝒃𝒍𝒆𝒔
𝒇𝑥= Product of variable with respective frequencies
𝑓𝑥 = Sum of the product of the variables with their frequencies
𝒇 = N = Number of items
The farm size of 220 cultivating
households in a village is
Farm
Size
1 2 3 4 5 6 7 8 Total
No. of
house
holds 23 25 30 40 35 40 20 10 220
1 20
2 25
3 30
4 40
5 35
6 40
7 20
8 10
Form Size No. of fx
x Households 𝑓𝑥 𝑓𝑥
f 𝑋= =
𝑓 𝑁
1 20 20 995
2 25 50 =
220
3 30 90
4 40 160
=4.34
5 35 175
6 40 240
7 20 140
8 10 80
𝒇 = 𝟐𝟐𝟎 𝒇𝒙 = 𝟗𝟗𝟓
ASSUMED MEAN METHOD
We assume any variable as mean and
deviations from the assumed mean 𝒅𝒙 are
calculated
Every deviation should be multiplied by its
respective frequency to obtain 𝒇𝒅𝒙
The summation of the product of frequency and
deviation is divided by number of observations
to get the Arithmetic Mean
ASSUMED MEAN METHOD
𝑓𝑑𝑥 𝑓𝑑𝑥
𝑋 =𝐴+ =𝐴+
𝑓 𝑁
𝑿 = 𝑨𝒓𝒊𝒕𝒉𝒎𝒆𝒕𝒊𝒄 𝑴𝒆𝒂𝒏
𝑨 = 𝑨𝒔𝒔𝒖𝒎𝒆𝒅 𝑨𝒗𝒆𝒓𝒂𝒈𝒆 𝒐𝒓 𝑴𝒆𝒂𝒏
dx = Deviations from assumed Mean (𝑥 – A)
𝒇𝒅𝒙 = 𝑺𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒑𝒓𝒐𝒅𝒖𝒄𝒕 𝒐𝒇 𝒅𝒆𝒗𝒂𝒕𝒊𝒐𝒏 𝒂𝒏𝒅 𝒇𝒓𝒆𝒒𝒖𝒆𝒏𝒄𝒚
N= 𝑓 = Number of items
The number of telephone calls
received in 250 successive intervals is
NO. of
calls
0 1 2 3 4 5 6 7 Total
Freq.
14 21 25 43 55 41 40 11 250
0 14
1 21
2 25
3 43
4 55
5 41
6 40
7 11
NO. f dx = X - 4 𝒇𝒅𝒙
of
calls
0 14 -4 -56
𝑓𝑑𝑥
1 21 -3 -63 𝑋 =𝐴+
𝑓
2 25 -2 -50 −58
=4+
3 43 -1 -43 250
4 55 0 0 =3.77
5 41 1 41
6 40 2 80
7 11 3 33
𝒇 = 𝟐𝟓𝟎 𝒇𝒅𝒙 = −𝟓𝟖
STEP DEVIATION METHOD
We assume any variable as mean and
deviations from the assumed mean 𝒅𝒙 are
calculated
The deviations thus obtained are divided by a
common factor in order to obtain step deviation
The summation of the product of frequency and
step deviation is divided by number of
observations to get the Arithmetic Mean
STEP DEVIATION METHOD
𝑓𝑑 ′ 𝑥 𝑓𝑑 ′ 𝑥
𝑋=𝐴+ ×𝐶=𝐴+ ×𝐶
𝑓 𝑁
𝑿 = 𝑨𝒓𝒊𝒕𝒉𝒎𝒆𝒕𝒊𝒄 𝑴𝒆𝒂𝒏
𝑨 = 𝑨𝒔𝒔𝒖𝒎𝒆𝒅 𝑨𝒗𝒆𝒓𝒂𝒈𝒆 𝒐𝒓 𝑴𝒆𝒂𝒏
d'x = Step Deviation
(Deviations from assumed Mean (𝑥 – A) divided by common factor C)
𝒇𝒅′ 𝒙 = 𝑺𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒑𝒓𝒐𝒅𝒖𝒄𝒕 𝒐𝒇 𝒔𝒕𝒆𝒑 𝒅𝒆𝒗𝒂𝒕𝒊𝒐𝒏 𝒂𝒏𝒅 𝒇𝒓𝒆𝒒𝒖𝒆𝒏𝒄𝒚
N= 𝑓 = Number of items; C – Common Factor
Calculate average wage of workers
Wages
No. of
workers
5 15 32 42 15 12 4
Wages Frequency Deviation ′
𝑿 −𝑨 𝒇𝒅𝒙
𝒅𝒙=
from 𝑪
Assumed
Mean C = 100
d=X-A
2400 5
2500 15
2600 32
2700 42
2800 15
2900 12
3000 4
𝑿 − 𝟐𝟓𝟎𝟎
Wages f d = X – 2500 𝒅′ 𝒙 =
𝟏𝟎𝟎
fd’x
2400 5 -100 -1 -5
2500 15 0 0 0
2600 32 100 1 32
2700 42 200 2 84
2800 15 300 3 45
2900 12 400 4 48
3000 4 500 5 20
𝒇 = 𝟏𝟐𝟓 𝑪 = 𝟏𝟎𝟎
𝑓𝑑 ′ 𝑥
𝑋 =𝐴+ ×𝐶
𝑓
224
= 2500 + × 100
125
= 2679.2
We assume that the values of each item in the
class is identical with the mid-value of the class
interval
Mid-point is given by the formula
𝑳𝟏 + 𝑳𝟐
𝟐
𝑳𝟏 = 𝑳𝒐𝒘𝒆𝒓 𝑳𝒊𝒎𝒊𝒕 𝒐𝒇 𝒕𝒉𝒆 𝒄𝒍𝒂𝒔𝒔 𝒊𝒏𝒕𝒆𝒓𝒗𝒂𝒍
𝑳𝟏 = 𝑳𝒐𝒘𝒆𝒓 𝑳𝒊𝒎𝒊𝒕 𝒐𝒇 𝒕𝒉𝒆 𝒄𝒍𝒂𝒔𝒔 𝒊𝒏𝒕𝒆𝒓𝒗𝒂𝒍
DIRECT METHOD
𝑓𝑥 𝑓𝑥
𝑋= =
𝑓 𝑁
𝑿 = 𝑨𝒓𝒊𝒕𝒉𝒎𝒆𝒕𝒊𝒄 𝑴𝒆𝒂𝒏
𝒙 = 𝒎𝒊𝒅 − 𝒑𝒐𝒊𝒏𝒕 𝒐𝒇 𝒕𝒉𝒆 𝒄𝒍𝒂𝒔𝒔 𝒊𝒏𝒕𝒆𝒓𝒗𝒂𝒍
𝒇 = 𝑭𝒓𝒆𝒒𝒖𝒆𝒏𝒄𝒚 𝒐𝒇 𝒕𝒉𝒆 𝒗𝒂𝒓𝒊𝒂𝒃𝒍𝒆𝒔
𝒇𝑥= Product of mid-points with respective frequencies
𝑓𝑥 = Sum of the product of the midpoints with their frequencies
𝒇 = N = Number of items
The following table gives ploughing time per
hectare taken by 45 farmers in a village
Ploughing Time No. of farmers
480-500 2
500-520 5
520-540 10
540-560 15
560-580 12
580-600 1
Class Mid Frequency fx
point x
480-500 490 2
500-520 510 5
520-540 530 10
540-560 550 15
560-580 570 12
580-600 590 1
Class Mid Frequency fx
point x
480-500 490 2 980
500-520 510 5 2550
520-540 530 10 5300
540-560 550 15 8250
560-580 570 12 6840
580-600 590 1 590
𝒇 = 𝟒𝟓 𝒇𝒙 = 𝟐𝟒𝟓𝟏𝟎
𝒇 = 𝟒𝟓 𝒇𝒙 = 𝟐𝟒𝟓𝟏𝟎
𝑓𝑥 𝑓𝑥
𝑋= =
𝑓 𝑁
24510
=
45
= 544.67
ASSUMED MEAN METHOD
𝑓𝑑𝑥 𝑓𝑑𝑥
𝑋 =𝐴+ =𝐴+
𝑓 𝑁
𝑿 = 𝑨𝒓𝒊𝒕𝒉𝒎𝒆𝒕𝒊𝒄 𝑴𝒆𝒂𝒏
𝑨 = 𝑨𝒔𝒔𝒖𝒎𝒆𝒅 𝑨𝒗𝒆𝒓𝒂𝒈𝒆 𝒐𝒓 𝑴𝒆𝒂𝒏
𝒙 = 𝒎𝒊𝒅 − 𝒑𝒐𝒊𝒏𝒕 𝒐𝒇 𝒕𝒉𝒆 𝒄𝒍𝒂𝒔𝒔 𝒊𝒏𝒕𝒆𝒓𝒗𝒂𝒍𝒔
dx = Deviations from assumed Mean (𝑥 – A)
𝒇𝒅𝒙 = 𝑺𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒑𝒓𝒐𝒅𝒖𝒄𝒕 𝒐𝒇 𝒅𝒆𝒗𝒂𝒕𝒊𝒐𝒏 𝒂𝒏𝒅 𝒇𝒓𝒆𝒒𝒖𝒆𝒏𝒄𝒚
N= 𝑓 = Number of items
The following table gives ploughing time per
hectare taken by 45 farmers in a village
Ploughing Time No. of farmers
480-500 2
500-520 5
520-540 10
540-560 15
560-580 12
580-600 1
Class Mid dx = x - A Frequency fdx
point x
480-500 490 2
500-520 510 5
520-540 530 10
540-560 550 15
560-580 570 12
580-600 590 1
Class Mid dx = x - 550 Frequency fdx
point x f
𝒇 = 𝟒𝟓 𝒇𝒅𝒙 = −𝟐𝟒𝟎
𝒇𝒅𝒙
𝑿=𝑨+
𝒇
−𝟐𝟒𝟎
= 550 +
𝟒𝟓
= 544.67
STEP DEVIATION METHOD
𝑓𝑑 ′ 𝑥 𝑓𝑑 ′ 𝑥
𝑋=𝐴+ ×𝐶=𝐴+ ×𝐶
𝑓 𝑁
𝑿 = 𝑨𝒓𝒊𝒕𝒉𝒎𝒆𝒕𝒊𝒄 𝑴𝒆𝒂𝒏
𝑨 = 𝑨𝒔𝒔𝒖𝒎𝒆𝒅 𝑨𝒗𝒆𝒓𝒂𝒈𝒆 𝒐𝒓 𝑴𝒆𝒂𝒏
𝒙 = 𝒎𝒊𝒅 − 𝒑𝒐𝒊𝒏𝒕 𝒐𝒇 𝒕𝒉𝒆 𝒄𝒍𝒂𝒔𝒔 𝒊𝒏𝒕𝒆𝒓𝒗𝒂𝒍𝒔
d'x = Step Deviation
(Deviations from assumed Mean (𝑥 – A) divided by common factor C)
𝒇𝒅′ 𝒙 = 𝑺𝒖𝒎𝒎𝒂𝒕𝒊𝒐𝒏 𝒐𝒇 𝒑𝒓𝒐𝒅𝒖𝒄𝒕 𝒐𝒇 𝒔𝒕𝒆𝒑 𝒅𝒆𝒗𝒂𝒕𝒊𝒐𝒏 𝒂𝒏𝒅 𝒇𝒓𝒆𝒒𝒖𝒆𝒏𝒄𝒚
N= 𝑓 = Number of items ; C – Common Factor
The following table gives ploughing time per
hectare taken by 45 farmers in a village
Ploughing Time No. of farmers
480-500 2
500-520 5
520-540 10
540-560 15
560-580 12
580-600 1
Class Mid dx = x - A ′
𝑿 −𝑨 Frequency fdx
𝒅𝒙=
point 𝑪
x
480-500 490 2
500-520 510 5
520-540 530 10
540-560 550 15
560-580 570 12
580-600 590 1
Class Mid 𝒅′ 𝒙 =
𝑿 −𝟓𝟓𝟎 Frequency 𝒇𝒅′ 𝒙
𝟐𝟎
point dx f
x = x - 550
𝒇 = 𝟒𝟓 𝑪 = 𝟐𝟎
′
𝒇𝒅 𝒙
𝑿=𝑨+ ×𝑪
𝒇
−𝟏𝟐
= 𝟓𝟓𝟎 + ×20
𝟒𝟓
= 544.67
Simplicity
Representative
Arrangement of data not required
Stability in calculation
Calculated Value
Absurd results
Affected by extreme points
Non-existence value as mean
More stress on items of higher
value
Calculation by observation not
possible
Simple Arithmetic Mean gives equal importance
to all the items in a series
Depending upon the necessity of observation
and purpose of statistical survey all the items
may not deserve equal importance
Thus relative importance of different itmes in a
series may differ
This relative importance is known as weight and
computation of average after loading the
weight is called weighted arithmetic mean
𝑊𝑥
𝑋=
𝑊
𝑿 = 𝑨𝒓𝒊𝒕𝒉𝒎𝒆𝒕𝒊𝒄 𝑴𝒆𝒂𝒏
W = Weight of the individual items
USES OF WEIGHTED MEAN
Marks 0 – 20 20 – 40 40 – 60 60 – 80 80 - 100
Cumulative
5 13 16 20 25
Frequency
Marks 0 – 20 20 – 40 40 – 60 60 – 80 80 - 100
Frequency 5 8 3 4 5
Median is a positional average
Median is the value of the series
which divides the series into two
equal parts
Median is middle value of the series
when items are arranged in ascending
or descending order
The values are to be arranged in ascending or
descending order to locate the median
Total number of items in the series is odd
𝑛+1 𝑡ℎ
𝑀𝑒𝑑𝑖𝑎𝑛 = item
2
Total number of items in the series is even
𝑡ℎ 𝑡ℎ
𝑛 𝑛+2
2 𝑖𝑡𝑒𝑚 + 2 𝑖𝑡𝑒𝑚
𝑀𝑒𝑑𝑖𝑎𝑛 =
2
Calculate the median from the
following data
20, 25, 30, 15, 17, 35, 26, 18, 40, 45, 50
The values in ascending order
15, 17,18, 20, 25, 26, 30, 35, 40, 45, 50
Total number of items in the series is 11 which is odd
𝑛+1 𝑡ℎ 11+1 𝑡ℎ
𝑀𝑒𝑑𝑖𝑎𝑛 = item 𝑀𝑒𝑑𝑖𝑎𝑛 = item
2 2
𝑴𝒆𝒅𝒊𝒂𝒏 = 28
For Discrete Series
The values are to be arranged in
ascending or descending order
The cumulative frequencies are to be
computed
The Median lies at the size of the
item, in whose cumulative frequency
𝒏+𝟏 𝒕𝒉
the value of item lies
𝟐
Calculate Median for the following
distribution
Weekly
Income 58 59 60 61 62 63 64 65 66
No of
workers 2 3 6 15 10 5 4 3 1
Weekly No. of workers Cumulative
income x (f) Frequency
58 2
59 3
60 6
61 15
62 10
63 5
64 4
65 3
66 1
x f c.f
58 2 2
59 3 5 N= 49
𝑵+𝟏
60 6 11 = 𝟐𝟓
𝟐
61 15 26 Middle term is the 25th
62 10 36 item
63 5 41 The item lies in the value
64 4 whose c.f is 26
45
Hence Median = 61
65 3 48
66 1 49
For Continuous Series
Marks 100-120 120 – 140 140 –160 160 – 180 180 - 200
No. of
10 20 30 15 5
Students
Class No. of students Cumulative
Frequency
100 - 120 10 10
120 - 140 20 30
140 – 160 30 60
160 - 180 15 75
180 - 200 5 80
Class f c.f
N= 80
𝑵
= 𝟒𝟎
100 - 120 10 10 𝟐
120 - 140 20 30 40th term lies in the
interval 140 – 160
140 – 160 30 60 Hence 140 – 160 is the
160 - 180 15 75 median class
𝑵
180 - 200 5 80 𝟐
−𝑪.𝑭
𝑴𝒆𝒅𝒊𝒂𝒏 = 𝑳𝟏 + ×𝒊
𝒇
𝑳𝟏 = 𝑳𝒐𝒘𝒆𝒓 𝒍𝒊𝒎𝒊𝒕 𝒐𝒇 𝒎𝒆𝒅𝒊𝒂𝒏 𝒈𝒓𝒐𝒖𝒑
c.f. = Cumulative frequency of group preceding the median group
f = frequency of median group
i = length of median class interval
𝑳𝟏 = 𝑳𝒐𝒘𝒆𝒓 𝒍𝒊𝒎𝒊𝒕 𝒐𝒇 𝒎𝒆𝒅𝒊𝒂𝒏 𝒈𝒓𝒐𝒖𝒑 = 𝟏𝟒𝟎
c.f. = cumulative frequency of group preceding the median group=30
f = frequency of median group = 30
i = length of median class interval=20
𝑵
−𝑪.𝑭
𝟐
𝑴𝒆𝒅𝒊𝒂𝒏 = 𝑳𝟏 + ×𝒊
𝒇
𝟒𝟎 −𝟑𝟎
= 𝟏𝟒𝟎 + × 20
𝟑𝟎
= 146.67
Simplicity
Ideal Average
Graphical Determination
Representing Middle Value
Useful in distribution of unequal
classes
Practical Application
Not based on all observations
Further algebraic treatment not
possible
Inaccurate in case of large data
Affected by fluctuations
Arrangement required
Mode has been derived from the
French word la mode, meaning
fashion
Mode is the value of that item in a
variable which is repeated the
greatest number of times or occurs
more frequently.
For Ungrouped Data
By arraying data
The data is arranged in an array and the
value of item which occurs most
frequently is called mode
By converting data into discrete series
Data are converted into discrete series and
most frequently repeated value is termed
as mode
Find out the mode of the following data:
400, 600, 500, 700, 900, 800, 1000, 400, 700, 600,
500, 900, 800, 700, 700
Wages Frequency
400 2
500 2
600 2
700 4
800 2
900 2
1000 1
Class 10 – 15 15 – 20 20 – 25 25 – 30 30 – 35 35 – 40 40 - 45
Frequency 6 14 32 24 18 10 6