Chapetr - Three: Measure of Central Tendency
Chapetr - Three: Measure of Central Tendency
CHAPETR - THREE
Measure of Central Tendency
o When we want to make comparison b/n groups of numbers it is good to have a single value, which is considered to be a good representative of each
group.
o This single value is called the average of the group.
o Averages are also called measure of central tendency.
o An average, which is representative, is called typical average and average which is not representative and has only a theoretical value is called a
descriptive average.
o A typical average should posses the following.
o It should be strictly defined.
o It should be based on all observation under investigation.
o It should be as little as affected by extreme observations.
o It should be capable of further algebraic treatment.
o It should be ease to calculate and simple to understand.
Objectives:
o To comprehend the data easily.
o To facilitate comparison.
o To make further statistical analysis.
Properties of Summation
i) ∑ = nk, where k is any constant.
Introduction to statistics
iii) ∑ ( ) ∑
iv)∑ ( ) ∑ ∑
xi fi xifi
2 2 4
3 1 3
Introduction to statistics
7 3 21
8 1 8
Total 7 36
∑ = = 5.15
∑
=∑ ∑
Where: xi is the class mark of the ith class and fi is the ith class frequency.
Example: calculate the mean for the following age distribution.
class Frequency
6-10 35
11-15 23
16-20 15
21-25 12
26-30 9
31-35 6
o Solutions:
o First find the class marks.
o Find the product of frequency and class marks
o Find mean using the formula.
class fi xi xifi
6-10 35 8 280
11-15 23 13 299
16-20 15 18 270
21-25 12 23 276
26-30 9 28 252
Introduction to statistics
31-35 6 33 198
total 100 1575
=∑ ∑ = ( ) =15.75
Marks Frequency
40-44 7
45-49 10
50-54 22
55-59 f1
60-64 f2
65-69 6
70-74 3
The mean is 5
( X X ) (3 5) (8 5) (4 5) 0
ii. If 1 if the mean of n1 observation, if 2 is the mean of n2 observations, ........, if k is the mean of nk observations, then the mean of all the
observation in all groups often called the combined mean is given by :-
= = ∑ ̅i
∑
Example:-In a class there are 30 females and 70 males .If females averaged 60 in an examination and boys averaged 72, find the mean of the entire class.
✈solutions:-
Females males
̅ 1= 60 ̅ 2 = 72
n1= 30 n2=70
̅c = = = 68.4
iii. If wrong figure has been used when calculating; the mean of the correct mean can be obtained without repeating the whole process using:
65 + = 69 k.g.
o When a different importance is desired to be giving to different data a weighted mean is appropriate.
o Weights are assigned to each item in proportion to its relative importance.
o Let x1, x2 ,…., xn be the values of the items a series and w1,W2,..., Wn their corresponding weights, the weighted mean denoted by is defined as:-
o
( w1 X 1 w 2 X 2 ... w n X n )
Xw
( w1 w 2 ... w n )
w = ∑ i
∑
• Example:-A student obtained the following percentage in an examination:- English 60, biology 75, mathematics 63, physics 59,and chemistry 55.
Find the students weighted arithmetic mean if weights 1, 2,1,3,3 respectively are allotted to all students.
• Solution :-
= ∑ = = 61.5
∑
G.M= √
o For grouped data
- If the number of observation is more than three or more, the computation of the nth root very tedious, to simplify computation, logarithm is used in terms of
log.
LogG.M = ∑
1 n
For grouped data
G AntiLog f i Log xi
N i 1
Example: - Find the geometric mean of 3,9,27
Solution: - G.M = √ =√ =9
Note: - The geometric mean is useful and appropriate for finding averages of ratios or growth rates.
o H.M = =
∑ ∑
H.M = = , n =∑
∑ ∑
- If x1, x2, x3,…, xn be the value of the items a series and w1,w2,…,wn their corresponding weights, the weighted Harmonic Mean denoted by;
H.Mw =
∑
∑
Solution:- H.M = = 24
Note:- The Harmonic Mean is useful and appropriate in finding average speeds and average rates.
N.B a). A.M>G.M>H.M
b). √ = G.M, Where A.M and H.M. are the usual abbreviations.
V 2 3 4 5
f 5 8 12 1
Mode is 4.
continues series: (class frequency distribution).
Mode =L+ [( ) (
]C
)
Demerit
o It is not rigid.
o It not based on all observations.
o It is not suitable for further mathematical treatment.
o It is not stable average. i.e. it is affected by fluctuations of sampling to some extent
o Often its value is not unique. i.e It may not be uniquely defined
Example: X={1,1,2,2,3,4}, Mode(X)=1 and 2
̃ =
a) 2, 1,8,3,5
b) 6, 5, 2,8,9,4
►Quartiles
Introduction to statistics
o Are the three values, which divided the given data in to four equal parts, they are denoted by Q1, Q2 and Q3.
Q1= lower or first quartile, it covers 25% of the distribution
►Deciles
o Are the nine values, which divide, the series in to 10 equal parts, they are denoted by D1, D2, D3,..., D9
D1= covers 10% of the distribution
► Percentiles
o Are the 99 values, which divide the series in to 100 equal parts. They are denoted by P1, P2,…, P99.
Note that
Reading assignments:
Quartiles, Deciles and Percentiles from the row data
E.g. for the data given below, compute the quartiles, D3, D7, P15 and P88 interpret.
CHAPTER - FOUR
MEASURE OF DISPERSION (VARIATION)
The degree to which a numerical data tends to spread about an average is called dispersion or variation of the data
1. Range (R)
R = X max – X min
o Easy to compute and a quick but not good measure of variability since it fails to take into account how the data are distributed and it is greatly affected
by extreme value.
Introduction to statistics
o The following two distributions have the same range, 13, yet appear to differ greatly in the amount of variability.
Distribution 1: 32 35 36 36 37 38 40 42 42 43 43 45
Distribution 2: 32 32 33 33 33 34 34 34 34 34 35 45
RR= =
Example:
1. If the range and relative range of a series are 4 and 0.25 respectively.
Then what is the value of:
a). smallest observation.
Example:
Q.D =? , C.Q.D =?
Standard Deviation
o There is a problem with variances. Recall that the deviations were squared. That means the units were also squared.
o To get the units back the same as the original data values, the square root must be taken.
o = √ and s = √
Examples: find the variances and standard deviations of the following sample data 5,17,12,10. The data is given in the form of frequency
distribution.
Solutions: ̅ =11
xi 5 10 12 17 total
(Xi- ̅ )2 36 1 1 36 74
class frequency
Introduction to statistics
40-44 7
45-49 10
50-54 22
55-59 15
60-64 12
65-69 6
70-74 3
̅= 55
Xi(C.M) 42 47 52 57 62 67 72 total
2 1183 640 198 60 588 864 867 4400
fi( – ̅)
Exercise:-
Introduction to statistics
1. A meteorologist interested in the consistency of temperatures in three cities during a given week collected the following data. The temperature for the five days of the week in the
three cities were
City -1 25 24 23 26 17
City-2 22 21 24 22 20
City-3 32 27 35 24 28
o Then, which city do you think have the most consistent temperature, based on these data?
2. Two groups of people were trained to perform a certain task and tested to find out which group is faster to learn the task. For the two groups the following
information was given:
Value Group one Group two
Mean 10.4 min 11.9 min
Standard deviation 1.2 min 1.3 min
Moments
∑ ( ̅) ∑ ( ̅)
Mr = =
Examples:
1. Find the first two moments for the following set of numbers 2,3,7
2. Find the first three central moments of the numbers in problem 1.
Solutions:
1. Use the rth moment formula.
̅r = ∑ r
= ̅ 1 = (2+3+7)/3 =4, ̅ 2 = (22+32+72)/3 = 20.67
Measure of Shapes
Skewness
Skewness is concerned with the shape the curve not size
o Skewness is the degree of asymmetry or departure from symmetry of a distribution.
o A skewed frequency distribution is one that is not symmetrical.
o If the frequency curve (smoothed frequency polygon) of a distribution has a longer tail to the right of the central maximum than to the left, the
distribution is said to be skewed to the right or said to be Positive skewness.
o If it has a longer tail to the left of the central maximum than to the right, it is said to be Skewed to the left said to have negative skewness
For the moderately skewed distribution, the relation holds among the three commonly used measure of central tendency. Mean – mode =3*(mean – median)
Q3-Q1 Q3 - Q1
3. The moment coefficient of skewness
α3 = M3 = M3
M2
3/2
(
o The shape of the curve is determined by the value of α3
If α3 > 0 then the distribution is positively skewed .
If α3 = 0 then the distribution is symmetric.
If α3 < 0 then the distribution is negatively skewed.
Examples:
1. Suppose the mean, the mode, and the standard deviation of a certain distribution are 32, 30.5 and 10 respectively. What is the shape of the curve representing
the distribution?
α3 = = α3 = = 0.15
2. In a frequency distribution, the coefficient of the skewness based on the quartiles is given to be 0.5. If the sum of the upper and lower quartiles is 28 and the
median is 11, find the values of the upper and the lower quartiles.
Solutions:
Given:
α3 =0.5, median =Q2=11
Q1+Q3= 28....................................... (*)
Required Q1 and Q3
α3 = (Q3 –Q2) –(Q2-Q1) = Q3+Q1 -2Q2 = 0.5
Q3-Q1 Q3- Q1
Substituting the given value
Q3-Q1=12………………………… (**)
Solving (*) and (**) Q1=8 , Q3=20
Introduction to statistics
3. For a moderately skewed frequency distribution, the mean is 10 and the median is 8.5. If the coefficient of variation is 20%, find the Pearsonian coefficient of
skewness and the probable mode of the distribution.
4. The sum of fifteen observations, whose mode is 8, was found to be 150 with coefficients of variation of 20%. Then, calculate the Pearsonian coefficient of
skewness and give appropriate conclusion.
Kurtosis
o Kurtosis is the degree of peakdness of a distribution, usually taken relative to a normal distribution.
o A distribution having relatively high peak isLeptokurtic
o if a curve representing a distribution is flat topped Platy kurtic
o The normal distribution which is not very high peaked or flat topped Mesokurtic
Measure of Kurtosis
The moment coefficient of kurtosis: denoted by α4
where =
Where:-
M4 = is the 4th moment about mean
M2 = is 2nd moment about mean.
is population standard deviation
The peakdness of depends on the value of α4 :
If α4 > 3 then the curve is leptokurtic.
If α4 = 3 the curve is Mesokurtic
If α4 < 3 then the curve is Platykurtic.
Solutions:-