0% found this document useful (0 votes)

11 views15 pages

Statistucs Theory

The document discusses frequency distribution, explaining both ungrouped and grouped frequency distributions, along with their characteristics and graphical representations such as histograms and frequency polygons. It also covers measures of central tendency, specifically the arithmetic mean, geometric mean, and their calculations with examples. Additionally, it highlights the merits and demerits of these statistical measures.

Uploaded by

78slippery

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views15 pages

Statistucs Theory

Uploaded by

78slippery

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

CPP STATISTICS

` KKP / MYP CENTRE

Frequency Distribution:

Let us consider the marks obtained by 30 students in an exam. The data is as given below:

7, 7, 2, 5, 5, 3, 1, 3, 2, 1
7, 8, 9, 4, 1, 6, 7, 4, 4, 3
2, 9, 5, 1, 1, 6, 3, 7, 9, 1

This representation of data is called Raw Data. This data doesn’t furnish any useful information and is rather
confusing. A better way to express the data is as given below.

Marks (x) Number of

students (f)
1 6
2 3
3 4
4 3
5 3
6 2
7 5
8 1
9 3

The representation of data as above is known as ‘ungrouped frequency distribution’. Here, marks are called the
variable (x) and the number of students against the marks is known as the frequency (f) of the variable.

If the identity of the individuals about whom particular information is taken is not relevant, nor the order in which the
observations arise, then the first real step of condensation is to divide the observed range of variable into a suitable
number of class intervals. For example, in the above case, the data may be expressed as shown below.

Marks Number of Students

1–3 13
4–6 8
7–9 9

The representation of data as above is known as ‘grouped frequency distribution’ (inclusive type of
Classification).

Note:

(a) The classes of the type 1 – 3, 4 – 6, 7 – 9, etc., in which both the upper and lower limits are included are called
inclusive classes.

(b) The following points may be kept in mind for classification of data into grouped frequency.

 The classes should be clearly defined and should not lead to any ambiguity.
 The classes should be exhaustive i.e. each of the given values should be included in one of the classes.
 The classes should be mutually exclusive and non overlapping
 The classes should be of equal width. However, it cannot be rigidly followed.

If the classes are of varying width, the different class frequencies will not be comparable. The ratio of frequency by
the corresponding width of the class interval is called ‘frequency density’.

KUKATPALLY CENTRE: # 22-97, Plot No.1, Opp. Patel Kunta Huda Park, Vijaynagar Colony, Hyderabad - 500 072. Ph.: 040-64601123
MIYAPUR CENTRE: Above Sai Motors Maruthi Showroom, Allwyn X Road, Miyapur, Hyderabad.
Regd. Off.: 29A, ICES House, Kalu Sarai, Sarvapriya Vihar, New Delhi - 110 016. Ph: 011 - 2651 5949, 2656 9493, Fax: 2651 3942
If we deal with a continuous variable, it is not possible to arrange the data in the class intervals of above type. For
example, let us consider the distribution of age in years. If class intervals are 1 – 3, 4 – 6, 7 – 9…… then the
persons with ages between 6 and 7 years are not taken into consideration. In such cases, we form the class
intervals as shown below.

Age (x) Number of people (f)

1–3 9
3–6 10
6–9 8
9 – 12 3

This form of frequency distribution is known as ‘continuous frequency distribution’ (or) ‘exclusive type of
classification’.

Note: It should be understood that in the above classes, the upper limit of each class are excluded from the
respective classes.

2. Graphical representation of frequency distribution:

Histograms: In drawing the histogram of a given continuous frequency distribution, we first mark off along the x-
axis, all the class intervals on suitable scale. On each class intervals, draw rectangles with heights proportional to
the frequency of the corresponding class interval. The diagram of continuous rectangles so obtained is called
Histogram.

Note: To draw the histograms for an ungrouped frequency distribution of a variable, we shall have to assume that
h h
the frequency corresponding to the variable (x) is spread over the interval x  to x  where h is the jump from
2 2
one value to next. For example, consider the following frequency distribution.

Marks (x) Frequency (f)

1 3
2 7
3 10
4 5
5 4
h
Here, jump h  1   0.5
2

3. Frequency Polygon:

For an ungrouped distribution, the frequency polygon is

obtained by plotting points with abscissa as variable
values and ordinate has corresponding frequencies and
joining the plotted points by means of straight lines.

In case of grouped frequency distribution, the abscissas

of points are mid-values of class intervals.

(i) Arithmetic Mean

(a) The Arithmetic Mean of a set of observations x1,x 2 ,...xn is denoted by x and it is defined as
x1  x 2  ....x n
x
n
1
x  i
n n i1

x f ,
n
1
(b) In case of ungrouped frequency distribution xi | fi , i  1,2,3,....,n the Arithmetic Mean is given by x  i i
N i1

f
n
where N  i
i1

 f
n n
1
(c) In case of continuous frequency distribution, the arithmetic mean, x  xi fi , where N  i and xi is the
N i1 i1
mid value of class interval.

Example (1): Find Arithmetic Mean of following frequency distribution:

x 1 : 2 3 4 5 6 7
f 5 : 9 12 17 14 10 6
 fi xi 299
Solution: x 
 fi 73

Example (2): Find the Arithmetic Mean of the following table:

Marks : 0-10 10-20 20-30 30-40 40-50 50-60

No. of Students : 12 18 27 20 17 6

Solution:

Marks No. of Students Mid values of fi xi

fi  Class intervals
 xi 
0-10 12 5 60
10-20 18 15 270
20-30 27 25 675
30-40 20 35 700
40-50 17 45 765
50-60 6 55 330
N  100
 f x  2800
i i

2800
Arithmetic Mean x   28
100
Note:

(a) Shifting of origin:

x f
n
1
The Arithmetic Mean x  i i
N i1
Let di  xi  A, where A is a fixed constant for all i  1,2....n . Then xi  di  A
n n n n
1 1 A 1
 A.M., x 
N 
i1
fi  di  A  
N 
i 1
fidi 
N i1
fi  A 
N fd
i 1
i i

Solution:

Marks No. of Students Mid values of di  x i  A fi di

fi  Class intervals
 xi 
0-10 12 5 – 20 – 240
10-20 18 15 – 10 – 180
20-30 27 25 (A) 0 0
30-40 20 35 10 200
40-50 17 45 20 340
50-60 6 55 30 180
N  100
 f d  300
i i

Let A  25
300
 Arithmetic Mean  25   28
100
xi  A
(b) Change of Scale: Let di  , for all i  1,2,...n , where A is any constant and h is length of each class.
h
h
Then, x  A 
N f d
i i

Example (4): Find the Arithmetic Mean of the data given in Example (2) by change of scale.

Solution:

Marks No. of Students Mid values of xi  A fi di

di 
fi  Class intervals h
 xi 
0-10 12 5 –2 – 24
10-20 18 15 –1 – 18
20-30 27 25 (A) 0 0
30-40 20 35 1 20
40-50 17 45 2 34
50-60 6 55 3 18
N  100
 f d  30
i i

Let A  25 and h  10
h 10  30 
Then, x  A 
N  f d  25 
i i
100
 28

(c) Let the mean of m variables x1, x 2 ,...x m be x and the mean of y1,y 2 ,....yn be y . Then the mean of combined
mx  ny
data x1, x 2 .....xm ,y1,y 2 ....yn is equal to .
mn

Example (5): The average salary of a male employee is Rs 520 and that of a female is Rs 420. The mean salary of
all the employees is Rs. 500. Find the percentage of male and female employees.

Solution: Let the number of males and females be m and f respectively.

m  520   f  420 
Therefore, 500   m  4f
mf
 Males percentage = 80,
Females percentage = 20

Merits: (a) It is clearly defined.

(b) It is based up on all the observations.

Demerits: (a) A.M. cannot be obtained if a single observation is missing.

(b) It is affected very much by extreme values.
(c) It leads to wrong conclusions if the details of the data from which it is computed are not given.

(ii) Geometric Mean (G.M):

(a) The Geometric Mean of n observations x1, x 2 ,..xn is given by G   x1x 2 x3 .....xn 
1/ n
.
(b) In case of ungrouped frequency distribution xi | fi , i  1,2,...n the Geometric Mean is given by
n
 
1/ N
G  x1f1 x 2f 2 ...xnfn , where N  f .
i1
i

n
1
 logG 
N  f log x
i 1
i i

 
1/ N
(c) In case of grouped frequency distribution Geometric Mean is given by G  x1f1 x 2f 2 ...xnfn , where
n
N f
i1
i and x1,x 2 ,......xn are mid-values of the class intervals.

Note: Let G1 be the G.M. of x1, x 2 ,...x m and G2 be the G.M. of y1,y 2 ,....yn . Let G be the G.M. of combined data
x1, x 2 ,....xm ,y1,y 2 ....yn .
1/  m n  mlogG1  nlogG2
Then, G   x1x 2 ....xm y1y 2 y3 ...yn 
1/  m n 

 G  G1m Gn2   logG 
mn

Merits and demerits of G. M:

Merits: (a) It depends on all observations.

Demerits: (a) G.M. becomes zero if one of the observations is zero.
(b) Sometimes G.M. becomes imaginary if some of the observations are negative.

(iii) Harmonic Mean (H.M):

n n
(a) The Harmonic Mean of n observations x1, x 2 ,x3 ...x n is given by H.M.   .
1 1 1 n
1

x1 x 2
 .... 
xn i1
xi
(b) In case of ungrouped frequency distribution xi | fi , i  1,2,.....n , the Harmonic Mean is given by
n
N
H.M.  n
fi
, where N  f . i


i1
xi
i1

n
N
(c) In case of grouped frequency distribution, Harmonic Mean is given by H.M.  n
fi
, where N   f and i


i1
xi
i1

x1,x 2 ,......xn are mid-values of the class intervals.

Examples (6): A student starts from his house and reaches the college at speed of 10 kmph and back from college
to home at speed of 15 kmph. Find average speed.
Solution:
2x 2 t1
Vavg    12km / h x
t1  t 2 1 1 A B

v1 v 2
t2

Merits: It depends on all the observations.

Demerits: It is not easily understood and difficult to compute.

(iv) Median: Median of a distribution is the value of the variable which divides it into two equal parts.

(a) In case of ungrouped data, if the number of observations is odd then median is the middle value after values
have been arranged in ascending or descending order.

In case of even number of observations, there are two middle terms and median is obtained by taking A.M. of
middle two terms. In fact, any value lying between the middle two values can be taken as the median.

Eg: The median of 7, 4, 5, 9, 10 is 7.

The median of 7, 6, 3, 8, 10, 12 is 7.5

(b) In case of discrete (ungrouped) frequency distribution, median is obtained by considering the cumulative
frequencies. The steps for calculating median are given below.

f .
n
N
Step (1): Find , where N  i
2 i1
N
Step (2): See the less than cumulative frequency (l.c.f) just greater than .
2
Step (3): The corresponding value of x is the median.

Example (7): Find median of following data:

x: 1 2 3 4 5 6 7 8 9
f: 8 10 11 16 20 25 15 9 6

Solution:

x f c.f
1 8 8
2 10 18
3 11 29 N
4 16 45 Here,  60
2
5 20 65 Therefore, Median = 5
6 25 90
7 15 105
8 9 114
9 6 120
N=120

(c) In case of grouped frequency distribution the median is given by the following formula:
h N 
Median      c 
f  2 
where,  is the lower limit of the median class
h is the length of the median class
f is the frequency of the median class

f .
n
c is the cumulative frequency of the class preceding the median class and N  i
i1

Class Frequency c.f

Intervals
x1  x 2 f1 F1
x 2  x3 f2 F2

xk1  xk fk1 (c) Fk1

(I) xk  xk 1 fk (f) Fk

xn  xn1 fn Fn
N  fi

Let xk  xk 1 be median class.

Now if we assume that the variable values are uniformly distributed over the median class.
Then, from the figure,

slopeof AB  slope of AC
N
F
Fk  Fk 1 2 k 1
 
h x  xk
N
c
f hN 
  2  x     c
h x f 2 

Example (8): Find median of following distribution:

Class interval : 20-30 30-40 40-50 50-60 60-70

Frequency : 3 5 20 10 5

Solution:
Class Frequency c.f
Intervals
20-30 3 3
30-40 5 8
40-50 20 28
50-60 10 38
60-70 5 43
N = 43

N
Here ,  21.5 ,   40 , h  10 , f  20 and c  8
2
10
 Median  40  21.5  8  46.75
20

Merits and demerits of median:

Merits: (a) it is clearly defined.

(b) it is easy to calculate.
(c) it is not effected by extreme values.

Demerits: (a) In case of even number of observations, median can’t be determined exactly.
KUKATPALLY CENTRE: # 22-97, Plot No.1, Opp. Patel Kunta Huda Park, Vijaynagar Colony, Hyderabad - 500 072. Ph.: 040-64601123
MIYAPUR CENTRE: Above Sai Motors Maruthi Showroom, Allwyn X Road, Miyapur, Hyderabad.
(b) It does not depend on all the observations.

Uses: Median is the only average to be used while dealing with qualitative data which cannot be measured
quantitatively but still can be arranged in ascending or descending order of magnitude. For example, average
intelligence or average honesty.

(v) Mode: Mode is the value which occurs most frequently in a set of observations and around which other
observations of the set cluster densely.

Eg: The mode of 2, 2, 9, 1, 3, 3, 5, 6, 8, 9 is 2

Merits and demerits:

Merits: (a) it is easy to calculate.

(b) it is not affected by extreme values.

Demerits:
(a) Mode is ill-defined. For example, the data 1, 2, 7, 2, 3, 4, 5, 3 have two modes. This kind of data is called
bimodal.
(b) It is not based upon all the observations

Uses: Mode is an average to be used to find the ideal size.

Note: (a) For a symmetrical distribution mean, median and mode coincide.
For example, for the following data Mean = Median = Mode = 5

x f

1 1

2 3

3 5

4 3

5 1

(b) If the distribution is moderately asymmetrical Mean, Median and Mode obey the following empirical
relation.
Mode  3 Median   2 Mean 

5. Partition values: These are the values which divide the series into a number of equal parts.

(i) The three points which divide the series into 4 equal parts are called quartiles. The first, second and third points
are known as first, second and third quartiles, respectively. The first quartile ( Q1 ) is the value which exceed 25% of
observations and is exceeded by 75% of observations. The second quartile ( Q2 ) is the value which exceed 50% of
observations and is exceeded by 50% of observation. The third quartile ( Q3 ) is the value which exceed 75% of
observations and is exceeded by 25% of observation.

(ii) The nine points which divide the series into ten equal parts are called deciles. The first, second, …., ninth
deciles are represented by D1,D2 ,.....D9 respectively.

(iii) The ninety nine points which divide the series into hundred equal parts are called percentiles. The first, second,
percentiles are represented by P1,P2 ,..... respectively.

x: 0 1 2 3 4 5 6 7 8
f: 1 9 26 59 72 52 29 7 1

Solution:
N
(i)  128  Median  4
2
x f c.f
0 1 1 N
(ii)  64  Q1  3
1 9 10 4
2 26 36 N
 128  Q2  4
3 59 95 2
4 72 167 3N
 192  Q3  5
5 52 219 4
6 29 248
4N
7 7 255 (iii)  102.4  D4  4
10
8 1 256
N  256 27N
(iv)  69.12  P27  3
100

Mode:

A   x k ,fk 1  , B   xk 1,fk 
D   xk ,fk  , C   xk 1,fk 1 
fk  fk 1
y  fk 1   x  xk 
h
f f
y  fk  k 1 k  x  xk 
h
x  xk
 fk  fk 1   2fk  fk 1  fk 1 
h
h  fk  fk 1 
 x  xk 
2fk  fk 1  fk 1
h  fk  fk 1 
 Mode   
2fk  fk 1  fk 1

1. Measure of dispersion:
Consider the series (i) 7, 8, 9 (ii) 8, 8, 8 (iii) 4,8,12

In all these cases, we see that number of observations is 3 and the mean is 8. If we are given that, the mean of 3
observations is 8, we cannot form an idea as to whether it is the average of first series or second or third series or of
any other series of 3 observations whose sum is 24. Thus, we see that measure of central tendency is inadequate to
give us a complete idea of the distribution. They must be supported and supplemented by some other measures.
One such measure is dispersion.

The following are the measures of dispersions:

(i) Range,
(ii) Quartile deviation (or) semi-interquartile range,
(iii) Mean deviation,
(iv) Standard deviation, and
(v) Root mean square deviation.

Range: The range is the difference between two extreme observations of the distribution. If A and B are the greatest
and smallest observations respectively in a distribution, then its range is A-B.

Range is the simplest, but a crude measure of dispersion. Since it is based on two extreme observations, it is not at
all a reliable measure of dispersion.

1
(ii) Quartile Deviation: Quartile deviation Q is given by Q   Q3  Q1  , where Q1 and Q3 are the first and third
2
quartiles of the distribution respectively.

Quartile deviation is definitely a better measure than range as it makes use of 50% of the data but since it ignores
the other 50 % of the data, it cannot be regarded as a reliable measure.

(iii) Mean Deviation: Consider n observations x1, x 2 ,x3 ,.....xn . Then mean deviation from the average A (usually
n
1
A.M, median or mode) is given by
n  x A .
i 1
i

 1
n n
In case of frequency distribution xi | fi , i  1,2,......,n , Mean deviation =
i 1
fi xi  A  , where N 

 i1
fi . 
N   
Since mean deviation is based on all observations, it is better measure of dispersion than quartile deviation.

(iv) Standard Deviation: The standard deviation of n observations x1, x 2 ,x3 ...x n is denoted by  , and it is given by

 x1  x    x 2  x   ....  xn  x 
2 2 2
1
x  x
2
  i , where x  AM of x1,x 2 .....xn .
n n

In case of frequency distribution xi | fi , i  1,2,....n standard deviation is given by

n n
1
 fi  xi  x  , where N  f .
2
 i
N i 1 i1

The square of standard deviation is called variance ( 2 )

n n
1
 fi  xi  x  , where N  f .
2
2  i
N i1 i1

KUKATPALLY CENTRE: # 22-97, Plot No.1, Opp. Patel Kunta Huda Park, Vijaynagar Colony, Hyderabad - 500 072. Ph.: 040-64601123
MIYAPUR CENTRE: Above Sai Motors Maruthi Showroom, Allwyn X Road, Miyapur, Hyderabad.
(v) Root mean square deviation: The root mean square deviation of n observations x1,x 2 ,....xn about any constant
A is denoted by ‘s’ and it is given by

 A  x1 2   A  x2 2  .....   A  xn 2 1
n

A  x 
2
s  i .
n n i 1

n n
1
 fi  xi  A  , where N  f .
2
In case of frequency distribution xi | fi , i  1,2,.....,n, we have s  i
N i 1 i1

s2 is called Mean square deviation.

2. Various formulae for Variance:

n
1
x  x
2
(a) 2  i
n i1
n n 2
1 1 x

n  x   2x  n  x  n n
i1
2
i
i1
i

n
1

n x
i1
2
i i  2x  x   x 2

n
1
 2 
n x i 1
2
i  x2

(b) The shifting of origin doesn’t change value of the variance i.e variance is independent of shifting the origin.

Proof: Let 2x be the variance of n observations x1 , x 2 .....xn .

n
1
 x  x
2
Then, 2x  2
i
n i 1
Let yi  xi  k , for all i  1, 2,...., n and k is a constant .
Let y be the variance of n observations y1 , y 2 .....yn .
2

n
1
 y   y
2
 2y  2
i
n i 1
n
1
x  k  x  k 2 2
 i
n i 1
n n n
k2
1   x 
1 2k

n 
i 1
xi2 
n  i1
xi 
n i1
2
 2kx  k 2

n
1
 x  x 2 2
 i  2x
n i1

(c) The variance is depending on change of scale

Proof: Let 2x be the variance of n observations x1 , x 2 .....xn .
n
1
 x  x
2
Then, 2x  2
i
n i 1
Let yi  hx i , for all i  1, 2,...., n and h is a non zero constant .
Let y be the variance of n observations y1 , y 2 .....yn .
2

1
n
1
n 1 n 
 yi2   y    hxi   hx   x   x
2 2 2 2
 2y   h2  2
  h2 2x
n n n i 
i 1 i 1  i 1 

  x  x  
n
1
 d2  2d  xi  x 
2
 i
n i 1

1
n 1 n 1
n 
x  x   x 
2
  d2  2d  xi 
n
i n n
i 1  i1 i 1

   d  2d  x  x     d2
2 2 2

 s2 = σ2 + d2 where, d2   x  A 
2

3. Coefficient of Dispersion:
Whenever we want to compare, the variability of two series which differ widely in their averages or which are
measured in different units, we do not merely calculate the measure of dispersion but we calculate the coefficients
of dispersion. The coefficient of dispersion (C.D) based on different measures of dispersion are as follows:

A B
(i) Based upon the range: C.D = ,where A and B are greatest and the smallest items in the series.
A B
Q  Q1
(ii) Based upon Quartile deviation: C.D  3
Q3  Q1

Mean deviation
(iii) Based upon Mean deviation: C.D 
Average from which it is calculated


(iv) Based upon standard deviation: C.D =
x

4. Coefficient of variation (C.V.): The coefficient of variation is given by C.V   100
x

Examples:

1. Find the mean deviation about the mean for 38, 70, 48, 40, 42, 55, 63, 46, 54, 44
Sol: Given data is 38, 70, 48, 40, 42, 55, 63, 46, 54, 44.
38+70  48+40  42+55+63+46+54+44 500
The arithmetic mean of the given data is, x    50
10 10
Then the absolute values |xi  x| are 12, 20, 2, 10, 8, 5, 13, 4, 4, 6
10

|xi  x| 84
i 1
 Mean deviation about the mean =   8.4
10 10

2. Find the mean deviation about the median for 13, 17, 16, 11, 13, 10, 16, 11, 18, 12, 17
Sol. Given data is 13, 17, 16, 11, 13, 10, 16, 11, 18, 12, 17.
On keeping the observations in ascending order 10, 11, 11, 12, 13, 13, 16, 16, 17, 17, 18
The median (b) of the given 11 observations is 13.
Then the absolute values |xi  b|are 3, 2, 2, 1, 0, 0, 3, 3, 4, 4, 5.

|xi  b| 27
i 1
Mean deviation about the median    2.45
11 11
3. Find the mean deviation about the mean for the following distribution.
xi 10 11 12 13
fi 3 12 18 12

Sol. Construct the following data to find mean deviation about mean
xi 10 11 12 13
fi 3 12 18 12
xi f i 30 132 216 156

|xi  x| 1.87 0.87 0.13 1.13

f i |xi  x| 5.61 10.44 2.34 13.56

Arithmetic mean, x 
 fi xi  534  11.87
 fi 45
4

 fi |xi  x| 45  31.95  0.71

1 1
Mean about the mean 
 fi i 1

4. Find the mean deviation about the median for the following frequency distribution.
xi 5 7 9 10 12 15
fi 8 6 2 2 2 6

Sol. We form the following table from the given data

xi fi c.f xi  M f i xi  M
5 8 8 2 16
7 6 14 0 0
9 2 16 2 4
10 2 18 3 6
12 2 20 5 10
15 6 26 8 48
N  26 84
N  26 
    13 (Value in the ‘c.f column’ which is just greater than 13 is 14). Median M  7
2  2 

Mean deviation about median M .D 

 fi xi  M  84  3.23
N 26

5. Find the mean deviation about the mean for the following data:
Marks obtained 0-10 10-20 20-30 30-40 40-50
No. of students 5 8 15 16 6

Sol. We take the assumed mean A  25 , Here, C  10 . Hence, we form the following table
xi  25
frequency  f i 
Class Midpoint
di  f i di xi  x f i xi  x
interval xi 10

 fi  50  N  fi di  10  fi xi  x  472
Here, N  50 . Mean x  A  C 

  fi di   25  10 10   25  2  27
   50 
 N 
5

 fi xi  x  50 472  9.44
1 1
 Mean deviation about the mean M.D 
N i 1

6. Find the mean deviation from the median for the following data
Age (years) 20-25 25-30 30-35 35-40 40-45 45-50 50-55 55-60
No. of workers  f i  120 125 175 160 150 140 100 30

Sol. We form the following table for the given data:

Class frequency Cumulative Mid point

xi  med. f i xi  med.
interval
 fi  frequency
c.f.
xi

20 – 25 120 120 22.5 15 1800

25 – 30 125 245 27.5 10 1250
30 – 35 175 420 32.5 5 875
35 – 40 160 580 37.5 0 0
40 – 45 150 730 42.5 5 750
45 – 50 140 870 47.5 10 1400
50 – 55 100 970 52.5 15 1500
55 – 60 30 1000 57.5 20 600

8
N = 1000  fi xi  med.  8175
i1
N 1000
Here th observation   500th observation lies in the class interval 35  40. This is the
2 2
median class.
 500  420 
 Median  L   N /2  p.c. f  / f  i  35    5
 160 
400
 35   35  2.5  37.50
160
8

 fi xi  med.  1000 8175  8.175

1 1
 Mean deviation about median 
N i 1

KUKATPALLY CENTRE: # 22-97, Plot No.1, Opp. Patel Kunta Huda Park, Vijaynagar Colony, Hyderabad - 500 072. Ph.: 040-64601123
MIYAPUR CENTRE: Above Sai Motors Maruthi Showroom, Allwyn X Road, Miyapur, Hyderabad.
Some Important results to be remembered:
nx  xi  x '
1. The A.M. of n numbers x1, x 2 ,x3 ,......, xn is x . If xi is replaced by x ' , then new average is .
n

2. Let x, 2 ,  are Mean, Variance and standard deviation of x1, x 2 ,x3 ,......, xn then,

Mean ( x ) Variance ( 2 ) Standard Deviation (  )

x1  k,x 2  k,x 3  k,......, xn  k x k 2 

k.x1,k.x 2 ,k.x 3 ,......,k.xn (k  0) k.x k 2 .2 k .

a. x1  b,a. x 2  b,a.x 3  b,......,a. xn  b ax  b a2  2 a .

3. Sum of deviations of variants from their A.M. is always zero. i.e.  x  x  0,  f x  x  0 .

i i i

 x  A
2
4. Sum of squares of deviations of variants is minimum when taken about the mean. i.e. if i is

minimum then A  x .

5. A.M. of a combined series: Let x and y are the A.M.s of two series of sizes m and n and X be the mean of

m.x  n.y
the combined data then X  .
mn
6. G.M. of a combined series: Let G1 and G2 are G.M.s of two series of sizes m and n and if G is geometric

m.logG1  n.logG2
mean of combined series, then logG  .
mn

7. Variance of a combined series: If m,n are the sizes, x, y the means and 1, 2 the standard deviation of two

1 m.x  n.y
series, then 2  [m.(12  d12 )  n.(22  d22 )] , where d1  x  X , d2  y  X and X  .
mn mn

n2  1
8. The variance of first n natural numbers is .
12

 xi  a  n and  x  a
n n
 na n,a  1 then the standard deviation of ‘n’ observations x1,x 2 ,,xn
2
9. If i
i1 i1

is a 1 .
10. If the distribution is moderately asymmetrical Mean, Median and Mode obey the empirical relation
Mode = 3.Median – 2.Mean.

Chapter 4 Analysis and Interpretation of Assessment Results
No ratings yet
Chapter 4 Analysis and Interpretation of Assessment Results
36 pages
Grade 10 Maths Summer Packet - 23
100% (1)
Grade 10 Maths Summer Packet - 23
32 pages
7CCMMS61 Statistics For Data Analysis: Francisco Javier Rubio Department of Mathematics
No ratings yet
7CCMMS61 Statistics For Data Analysis: Francisco Javier Rubio Department of Mathematics
19 pages
Statistics, mg4
No ratings yet
Statistics, mg4
58 pages
Statistics
No ratings yet
Statistics
30 pages
Statistics Class 11 Notes CBSE Maths Chapter 15 (PDF)
No ratings yet
Statistics Class 11 Notes CBSE Maths Chapter 15 (PDF)
13 pages
Statistics
No ratings yet
Statistics
17 pages
Statistics
No ratings yet
Statistics
12 pages
Data Analysis: Kulwant Singh Kapoor
No ratings yet
Data Analysis: Kulwant Singh Kapoor
60 pages
Statistic
No ratings yet
Statistic
30 pages
Part-I Central Tendency and Dispersion: Unit-3 Basic Statistics
No ratings yet
Part-I Central Tendency and Dispersion: Unit-3 Basic Statistics
32 pages
SST 101 Notes Courtesy of Rose Njeru
No ratings yet
SST 101 Notes Courtesy of Rose Njeru
76 pages
WithSolutionStatistics Sheet1
No ratings yet
WithSolutionStatistics Sheet1
52 pages
11statistics Sheet
No ratings yet
11statistics Sheet
41 pages
Statistics
100% (1)
Statistics
72 pages
Module 3 Measures of Central Tendency Final
No ratings yet
Module 3 Measures of Central Tendency Final
43 pages
Faraz Statistics
No ratings yet
Faraz Statistics
15 pages
MMW Module 4 - Statistics
No ratings yet
MMW Module 4 - Statistics
18 pages
Math CBSE Class 10th Statistics
No ratings yet
Math CBSE Class 10th Statistics
28 pages
Descriptive Statistics and Probability
No ratings yet
Descriptive Statistics and Probability
34 pages
14 Statistics 1
No ratings yet
14 Statistics 1
74 pages
Frequency Distribution & Measures of Central Tendency-1
No ratings yet
Frequency Distribution & Measures of Central Tendency-1
34 pages
CHAPTER 14 Statistics PYQ's Jkbose
No ratings yet
CHAPTER 14 Statistics PYQ's Jkbose
15 pages
Stati 4
No ratings yet
Stati 4
26 pages
Formula and Notes For Class 11 Maths Download PDF Chapter 15. Statistics
No ratings yet
Formula and Notes For Class 11 Maths Download PDF Chapter 15. Statistics
16 pages
Lecture - Measures of Central Tendency
No ratings yet
Lecture - Measures of Central Tendency
19 pages
Lecture Statistics
No ratings yet
Lecture Statistics
23 pages
Measures of Central Tendancies
No ratings yet
Measures of Central Tendancies
82 pages
Statistics
No ratings yet
Statistics
39 pages
Central Tendency (Stats)
100% (1)
Central Tendency (Stats)
89 pages
Statistics
No ratings yet
Statistics
35 pages
Fresher Four
No ratings yet
Fresher Four
33 pages
Statistics
No ratings yet
Statistics
6 pages
Chapter One Illustration
No ratings yet
Chapter One Illustration
9 pages
Chapter 7 Statistics (Central Tendency)
No ratings yet
Chapter 7 Statistics (Central Tendency)
9 pages
Basic Statistics
No ratings yet
Basic Statistics
57 pages
Ix. Introduction To Statistical Concepts: Frequency Distribution Measures of Central Tendency Measures of Variability
No ratings yet
Ix. Introduction To Statistical Concepts: Frequency Distribution Measures of Central Tendency Measures of Variability
119 pages
SMA 160 Stds Notes PDF
No ratings yet
SMA 160 Stds Notes PDF
41 pages
2 - Measures of Central Tendency - Maed 003
No ratings yet
2 - Measures of Central Tendency - Maed 003
28 pages
Statistics To Arithmetic Mean: Probability and Bio-Statistics
No ratings yet
Statistics To Arithmetic Mean: Probability and Bio-Statistics
43 pages
Maths Statistics LN PDF
100% (1)
Maths Statistics LN PDF
36 pages
Statistics Real Life Statistics:: The Various Terms Related To Statistics Are
No ratings yet
Statistics Real Life Statistics:: The Various Terms Related To Statistics Are
6 pages
MMW Module 4
No ratings yet
MMW Module 4
41 pages
CH 1 Central Tendency Class
No ratings yet
CH 1 Central Tendency Class
34 pages
Lecture 4 (Grouped Data)
No ratings yet
Lecture 4 (Grouped Data)
10 pages
Stat Unit 1
No ratings yet
Stat Unit 1
125 pages
Chapter 15 (3) NNN
No ratings yet
Chapter 15 (3) NNN
16 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
44 pages
Wikimama Class 11 CH 15 Statistics
No ratings yet
Wikimama Class 11 CH 15 Statistics
8 pages
Business Mathematics
No ratings yet
Business Mathematics
61 pages
Stat Review Lecture (Complete)
No ratings yet
Stat Review Lecture (Complete)
18 pages
Module 4 Part 1 - 082406
No ratings yet
Module 4 Part 1 - 082406
31 pages
Selfstudys Com File
No ratings yet
Selfstudys Com File
33 pages
Lecture Related ICE Location
No ratings yet
Lecture Related ICE Location
24 pages
Mean
No ratings yet
Mean
31 pages
Orca Share Media1680255383263 7047501875043098045
No ratings yet
Orca Share Media1680255383263 7047501875043098045
9 pages
Statistics 1
No ratings yet
Statistics 1
34 pages
Chapter Five
No ratings yet
Chapter Five
10 pages
61 Sample Chapter PDF
No ratings yet
61 Sample Chapter PDF
14 pages
Central Tendency
100% (1)
Central Tendency
216 pages
PwC's Executive Directors - Practices and Remuneration Trends Report 2020
No ratings yet
PwC's Executive Directors - Practices and Remuneration Trends Report 2020
97 pages
Lecture 3 Numerical Measures of Data
No ratings yet
Lecture 3 Numerical Measures of Data
36 pages
Cumulative Frequency (MJ 2019 To 2005)
No ratings yet
Cumulative Frequency (MJ 2019 To 2005)
28 pages
PANDAS - Series Dataframes
No ratings yet
PANDAS - Series Dataframes
118 pages
Measures of Position For Ungrouped Data
No ratings yet
Measures of Position For Ungrouped Data
18 pages
Semester 1 Exam
No ratings yet
Semester 1 Exam
39 pages
Unit 9 Measures of Variation and Skewness: 9.0 Objectives
No ratings yet
Unit 9 Measures of Variation and Skewness: 9.0 Objectives
24 pages
04-Topically S1-A4 Good
No ratings yet
04-Topically S1-A4 Good
24 pages
Data Analyst Cheat Sheet FROM Parth Roy
No ratings yet
Data Analyst Cheat Sheet FROM Parth Roy
59 pages
Economics (3041)
No ratings yet
Economics (3041)
2 pages
Calculating Measures of Position: Grade 10
No ratings yet
Calculating Measures of Position: Grade 10
10 pages
Quartiles For Ungrouped Data:: Measures of Location and Position
No ratings yet
Quartiles For Ungrouped Data:: Measures of Location and Position
3 pages
GRADE8
No ratings yet
GRADE8
10 pages
Statistics Using Excel - Group9
No ratings yet
Statistics Using Excel - Group9
22 pages
Unit 5 Fod (1) (Repaired)
No ratings yet
Unit 5 Fod (1) (Repaired)
28 pages
Lesson 11-6 Analysis Data
No ratings yet
Lesson 11-6 Analysis Data
57 pages
Statistics Theory Notes
No ratings yet
Statistics Theory Notes
21 pages
A Review of Statistical Outlier Methods
No ratings yet
A Review of Statistical Outlier Methods
8 pages
Introduction To Probability and Statistics - William Mendenhall, Robert J - Beaver, Barbara M - Beaver - 2009 - Anna's Archive
No ratings yet
Introduction To Probability and Statistics - William Mendenhall, Robert J - Beaver, Barbara M - Beaver - 2009 - Anna's Archive
164 pages
Sample Statistics Population Parameters: PTH Percentile
No ratings yet
Sample Statistics Population Parameters: PTH Percentile
6 pages
STATS 2nd Quarter Tutor
No ratings yet
STATS 2nd Quarter Tutor
3 pages
AL S1 1.2 Representation of Data
No ratings yet
AL S1 1.2 Representation of Data
20 pages
Team 19 Project Report
No ratings yet
Team 19 Project Report
27 pages
Introduction To Probability and Statistics 13th Edition: de Mendenhall, Beaver Et Beaver. STT1700 (Automne 2008)
No ratings yet
Introduction To Probability and Statistics 13th Edition: de Mendenhall, Beaver Et Beaver. STT1700 (Automne 2008)
47 pages
MATH01 Midterm Quiz No. 1 Statistics
No ratings yet
MATH01 Midterm Quiz No. 1 Statistics
4 pages
Lec 1
No ratings yet
Lec 1
54 pages
Department of Education: Republic of The Philippines
No ratings yet
Department of Education: Republic of The Philippines
7 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Statistucs Theory

Uploaded by

Statistucs Theory

Uploaded by

CPP STATISTICS

` KKP / MYP CENTRE

Marks (x) Number of

Marks Number of Students

Age (x) Number of people (f)

2. Graphical representation of frequency distribution:

Marks (x) Frequency (f)

For an ungrouped distribution, the frequency polygon is

In case of grouped frequency distribution, the abscissas

(i) Arithmetic Mean

Example (1): Find Arithmetic Mean of following frequency distribution:

Example (2): Find the Arithmetic Mean of the following table:

Marks : 0-10 10-20 20-30 30-40 40-50 50-60

Marks No. of Students Mid values of fi xi

(a) Shifting of origin:

Marks No. of Students Mid values of di  x i  A fi di

Marks No. of Students Mid values of xi  A fi di

Solution: Let the number of males and females be m and f respectively.

Merits: (a) It is clearly defined.

Demerits: (a) A.M. cannot be obtained if a single observation is missing.

(ii) Geometric Mean (G.M):

Merits and demerits of G. M:

Merits: (a) It depends on all observations.

(iii) Harmonic Mean (H.M):

x1,x 2 ,......xn are mid-values of the class intervals.

Merits: It depends on all the observations.

Eg: The median of 7, 4, 5, 9, 10 is 7.

Example (7): Find median of following data:

Class Frequency c.f

xk1  xk fk1 (c) Fk1

Let xk  xk 1 be median class.

Example (8): Find median of following distribution:

Class interval : 20-30 30-40 40-50 50-60 60-70

Merits and demerits of median:

Merits: (a) it is clearly defined.

Eg: The mode of 2, 2, 9, 1, 3, 3, 5, 6, 8, 9 is 2

Merits and demerits:

Merits: (a) it is easy to calculate.

Uses: Mode is an average to be used to find the ideal size.

The following are the measures of dispersions:

In case of frequency distribution xi | fi , i  1,2,....n standard deviation is given by

The square of standard deviation is called variance ( 2 )

s2 is called Mean square deviation.

2. Various formulae for Variance:

Proof: Let 2x be the variance of n observations x1 , x 2 .....xn .

(c) The variance is depending on change of scale

|xi  x| 1.87 0.87 0.13 1.13

f i |xi  x| 5.61 10.44 2.34 13.56

 fi |xi  x| 45  31.95  0.71

Sol. We form the following table from the given data

Mean deviation about median M .D 

Sol. We form the following table for the given data:

Class frequency Cumulative Mid point

20 – 25 120 120 22.5 15 1800

 fi xi  med.  1000 8175  8.175

Mean ( x ) Variance ( 2 ) Standard Deviation (  )

x1  k,x 2  k,x 3  k,......, xn  k x k 2 

k.x1,k.x 2 ,k.x 3 ,......,k.xn (k  0) k.x k 2 .2 k .

a. x1  b,a. x 2  b,a.x 3  b,......,a. xn  b ax  b a2  2 a .

3. Sum of deviations of variants from their A.M. is always zero. i.e.  x  x  0,  f x  x  0 .

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.