0% found this document useful (0 votes)
37 views10 pages

Statistics

This document provides definitions and explanations of basic statistical terms including: 1. Data can be raw (directly observed) or grouped into classes to make it more meaningful. Key terms include observation, frequency, range, class boundaries, class interval, and class mark. 2. There are two types of grouped frequency distributions - exclusive and inclusive forms. 3. Measures of central tendency described include the arithmetic mean (average) and median. The mean can be calculated for ungrouped and grouped data using direct, assumed mean, and step-deviation methods. The median is the middle value when data is arranged in order.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
37 views10 pages

Statistics

This document provides definitions and explanations of basic statistical terms including: 1. Data can be raw (directly observed) or grouped into classes to make it more meaningful. Key terms include observation, frequency, range, class boundaries, class interval, and class mark. 2. There are two types of grouped frequency distributions - exclusive and inclusive forms. 3. Measures of central tendency described include the arithmetic mean (average) and median. The mean can be calculated for ungrouped and grouped data using direct, assumed mean, and step-deviation methods. The median is the middle value when data is arranged in order.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 10

14.

STATISTICS
n Some Basic Terms
l Data : The word data means information in the form of numerical figures or a set of given facts.
l Raw data : Data obtained from direct observation is called raw data.
The marks obtained by 10 students in a monthly test is an example of raw data or ungrouped data.
l Grouped data : To present the data in a more meaningful way, we condense the data into convenient
number of classes or groups, generally not exceeding 10 and not less than 5. This helps us in perceiving
at a glance, certain salient features of data.
l Observation : Each numerical figure in a data is called an observation.
l Frequency of an observation : The number of times a particular observation occurs is called its frequency.
l Range : The difference between the maximum and the minimum values of the given observations is called the
range of the data.
Given x1, x2..... xn (n individual observations)
Range = (Maximum Value) – (Minimum Value)
l Class boundaries or true upper and true lower limits
} In the exclusive form, the upper and lower limits of a class are respectively known as the true upper limit
and true lower limit.
} In the inclusive form, the number midway between the upper limit of a class and lower limit of the
subsequent class gives the true upper limit of the class and the true lower limit of the subsequent class .
l Class interval : Each group into which the raw data is condensed, is called a class-interval.
l Class size :The difference between the true upper limit and the true lower limit of a class is called its class size.
l Class mark of a class
æ True upper limit + True lower limit ö
Class mark = ç ÷
è 2 ø

The difference between any two successive class marks gives the class size.

n Types of grouped frequency distribution


l Exclusive form (or continuous interval form) : A frequency distribution in which the upper limit of each
class is excluded and lower limit is included, is called an exclusive form.
Ex. Suppose the marks obtained by some students in an examination are given.
We may consider the classes 0 – 10, 10 – 20 etc. In class 0 – 10, we include 0 and exclude 10. In class
10 - 20, we include 10 and exclude 20.
l Inclusive form (or discontinuous interval form) : A frequency distribution in which each upper limit as well
as lower limit is included, is called an inclusive form. Thus, we have classes of the form 0 – 10, 11 – 20, 21 – 30
etc. In 0 – 10, both 0 and 10 are included.
n Measures of central tendency
l Arithmetic mean
The average of numbers in arithmetic is known as the Arithmetic Mean or simply the mean of these numbers in
statistics.

Sum of observations
Mean =
Number of observations
l Mean of ungrouped Data
The mean of n observations x1, x2, ...., xn is given by

(x 1 + x 2 + x 3 + ...... + x n ) å x i
Mean, x = =
n n

where the symbol å, called sigma stands for the summation of the terms.

l Mean for an ungrouped frequency distribution


} Direct method
Let n observations consist of values x1, x2, ...., xn of a variable x, occurring with frequencies f1, f2, .... , fn
respectively.
Then, the mean of these observations is given by :

(f1 x1 + f2 x 2 + ... + fn x n ) å fi x i
Mean, x = =
(f1 + f2 + ..... + fn ) å fi

} Shortcut method or assumed mean method


In this case, to calculate the mean, we follow the following steps :
Step-1 : Find the class mark xi of each class using

lower limit + Upper limit


xi =
2

Step-2: Choose a suitable value of xi in the middle as the assumed mean and denote it by 'a'.
Step-3: Find di = xi – a for each i
Step-4: Find fi × di for each i
Step-5 : Find N = Sfi

S f id i
Step-6 : Calculate the mean, ( x ) by using the formula x = a + .
N

} Step-deviation method or short-cut method

Sometimes, the values of x and f are so large that the calculation of mean by assumed mean method
becomes quite inconvenient. In this case, we follow the following steps:
Step-1 : Find the class mark xi of each class by using

lower limit + Upper limit


xi =
2

Step-2: Choose a suitable values of xi in the middle as the assumed mean and denote it by 'a'.
Step-3 : Find h = (upper limit –lower limit) for each class.

xi – a
Step-4 : Find ui = for each class.
h

Step-5 : Find fiui for each i.

ì Sfi ´ u i ü
Step-6 : Calculate, the mean by using the formula x = a + í ý × h, where N = Sfi.
î N þ
Important facts about mean
n
} The algebraic sum of deviations taken about the mean is zero. i.e., å (x
i= 1
i - x) = 0

} The value of the mean depends on all the observations.

a+b
} The A.M. of two numbers a and b is
2

} Combined mean : If x1 and x2 are the arithmetic means of two series with n1 and n2. Observations
respectively, then the combined mean is :

n1 x1 + n2 x2
xc =
n1 + n2

The above result can be extended to any number of groups of data.


} If x is the mean of x1, x2,..... xn, then the mean of x1 + a, x2 + a, x3 + a,.... xn + a is x + a, for
all values of a.

x1 x 2 x x
} If x is the mean of x1, x2,....xn, then the mean of ax1, ax2,....axn is a x and that of , ,.... n is
a a a a

æ n +1 ö
} The mean of the first n natural numbers is ç ÷
è 2 ø
(n + 1)(2n + 1)
} The mean of the squares of the first n natural numbers =
6
2
æ n(n + 1) ö
} The mean of the cubes of the first n natural numbers = ç ÷
è 2 ø
l Median
} Median of ungrouped data
After arranging the given data in an ascending or a descending order of magnitude, the value of the
middle-most observation is called the median of the data.
} Method for finding the median of an ungrouped data
Arrange the given data in an increasing or decreasing order of magnitude. Let the total number of
observations be n.

æ n +1 ö
(i) If n is odd, then median = value of ç ÷ th observation.
è 2 ø
(ii) If n is even, then median
1 ìæ n ö æn ö ü
= í th observation + ç + 1 ÷ th observation ý
2 îçè 2 ÷ø è2 ø þ
} Median of a grouped data

Median : It is a measure of central tendency which gives the value of the middle most observation in the
data. In a grouped data, it is not possible to find the middle observation by looking at the cumulative
frequencies as the middle observation will be some value in a class interval. It is, therefore, necessary to
find the value inside a class that divides the whole distribution into two halves.

N
Median Class : The class whose cumulative frequency is greater than is called the median class.
2
To calculate the median of a grouped data, we follow the following steps
Step-1 : Prepare the cumulative frequency table corresponding to the given frequency distribution and obtain
N = Sfi.

N
Step-2 : Find
2

N
Step-3 : Look at the cumulative frequency just greater than and find the corresponding class (Median class).
2

ìN ü
ïï 2 – C ïï
Step-4 : Use the formula Median, M = l + í ý´ h
ï f ï
ïî ïþ
Where l = Lower limit of median class.
f = Frequency of the median class.
C = Cumulative frequency of the class preceding the median class.
h = Size of the median class.
N = Sfi.
Important facts about median
} The median does not take into consideration all the items.
} The sum of absolute deviations taken about the median is the least.
} The median can be calculated graphically while the mean cannot be.
} The median is not effected by extreme values.
} The sum of deviations taken about median is less than the sum of absolute deviations taken from any
other observation in the data.

l Mode

Mode is that value among the observations which occurs most often i.e. the value of the observation having the
maximum frequency.
} Mode of a grouped data

In a grouped frequency distribution, it is not possible to determine the mode by looking at the frequencies.

Modal Class : The class of a frequency distribution having maximum frequency is called modal class of a

frequency distribution.

The mode is a value inside the modal class and is calculated by using the formula.

ì f1 – f0 ü
Mode = l + í 2f - f - f ý ´ h
î 1 0 2þ

Where l = Lower limit of the modal class.


h = Size of class interval
f1 = Frequency of modal class
f0 = Frequency of the class preceding the modal class
f2 = Frequency of the class succeeding the modal class.
Important facts about mode
} For a given data, the mode may or may not exist. In a series of observations, if no item occurs more
than once, then the mode is said to be ill-defined.
} If the mode exists for a given data, it may or may not be unique.
} Data having unique mode is uni-model while data having two modes is bi-model.
} It can be calculated graphically.
} It is not effected by extreme values.
} it can be used for open-ended distribution and qualitative data.

l Cumulative frequency polygon curve (Ogive)


Cumulative frequency is of two types and corresponding to these, the ogive is also of two types.
• Less than series • More than series
• Less than series : To construct a cumulative frequency polygon and an ogive, we follow these steps :
Step-1 : Mark the upper class limit along x-axis and the corresponding cumulative frequencies along y-axis.
Step-2 : Plot these points successively by line segments. We get a polygon, called cumulative frequency polygon.
Step-3 : Plot these points successively by smooth curves, we get a curve called cumulative frequency curve or
an ogive.
• More than series To construct a cumulative frequency polygon and an ogive, we follow these steps:
Step-1 : Mark the lower class limits along x-axis and the corresponding cumulative frequencies along y-axis.
Step-2 : Plot these points successively by line segments, we get a polygon, called cumulative frequency polygon.
Step-3 : Plot these points successively by smooth curves, we get a curve, called cumulative frequency curve or
an ogive.

Application of an ogive
Ogive can be used to find the median of a frequency distribution. To find the median, we follow these steps.
Method-I
Step-1 : Draw anyone of the two types of frequency curves on the graph paper.

N
Step-2 : Compute where (N = Sfi ) and mark the corresponding points on the y-axis.
2

Step-3 : Draw a line parallel to x-axis from the point marked in step 2, cutting the cumulative frequency curve
at a point P.
Step-4 : Draw perpendicular PM from P on the x-axis. The x - coordinate of point M gives the median.
Method-II
Step-1 : Draw less than type and more than type cumulative frequency curves on the graph paper.
Step-2 : Mark the point of intersecting (P) of the two curves drawn in step 1.
Step-3 : Draw perpendicular PM from P on the x-axis. The x- coordinate of point M gives the median.
n Mean deviation
l Mean deviation (M.D.) for ungrouped or raw data

M.D. =
å|x - x| where x is each observation
i
n

x is arithmetic mean, or median or mode as specified in the problem, and n is the number of observations.
l Mean deviation for discrete data

Mean deviation (M.D.) =


åx i - x fi
or
å fD
N N

Where xi is each of the given observations and fi is their corresponding frequencies.

N = å fi (sum of the frequencies) and D = xi - x

l Mean deviation for grouped data

M.D. =
åf i xi - x
or
å fD
N N

Where xi is mid-value of each class, fi is the corresponding frequency.


x is mean or median or mode.
N is sum of frequencies.

D = x-x

Important facts based on M.D.


} M.D. depends on all observations.
} By default, M.D. is to be computed about mean.
} M.D. about median is the least.

|a - b|
} M.D. of two numbers of a and b is .
2
STATISTICS EXERCISE
Direction (Q.1-2) : These questions are based on 6. The mean of 1,3,4,5,7,4, is m. The numbers
the following data (figure) 3,2,2,4,3,3 have mean m – 1 and median q, then
p+q=
(1) 7 (2) 6 (3) 5 (4) 4
7. A class of 40 students is divided into four groups
y-axis
named as A, B, C and D. Percentage of marks
60 scored by them are given below group wise in a
55
table.
50
45
Cumulative frequency

40
A B C D
35
30 20 42 10 21
25 30 51 25 69
20 40 45 85 70
15 25 58 73 86
10 22 53 98 53
5 45 64 43 68
0 65 72 64 99
x-axis
0 10 20 30 40 50 60 70 80 90 100 110 120
True limits of the classes
By using coefficient of range say which of the groups
has shown good performance.
(1) A (2) B (3) C (4) D
The given figure represents the percentage of
8. Life (in h) of 10 bulbs from each of four different
marks on x-axis and the number of students on y-
companies A, B, C and D are given below in the
axis.
table.
1. Find the number of students who scored less than
or equal to 50% marks.
(1) 35 (2) 15 (3) 20 (4) 30
A B C D
2. Find the number of students who scored greater than
120 700 950 530
or equal to 90% of marks.
1600 502 330 650
(1) 47 (2) 45 (3) 5 (4) 10 280 1430 1200 720
3. In a class of 15 students, on an average, each 420 625 500 550
student got 12 books. If exactly two students 800 780 445 748
received same number of books, and remaining 770 335 1260 570
students books average be an integer then which 270 224 375 635
of the following could be the number of books 455 1124 1130 804
150 473 185 500
received by each of the two students who received
same number of books?
(1) 11 (2) 15 (3) 20 (4) 25
By using the coefficient of range which company has
4. If the mean of observations x1, x2, ..., x10 is x then shown the best consistency in its quality?
the mean of x1 + a, x2 + a, x3 + a,...,xn + a is (1) A (2) B (3) C (4) D
9. If the mode of the observations 5, 4, 4, 3, 5, x, 3,
x 4, 3, 5, 4, 3 and 5 is 3, then find the median of the
(1) ax (2) x – a (3) x + a (4)
a observations.
(1) 3 (2) 4 (3) 5 (4) 3.5
n2
5. The mean of first n odd natural numbers is , then n 10. If the ratio of mean and median of a certain data is
81
2 : 3, then find the ratio of its mode and mean.
(1) 9 (2) 81 (3) 27 (4) 18 (1) 2 : 5 (2) 3 : 2 (3) 5 : 2 (4) 1 : 2
11. If the mean of n observations ax1, ax2, ax3,..., axn 21. The mean of the following distribution is 5, then find
is ax then, (ax1 - ax) + (ax 2 - ax)+...+ (axn - ax) = the value of B.

(1) ax (2) – ax x 3 5 7 4
(3) 0 (4) ax1 + axn f 2 A 5 B
12. If the arithmetic mean of the observations x1, x2, x3
...... x n is 1, then the arithmetic mean of (1) 10 (2) 6 (3) 8 (4) None

x1 x 2 x3 x n a+b a-b
, , ,... 22. The mean deviation of and (where a
k k k k 2 2
(k > 0) is and b > 0) is ________.
(1) greater than 1 (2) less than 1
b a
(3) equal to 1 (4) None of these (1) (2) (3) a (4) b
13. Range of 14, 12, 17, 18, 16 and x is 20. Find 2 2
x(x > 0) 23. If the mean of x + 2, 2x + 3, 3x + 4 ansd 4x + 5
(1) 2 (2) 28 is x + 2, the find the value of x.
(3) 32 (4) Can’t be determined (1) 0 (2) 1 (3) –1 (4) 2
14. The mean of a set of observation is a. If each 24. The range of 15, 14, x, 25, 30, 35 is 23. Find the
observation is multiplied by b and each product is least possible value of x.
decreased by c, then the mean of new set of (1) 14 (2) 12 (3) 13 (4) 11
observations is___________. 25. Find the median of the following data.

a a C.I. 0-10 10-20 20-30 30-40


(1) +c (2) ab – c (3) –c (4) ab + c
b b
f 12 13 25 20
15. The mean deviation of first 8 composite numbers
is___________. (1) 25 (2) 23 (3) 24 (4) 26
(1) 2.9375 (2) 4.83 26. The arithmetic mean (average) of five numbers is
(3) 5.315 (4) 3.5625 – 5. the sum of two of them is 50, what is the aver-
16. Find mode of the following discrete series. age of the other three ?
(1) 25 (2) 10 (3) –10 (4) – 25
x 1 2 3 4 5 6 7 8 27. The median of the following discrete series is
f 5 4 6 8 12 3 9 10
x 3 6 5 8 12 7
(1) 4 (2) 8 (3) 5 (4) 7 f 5 2 4 6 7 6
17. The mean of 10 numbers is 30. If 4 is subtracted
from every number, what will be the new mean ? (1) 7 (2) 8 (3) 9 (4) 4
(1) 30 (2) 70 (3) 26 (4) 35 28. fnd the mode of the following numbers
18. Find the mean deviation (approximately) from the 1, 0, 1, 2, 2, 5, 6, 2, 1, 7, 8, 6.
mode for the following ungrouped data. (1) 2 (2) 1 (3) 3 (4) None
20, 25, 30, 18, 15, 40 29. If mean of the following distribution is 13, then the
(1) 6.71 (2) 4.52 (3) 7.61 (4) 5.33 value of p is
19. The arithmetic mean of 12 observations is 15. If two
x 5 10 12 17 16 20
observations 20 and 25 are removed then the
arithmetic mean of remaining observations is f 9 3 p 8 7 5
(1) 14.5 (2) 13.5 (3) 12.5 (4) 13 (1) 6 (2) 7
20. The arithmetic mean and mode of a data is 24 and (3) 10 (4) 4
12 respectively, then the median of the data 30. If the ratio of mode and median of a certain data is
is_________. 6 : 5, then find the ratio of its mean and median.
(1) 20 (2) 18 (1) 8 : 9 (2) 9 : 10
(3) 20 (4) 22 (3) 9 : 7 (4) 8 : 11
31. If the arithmetic mean of the following distribution 39. If the difference of mode and median of a data is
is 8.2, then find the value of p. 24, then the difference of median and mean is
(1) 12 (2) 24
x 1 3 5 9 11 13 (3) 8 (4) 36
f 3 2 7 p 4 8 40. The mode of the observations 2x + 3, 3x – 2, 4x +
3, x – 1, 5x + 2 (x is a positive integer) can be
(1) 5 (2) 6 (3) 9 (4) None (1) 3 (2) 5
32. The median of the series 8, 12, 15, 7, x, 19 and (3) 7 (4) 9
22 lies in the interval. 41. The median of 21 obseravtions is 18. If two
(1) [12, 15] (2) [7, 15] (3) [15, 17] (4) [9, 12] observations 15 and 24 are included to the
33. The mode of the following distribution observations, the median of new series is
(1) 15 (2) 18
C.I. 1-5 6-10 11-15 16-20 21-25
(3) 24 (4) 16
f 4 7 10 8 6 42. Find the median of the following observations
(1) 14.5 (2) 16.5 (3) 10.5 (4) 13.5 6, 19, 14, 46, 16, 42, 26, 32, 38
34. The mean of the following data is (1) 26 (2) 28
(3) 30 (4) 32
C.I. 10-15 15-20 20-25 25-30 30-35 43. The upper class limit of inclusive type class interval
f 5 7 3 4 8 10-20 is_______.
(1) 22 (2) 23.05 (1) 10.5 (2) 20
(3) 24.05 (4) 27.05 (3) 20.5 (4) 17.5
35. The median of the following frequency distribution is 44. The daily wages (in rupees) of 100 labourers is given
in the following frequency distribution. Find the
C.I. 0-10 10-20 20-30 30-40 40-50 mean of the data.
f 5 8 7 10 20
Daily
(1) 35 (2) 30 (3) 40 (4) 45 21-40 41-60 61-80 81-100 101-120
wages
36. Complete frequency distribution is given as follows
No. of 10 25 40 20 5
labours
Class 0-4 4-8 8-12 12-16 16-20 20-24
Frequency 8 12 x1 25 x2 7 (1) 67 (2) 67.5 (3) 68 (4) None
45. Find the mode of the following discrete series.
Total 68
x 1 3 5 6 12 15
Given that the median is 13.76, determine the
missing frequencies. f 5 7 3 8 6 5
(1) 2,14 (2) 4,12 (1) 3 (2) 12 (3) 8 (4) 6
(3) 3,13 (4) None 46. The number of students absent in a class were re-
5n corded everyday for 120 days and the information
37. The mean of first n natural numbers is . Find n. is given in the following frequency table. Find the
9
mean number of students absent per day.
(1) 5 (2) 4
(3) 9 (4) None
No. of students
38. Mean of a certain number of obseravtions is m. If 0 1 2 3 4 5 6 7
absent
each observation is divided by x (x ¹ 0) and increased No. of days 1 4 10 50 34 15 4 2
by y, then the mean of new obseravations is
(1) 3 (2) 3.25
mx + y
(1) mx + y (2) (3) 3.525 (4) None
x 47. If a mode exceeds a mean by 12 then the mode
exceeds the median by ______.
m + xy
(3) (4) m + xy (1) 4 (2) 8
x
(3) 6 (4) 10
48. The heights of 31 students in a class are given below. 55. If the mean of x1 and x2 is M1 and that x1, x2, x3,
x4 is M2. ax1, ax2, x3/a, x4/a is
Height
126 127 128 129 130 131 132 M1 + M 2 aM1 + (M2 /2)
(in cm) (1) (2)
2 2
Number of
7 3 4 2 5 6 4
Students 1 1
(3) [(a2–1) M1+2M2] (4) [2(a2–1)M1+M2]
2a 2a
Find the median of the above frequency distribution.
(1) 129 (2) 130 (3) 128 (4) 131 1
56. If the mean of x and is M, the mean of x3 and
49. Mean of 11 observations is 17.5. If an observation x
15 is deleted, find the mean of the remaining ob- 1
servations. is
x3
(1) 17.5 (2) 17.25
(M2 – 3)
(3) 17.75 (4) None (1) M = (2) M(4M2–3)
2
50. If 6,4,5 and 3 occur with frequencies 2,2,5 and 4
respectively, the arithmetic mean is (3) M3 (4) M3+3
(1) 6 (2) 4.38 (3) 6.25 (4) 5.42 57. If the mean of a, b, c is M and ab+bc+ca=0, then
51. Find the arithmetic mean of the first 567 natural the mean of a2, b2, c2 is
numbers. (1) 5 M2 (2) 3 M2
(1) 284 (2) 283.5 (3) M2 (4) 9 M2
(3) 283 (4) None of these x i – 25
52. The mean of 15 numbers is 25. If each number is 58. If u i = ,Sfiui = 20, Sfi, = 100 then X is
10
multiplied by 4, mean of the new numbers is :
equal to
(1) 60 (2) 100
(1) 27 (2) 25 (3) 30 (4) 35
(3) 10 (4) None of these
59. If the mean of a, b, c is M and ab+bc+ca=0, the
53. The average marks, in a class of 30 students, are
mean of a2, b2, c2 is KM2 then find the value of
found to be 45. On checking two mistakes we found
K=
after correction. If one student got 45 marks more
(1) 1 (2) 2 (3) 3 (4) 4
and another students got 15 marks less, then
correct average marks are 1
60. If the mean of x and is M, the mean of x2 and
(1) 45 (2) 44 (3) 47 (4) 46 x
54. If the arithmetic mean of n numbers of a series is
1
X and the sum of (n–1) numbers is k, then then nth is KM2–1 then find the value of K=
x2
number is
(1) M2–1 (2) 2M2–1 (3) 1–M2 (4) 2–M2
(1) n+k (2) n X + k (3) n X – k (4) n–k

ANSWER KEY

Que. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Ans. 2 3 4 3 2 1 2 4 2 3 3 4 3 2 1 3 3 3 2
Que. 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40
Ans. 2 1 3 2 3 4 1 2 2 2 2 1 4 2 1 3 3 3 1
Que. 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60
Ans. 2 1 2 2 4 3 2 1 3 2 1 2 4 3 3 2 2 1 3

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy