Chapter 24 Nios
Chapter 24 Nios
Statistics
24 Notes
Statistics is a special and an important branch of mathematics which deals mainly with data
and their representations. In this lesson, we shall make a beginning of this study of this
branch of mathematics with collection, classification, presentation and analysis of data.
We shall study how to classify the given data into ungrouped as well as grouped frequency
distributions. We shall also learn about cumulative frequency of a class and cumulative
frequency table.
Further we shall learn graphical representation of data in the form of bar charts, histograms
and frequency polygons.
OBJECTIVES
After studying this lesson, you will be able to
• know meaning of ‘statistics’ in singular and plural form;
• differentiate between primary and secondary data;
• understand the meaning of a class, class mark, class limits, discrete and
continuous data, frequency of a class, class size or class width through examples;
• condense and represent data into a frequency table;
• form a cumulative frequency table of a frequency distribution;
• draw a bar chart or bar graph of a frequency distribution;
• draw a bar chart or bar graph for the given data;
• draw a histogram and frequency polygon for a given continuous data;
• read and interpret given bar graphs, histograms.
(a) Statistics, in singular sense, means the subject which deals with _______, _____,
analysis of data as well as drawing of meaningful _______ from the data.
(c) The data are said to be __________ if the investigator himself is responsible for
its collection.
(d) Data taken from governmental or private agencies in the form of published reports
are called __________ data.
(e) Statistics is the science which deals with collection, organisation, analysis and
interpretation of the ____________.
2. Javed wanted to know the size of shoes worn by the maximum number of persons in
a locality. So, he goes to each and every house and notes down the information on a
sheet. The data so collected is an example of ___________ data.
Notes
3. To find the number of absentees in each day of each class from I to XII, you collect the
information from the school records. The data so collected is an example of _______
data.
60
we decide to have 9 classes. Than the size of each class is ≈7.
9
Step 4: Next, we set up the class limits using the size of the interval determined in
Notes Step 3. We make sure that we have a class to include the minimum as well as a
class to include the maximum value occurring in the data. The classes should be
non-overlapping, no gaps between the classes, and classes should be of the
same size.
Step 5: We take each item (observation) from the data, one at a time, and put a tally
mark (|) against the class to which it belongs. For the sake of convenience, we
record the tally marks in bunches of five, the fifth one crossing the other four
diagonally as ||||.
Step 6: By counting tally marks in each class, we get the frequency of that class. (obviously,
the total of all frequencies should be equal to the total number of observations in
the data)
Step 7: The frequency table should be given a proper title so as to convey exactly what
the table is about.
Using the above steps, we obtain the following table for the marks obtained by 20 students.
Frequency Table of the marks obtained by 20 students in a mathematics test
Class Interval Tally Marks Frequency
(Marks out of 100)
28-34 ||| 3
35-41 | 1
42-48 | 1
49-55 – 0
56-62 |||| ||| 8
63-69 || 2
70-76 |||| 4
77-83 – 0
84-90 | 1
Total 20
The above table is called a frequency distribution table for grouped data or briefly, a
grouped frequency table. The data in the above form are called grouped data.
In the above table, the class 28-34 includes the observations 28, 29, 30, 31, 32, 33 and
34; class 35-41 includes 35, 36, 37, 38, 39, 40 and 41 and so on. So, there is no
overlapping.
For the class 28-34, 28 is called the lower class limit and 34, the upper class limit, and
so on.
From this type of presentation, we can draw better conclusions about the data. Some of
these are. Notes
Suppose two students of weights 35.5 kg and 50.54 kg are admitted in this class. In which
class (interval) will we include them? Can we include 35.5 in class 31-35? In class 36-40?
No! The class 31-35 includes numbers upto 35 and the class 36-40, includes numbers
from 36 onwards. So, there are gaps in between the upper and lower limits of two
consecutive classes. To overcome this difficulty, we divide the intervals in such a way that
the upper and lower limits of consecutive classes are the same. For this, we find the
difference between the upper limit of a class and the lower limit of its succeeding class. We
than add half of this difference to each of the upper limits and subtract the same from each
of the lower limits. For example
Consider the classes 31-35 and 36-40
The lower limit of 36-40 is 36
The upper limit of 31-35 is 35
The difference = 36 – 35 = 1
1
So, half the difference = = 0.5
2
So, the new class interval formed from 31-35 is (31 – 0.5) – (35 + 0.5), i.e., 30.5 – 35.5.
Similarly, class 36-40 will be (36 – 0.5) – (40 + 0.5), i.e., 35.5 – 40.5 and so on.
This way, the new classes will be
30.5-35.5, 35.5-40.5, 40.5-45.5, 45.5-50.5, 50.5-55.5, 55.5-60.5, 60.5-65.5,
65.5-70.5 and 70.5-75.5. These are now continuous classes.
Note that the width of the class is again the same, i.e., 5. These changed limits are called
30.5-35.5 10
40.5-45.5 15
45.5-50.5 4
60.5-65.5 4
65.5-70.5 3
70.5-75.5 2
Total 52
Note: Here, in the above case, we could have also taken the classes as 30-35, 35-40,
40-45, ..., 65-70 and 70-75.
Example 24.1: Construct a frequency table for the following data which give the daily
wages (in rupees) of 32 persons. Use class intervals of size 10.
Earlier, we have discussed presentation of data by tables. There is another way to present Notes
the data called graphical representation which is more convenient for the purpose of
comparison among the individual items. Bar chart (graph) is one of the graphical
representation of numerical data. For example Fig 24.1 represents the data given in the
table regarding blood groups.
Blood groups of 35 students in a class
Blood Group Number of students
A 13
B 9
AB 6
O 7
Total 35
We can represent this data by Fig. 24.1
14
13
12
11
10
Number of Students
9
8
7
6
5
4
3
2
1
0
A B AB O
B l oo d G r ou p
Fig. 24.1
This is called a bar chart or bar graph.
Bars (rectangles) of unifoirm width are drawn with equal spaces in between them, on the
horizontal axis-called x-axis. The heights of the rectangles are shown along the vertical
axis-called y-axis and are proportional to their respective frequencies (number of students).
O
Blood Group
AB
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Number of Students
Fig. 24.2
There is not much difference between the bar graphs in Fig. 24.1 and Fig. 24.2 except that
it depends upon the person’s liking to represent data with vertical bars or with horizontal
bars. Generally vertical bar graphs are preferred.
Example 24.4: Given below (Fig. 24.3) is the bar graph of the number of students in
Class IX during academic years 2001-02 to 2005-06. Read the bar graph and answer the
following questions:
(i) What is the information given by the bar graph?
(ii) In which year is the number of students in the class, 250?
(iii) State whether true or false:
The enrolment during 2002-03 is twice that of 2001-02.
350
300
Notes
Number of Students
250
200
150
100
50
0
2001-02 2002-03 2003-04 2004-05 2005-06
Academic Year
Fig. 24.3
Solution:
(i) The bar graph represents the number of students in class IX of a school during academic
year 2001-02 to 2005-06.
(ii) In 2003-04, the number of students in the class was 250.
(iii) Enrolment in 2002-03 = 200
Enrolment in 2001-02 = 150
200 4 1
= =1 < 2
150 3 3
Therefore, the given statement is false.
Example 24.5: The bar graph given in Fig. 24.4 represents the circulation of newspapers
in six languages in a town (the figures are in hundreds). Read the bar graph and answer the
following questions:
(i) Find the total number of newspapers read in Hindi, English and Punjabi.
(ii) Find the excess of the number of newspapers read in Hindi over those of Urdu, Marathi
and Tamil together.
(iii) In which language is the number of newspapers read the least?
(iv) Write, in increasing order, the number of newspapers read in different languages.
800
700
Notes
600
Students
hundred’s)
500
(inof
400
Number
Newspapers
300
200
100
0
English Hindi Punjabi Urdu Marathi Tamil
Language
Fig. 24.4
Solution:
(i) Number of newspapers (in hundreds) read in Hindi, English and
Punjabi = 800 + 700 + 400 = 1900
(ii) Number of newspapers (in hundreds) read in Hindi = 800
Number of newspapers (in hundreds) in Urdu,
Marathi and Tamil = 200 + 300 +100 = 600
So, difference (in hundreds) = 800 – 600) = 200
(iii) In Tamil, the number of newspapers read is the least.
(iv) Tamil, Urdu, Marathi, Punjabi, English, Hindi
Construction of Bar Graphs
Solution:
Step 1: Take a graph paper and draw two perpendicular lines and call them horizontal Notes
and vertical axes (Fig. 24.5)
Step 2: Along the horizontal axis, represent the information ‘years’ and along the vertical
axis, represent the corresponding ‘loans (in crores of rupees)’.
Step 3: Along the horizontal axis, choose a uniform (equal) width of bars and a uniform
gap between them, according to the space available.
Step 4: Choose a suitable scale along the vertical axis in view of the data given to us.
Step 5: Calculate the heights of the bars for different years as given below:
1
2000 : × 25 = 2.5 units
10
1
2001 : × 30 = 3 units
10
1
2002 : × 40 = 4 units
10
1
2003 : × 55 = 5.5 units
10
1
2004 : × 60 = 6 units
10
Step 6: Draw five bars of equal width and heights obtained in Step 5 above, the
corresponding years marked on the horizontal axis, with equal spacing between
them as shown in Fig. 24.5.
70
Notes
60
40
30
20
10
0
2000 2001 2002 2003 2004
Years
Fig. 24.5
Thus, Fig. 24.5 gives the required bar graph.
Example 24.7: The data below shows the number of students present in different classes
on a particular day.
Class VI VII VIII IX X
Number of students present 40 45 35 40 50
Represent the above data by a bar graph.
Solution: The bar graph for the above data is shown in Fig. 24.6.
60
Number of students present
50
40
30
20
10
0
VI VII VIII IX X
Classes
Fig. 24.6
60
55
50
45
Number of students
40
35
30
25
20
15
10
5
0
Playing Reading story Watching T V List ening to Paint ing
books music
Pre ferred activity
Fig. 24.7
6
Notes
5
0
Bus Car Bicycle Foot
Mode of transport
Fig. 24.8
Study the bar graph and answer the following questions:
(i) How many members of staff come to school on bicycle?
(ii) How many member of staff come to school by bus?
(iii) What is the most common mode of transfport of the members of staff?
3. The bar graph given below shows the number of players in each team of 4 given
games:
Volleyball
T able T ennis
Games
Football
Basket ball
0 1 2 3 4 5 6 7 8 9 10 11 12
Number of players
Fig. 24.9
1600
1400
1200
Number of trees planted
1000
800
600
400
200
0
2003 2004 2005 2006 2007 2008
Years
Fig. 24. 10
Study the above bar graph and answer the following questions:
(i) What is the total number of trees planted by the agency from 2003 to 2008?
(ii) In which year is the number of trees planted the maximum?
(iii) In which year is the number of trees planted the minimum?
(iv) In which year, the number of trees planted is less than the number of trees planted in
the year preceding it?
5. The expenditure of a company under different heads (in lakh of rupees) for a year is
given below:
Head Expenditure (in lakhs of rupees)
Salary of employees 200
Travelling allowances 100
Electricity and water 50
Rent 125
Others 150
Construct a bar chart to represent this data.
20 30 40 50 80 70 80
Fig. 24.11
Fig. 24.11 shows the histogram for the frequency distribution of marks obtained by 20
students in a class test.
Example 24.10: Draw a histogram for the following data:
Notes
Height 125-130 130-135 135-140 140-145 145-150 150-155 155-160
(in cm)
Number of 1 2 3 5 4 3 2
students
Solution: Following the steps as suggested in the above example, the histogram representing
the given data is given below:
Fig. 24.12
Frequency Polygon
There is yet another way of representing a grouped frequency distribution graphically. This
is called frequency polygen. To see what we mean, consider the histogram in Fig. 24.13.
E
B
A
20 30 40 50 80 70 80
H
Fig. 24.13
D G
C H
A I
Fig. 24.14
Example 24.12: Marks (out of 50) obtained by 30 students of Class IX in a mathematics
test are given in the following table:
Marks 0-10 10-20 20-30 30-40 40-50
Number of students 5 8 6 7 4
Draw a frequency polygon for this data.
Solution: Let us first draw a histogram for this data (Fig. 24.15)
Mark the mid points B, C, D, E and F of the tops of the rectangles as shown in Fig. 24.15.
Here, the first class is 0-10. So, to find the class preceding 0-10, we extend the horizontal
axis in the negative direction and find the mid point of the imaginary class (–10)-0. Let us
C
E
D
Notes
B F
A
I
H G
O 10 20 30 40 50 60
Fig. 24.15
join B to the mid point of the class (015010)-0. Let A be the mid point where this line
segment meets the vertial axis. Let G be the mid point of the class 50-60 (succeeding the
last class). Let the line segment FG intersects the length of the last rectangle at I (Fig.
24.15). Then OABCDEFIH is the required frequency polygen representing the given
data.
Note: Why have we not taken the points before O and G? This is so because marks
obtained by the students cannot go below 0 and beyond maximum marks 50. In the figure,
extreme line segments are only partly drawn and then brought down vertically to 0 and 50.
Frequency polygon can also be drawn independently without drawing histogram. We will
illustrate it through the following example.
Example 24.13: Draw a frequency polygon for the data given in Example 24.9, without
drawing a histogram for the data.
Solution: To draw a frequency polygon without drawing a histogram, we go through the
following steps.
Step 1: Draw two lines perpendicualar to each other.
Step 2: Find the class marks of the classes.
20 + 30 30 + 40 40 + 50 50 + 60 60 + 70 70 + 80
Here they are: , , , , and
2 2 2 2 2 2
i.e. the class marks are 25, 35, 45, 55, 65 and 75 respectively.
Step 3: Plot the points B (25, 1), C(35, 3), D(45, 1), E(55, 6), F(65, 4) and G(75, 5),
i.e., (class mark, frequency)
Step 4: Join the points B, C, D, E, F and G by line segments and complete the polygon
as explained earlier.
E (55, 6)
Notes
G (75, 5)
F (65, 4)
Frequency
C (35, 3)
B (25, 1) D (45, 1)
A
20 30 40 50 60 70 80
H
Fig. 24.16
Reading a Histogram
Consider the following example:
Example 24.14: Study the histogram given below and answer the following questions:
20 25 30 35 40 45 50 55
Fig. 24.17
(i) What is the number of teachers in the oldest and the youngest group in the school?
(ii) In which age group is the number of teachers maximum?
(v) How many students have the height more than or equal to 140 but less than 155?
Notes
Fig. 24.18
LET US SUM UP
• Statistics is that branch of mathematics which deals with collection, organisation, analysis
and interpretation of data.
• Statistics is used in both plural and singular sense.
• The data collected from the respondents “as it is” is called raw data.
• Data are said to be primary if the investigator himself collects it through his/her own
designed tools.
• Data taken from other sources such as printed reports, and not collected by the
experimenter himself, is called secondary data.
• The raw data arranged in ascending or decending order is called “arrayed data”.
• When the arrayed data are arranged with frequencies, they are said to form a frequency
table for ungrouped data or a ungrouped frequency distribution table.
• When the data are divided into groups/classes, they are called grouped data.
• The difference between the maximum and minimum observations occuring in the data
is called the range of the raw data.
• The number of classes have to be decided according to the range of the data and size
of class.
TERMINAL EXERCISE
1. Fill in the blanks by appropriate words/phrases to make each of the following statements
true:
(i) When the data are condensed in classes of equal size with frequencies, they are
called ________ data and the table is called _______ table.
(ii) When the class limits are adjusted to make them continuous, the class limits are
renamed as ________.
(iii) The number of observations falling in a particular class is called its _______.
(iv) The difference between the upper limit and lower limit of a class is called
_________.
(v) The sum of frequencies of a class and all classes prior to that class is called
________ frequency of that class.
(vi) Class size = Difference between ________ and _____ of the class.
(vii) The raw data arranged in ascending or descending order is called an _______
data.
Notes
(viii) The difference between the maximum and minimum observations occuring in the
data is called the _________ of the raw data.
2. The number of TV sets in each of 30 households are given below:
1, 2, 2, 4, 2, 1, 1, 1, 2, 1, 3, 1, 1, 1, 3
1, 2, 2, 1, 2, 0, 3, 3, 1, 2, 1, ,1 0, 1, 1
Construct a frequency table for the data.
3. The number of vehicles owned by each of 50 families are listed below:
2, 1, 2, 1, 1, 1, 2, 1, 2, 1, 0, 1, 1, 2, 3, 1, 1, 1,
2, 2, 1, 1, 3, 1, 1, 2, 1, 0, 1, 2, 1, 2, 1, 1, 4, 1
3, 1, 1, 1, 2, 2, 2, 2, 1, 1, 3, 2, 1, 2
Construct a frequency distribution table for the data.
4. The weight (in grams) of 40 New Year’s cards were found as:
10.4 6.3 8.7 7.3 8.8 9.1 6.7 11.1 14.0 12.2
11.3 9.4 8.6 7.1 8.4 10.0 9.1 8.8 10.3 10.2
7.3 8.6 9.7 10.9 13.6 9.8 8.9 9.2 10.8 9.4
6.2 8.8 9.4 9.9 10.1 11.4 11.8 11.2 10.1 8.3
Prepare a grouped frequency distribution using the class 5.5-7.5, 7.5-9.5 etc.
5. The lengths, in centimetres, to the nearest centimeter of 30 carrots are given below:
15 21 20 10 18 18 16 18 20 20
18 16 13 15 15 16 13 14 14 16
12 15 17 12 14 15 13 11 14 17
Construct a frequency table for the data using equal class sizes and taking one class as
10-12 (12 excluded).
6. The following is the distribution of weights (in kg) of 40 persons:
Fig. 24.19
(i) What is the information given by the bar graph?
(ii) On which day is number of students born the maximum?
(iii) How many more students were born on Thursday than that on Tuesday.
(iv) What is the total number of students in the class?
10 15 20 25 30 35 40
Fig. 24.20
Study the above histogram and answer the following questions:
(i) What was the total number of literate females in the town in the age group 10 to 40?
(ii) In which age group, the number of literate females was the highest?
(iii) In which two age groups was the number of literate females the same?