2020 - Statistics 1 - Session 2
2020 - Statistics 1 - Session 2
When we have a lot of categories, we sometimes may need to group the data. It is
often better to display the data in a table. This section will look at different ways to
organise data, but first here are a few things to note. Please take note of the terms
which have been highlighted.
There are two types of frequency distribution:
a) Inclusive - In inclusive type of frequency distribution, the upper limit of
one class does not coincide with the lower limit of the next class.
b) Exclusive - In exclusive type of frequency distribution, the upper limit of
one class coincides with the lower limit of the next class.
The following example illustrates the inclusive type of frequency distribution.
Consider the following ungrouped marks (out of 50) given to 30 students:
The range is the difference between the largest and the smallest observations in a
sample. For the above ungrouped data the range is 49 - 12 = 37.
Consider the class 11 - 15. The lower and upper limits of a class interval are
known as class limits. If a student scores 11 marks or 15 marks, he will be put in
this class. For this class, 11 is the lower limit and 15 is the upper limit and both
are included in the class; hence this distribution is inclusive.
But in case of 'exclusive' frequency, one of the limits of the class is excluded
from the class; the above frequency distribution can be reformed in the following
exclusive way also:
(11 - 15) is . Similarly, class midpoints for other intervals are 18, 23,
28…
∑ 𝒇𝒙 ∑ 𝒇𝒙
𝑴𝒆𝒂𝒏 (𝒙
̅) = =
∑𝒇 𝒏
We already know how to find the mean from a frequency table. Finding the mean
for grouped or continuous data is very similar.
The grouped frequency table shows the number of CDs bought by a class of
children in the past year.
The groups 10-19, 20-29, 30-39 and 40-49 are called classes.
Frequency
Number of CDs
(f)
0-4 10
5-9 12
10-14 6
15-19 2
>19 0
We know that 10 children have bought either 0, 1, 2, 3 or 4 CDs, but we do not
know exactly how many each child bought.
If we assumed that each child bought 4 CDs, it is likely that our estimate of the
mean would be too big.
If we assumed that each child bought 0 CDs, it is likely that our estimate would be
too small.
It therefore seems sensible to use the mid-point of the group and assume that each
child bought 2.
Finding the mid-points of the other groups, we get:
Mid-point, f x
Number of CDs Frequency
x
0-4 10 2 20
5-9 12 7 84
10-14 6 12 72
15-19 2 17 34
>19 0 - 0
2. The table below shows the distribution of weight of 100 adults measured to
the nearest kilogram: