GCE As Level Representation of Data Histograms
SMIYL
April 2023
Histogram
A histogram is used to represent grouped continuous data. However,
it does not show all the data points. It consists of bars of different
widths joined together. There are no gaps between the bars. This
is the difference between a bar chart and a histogram. Data for
a histogram is usually displayed in the form of a class (a range of
values), with its respective frequency. We can use that information
to help us calculate the information we need to draw a histogram:
• class width
• frequency density
The modal class is the class with the highest frequency density, i.e. the
tallest bar on the histogram.
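The two quantities listed above can be sketched in a few lines of Python. This is only an illustration: it assumes the standard relation frequency density = frequency ÷ class width, uses the class boundaries and frequencies from the chocolate-bar question later in these notes, and the function name `frequency_densities` is made up for the example. The modal class is read off as the tallest bar.

```python
# Class boundaries and frequencies from the chocolate-bar question in these notes.
boundaries = [0.5, 10.5, 15.5, 30.5, 50.5, 60.5]
frequencies = [18, 24, 30, 20, 8]


def frequency_densities(boundaries, frequencies):
    """Return (class widths, frequency densities) for grouped data.

    Assumes frequency density = frequency / class width.
    """
    widths = [hi - lo for lo, hi in zip(boundaries, boundaries[1:])]
    densities = [f / w for f, w in zip(frequencies, widths)]
    return widths, densities


widths, densities = frequency_densities(boundaries, frequencies)

# The tallest bar (largest frequency density) marks the modal class.
tallest = densities.index(max(densities))
modal_class = (boundaries[tallest], boundaries[tallest + 1])

print(widths)
print(densities)
print(modal_class)
```

Because the bar heights are frequency densities, the area of each bar (width × height) gives back the frequency of that class.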
Median
To calculate the median, we use the formula,
\[ q_2 = \frac{1}{2} n \]
Note: Since the data is grouped, the individual values are not known, so
we cannot find the exact value of the median, but we can find the median class.
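Finding the median class amounts to accumulating the frequencies until the position n/2 is reached. A minimal sketch, assuming the chocolate-bar data from later in these notes; the helper name `class_containing` is hypothetical:

```python
from itertools import accumulate

# Class boundaries and frequencies from the chocolate-bar question in these notes.
boundaries = [0.5, 10.5, 15.5, 30.5, 50.5, 60.5]
frequencies = [18, 24, 30, 20, 8]


def class_containing(position, boundaries, frequencies):
    """Return the (lower, upper) boundaries of the class holding `position`."""
    for i, cumulative in enumerate(accumulate(frequencies)):
        if position <= cumulative:
            return boundaries[i], boundaries[i + 1]
    raise ValueError("position exceeds the total frequency")


n = sum(frequencies)      # total frequency
median_position = n / 2   # q2 = n / 2
median_class = class_containing(median_position, boundaries, frequencies)

print(median_position, median_class)
```

The cumulative frequencies here are 18, 42, 72, 92, 100, so position 50 falls in the third class.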
Lower quartile
Upper quartile
Interquartile Range
IQR = q3 − q1
Mean
To calculate the mean when data is displayed in the form of a histogram,
we first need to find the mid-interval value. This is the middle value of
each class, i.e. the midpoint. We use the formula,
\[ \bar{x} = \frac{\sum xf}{\sum f} \]
where x is the midpoint of each class and f is its frequency.
Standard Deviation
Class width          10     10     20     20     40
Frequency density    1.6    5.4    3.9    1.6    0.5
Plot the classes on the x-axis, ensuring that each bar has the
corresponding class width. Then plot the frequency density
on the y-axis. Label the x-axis with the class name 'Time (t
seconds)'. Label the y-axis with 'Frequency density'.
[Figure: histogram of frequency density (y-axis) against time (t seconds) (x-axis), with the x-axis marked at 10, 20, 40, 60 and 100.]
(b) Calculate an estimate for the mean time taken by these 200 players.
Substitute into the formula,
\[ \bar{x} = 32.15 \]
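The substitution can be checked with a short Python sketch. It assumes the class boundaries are 0, 10, 20, 40, 60 and 100 seconds (inferred from the class widths and the x-axis of the histogram, not stated explicitly in these notes) and recovers each frequency as class width × frequency density:

```python
# Assumed class boundaries, inferred from the class widths and x-axis marks.
boundaries = [0, 10, 20, 40, 60, 100]
densities = [1.6, 5.4, 3.9, 1.6, 0.5]

widths = [hi - lo for lo, hi in zip(boundaries, boundaries[1:])]

# frequency = class width x frequency density.
# round() removes floating-point noise; frequencies are whole numbers of players.
frequencies = [round(d * w) for d, w in zip(densities, widths)]

# The mid-interval value of each class stands in for every value in that class.
midpoints = [(lo + hi) / 2 for lo, hi in zip(boundaries, boundaries[1:])]

total = sum(frequencies)
mean = sum(x * f for x, f in zip(midpoints, frequencies)) / total

print(frequencies, total, mean)
```

Under these assumptions the recovered frequencies add up to the 200 players in the question, which is a useful check on the inferred boundaries.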
(c) Find the greatest possible value of the interquartile range of these
times.
IQR = q3 − q1
q3 = 60
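Let’s find the minimum value of the lower quartile. Its position is
\[ \frac{1}{4} n = \frac{1}{4}(200) = 50 \]
so the lower quartile is the 50th value. This lies in the class from 10 to
20 seconds, so its least possible value is
\[ q_1 = 10 \]
Using the greatest possible value of the upper quartile, q3 = 60,
\[ \text{IQR} = 60 - 10 = 50 \]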
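The same reasoning can be sketched in Python: locate the classes that contain the quartile positions, then take the upper boundary of the q3 class and the lower boundary of the q1 class. The boundaries below are assumed (inferred from the class widths and the histogram axis), the frequencies are class width × frequency density, the position 3n/4 for the upper quartile is the usual convention rather than something stated in these notes, and `class_containing` is a hypothetical helper name.

```python
from itertools import accumulate

boundaries = [0, 10, 20, 40, 60, 100]   # assumed class boundaries (seconds)
frequencies = [16, 54, 78, 32, 20]      # class width x frequency density


def class_containing(position, boundaries, frequencies):
    """Return the (lower, upper) boundaries of the class holding `position`."""
    for i, cumulative in enumerate(accumulate(frequencies)):
        if position <= cumulative:
            return boundaries[i], boundaries[i + 1]
    raise ValueError("position exceeds the total frequency")


n = sum(frequencies)
q1_class = class_containing(n / 4, boundaries, frequencies)
q3_class = class_containing(3 * n / 4, boundaries, frequencies)

# Greatest IQR: largest possible q3 minus smallest possible q1.
greatest_iqr = q3_class[1] - q1_class[0]

print(q1_class, q3_class, greatest_iqr)
```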
2. The numbers of chocolate bars sold per day in a cinema over a period of
100 days are summarised in the following table. (9709/51/M/J/20 number
7)
No. of chocolate bars sold    0.5 − 10.5    10.5 − 15.5    15.5 − 30.5    30.5 − 50.5    50.5 − 60.5
No. of days                   18            24             30             20             8
Plot the classes on the x-axis, ensuring that each bar has the
corresponding class width. Then plot the frequency density
on the y-axis. Label the x-axis with the class name 'Number
of chocolate bars sold'. Label the y-axis with 'Frequency
density'.
[Figure: histogram of frequency density (y-axis) against number of chocolate bars sold (x-axis).]
The formula for interquartile range is,
IQR = q3 − q1
(c) Calculate estimates of the mean and standard deviation of the number
of chocolate bars sold.
\[ \bar{x} = 23.55 \]
Let’s start by finding $\frac{\sum x^2 f}{\sum f}$,
\[ \frac{\sum x^2 f}{\sum f} = \frac{5.5^2(18) + 13^2(24) + 23^2(30) + 40.5^2(20) + 55.5^2(8)}{100} \]
\[ \frac{\sum x^2 f}{\sum f} = \frac{77\,917.5}{100} \]
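Let’s substitute into the formula,
\[ \sigma = \sqrt{\frac{\sum x^2 f}{\sum f} - \bar{x}^2} \]
\[ \sigma = \sqrt{\frac{77\,917.5}{100} - (23.55)^2} \]
\[ \sigma = 14.98574322 \]
\[ \sigma = 15.0 \]
\[ \bar{x} = 23.55, \quad \sigma = 15.0 \]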
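As a check on the arithmetic, both estimates can be reproduced in Python from the table of classes and frequencies. This is a sketch rather than a required method; the variable names are illustrative, and the midpoints are taken as the averages of the class boundaries.

```python
from math import sqrt

# Class boundaries and frequencies from the table in this question.
boundaries = [0.5, 10.5, 15.5, 30.5, 50.5, 60.5]
frequencies = [18, 24, 30, 20, 8]

# Mid-interval values: 5.5, 13, 23, 40.5, 55.5.
midpoints = [(lo + hi) / 2 for lo, hi in zip(boundaries, boundaries[1:])]

n = sum(frequencies)
mean = sum(x * f for x, f in zip(midpoints, frequencies)) / n

# Sum of x^2 f divided by sum of f, then subtract the squared mean.
mean_of_squares = sum(x * x * f for x, f in zip(midpoints, frequencies)) / n
sigma = sqrt(mean_of_squares - mean ** 2)

print(round(mean, 2), round(sigma, 1))
```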