0% found this document useful (0 votes)
11 views15 pages

WS 23 Statistics Lesson 1

This worksheet for Grade 12 covers essential statistical concepts including histograms, frequency polygons, ogives, variance, standard deviation, and the identification of outliers. It provides detailed instructions on how to create histograms, calculate standard deviation using a scientific calculator, and interpret box and whisker plots. Additionally, it includes worked examples to illustrate the application of these concepts in analyzing data sets.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views15 pages

WS 23 Statistics Lesson 1

This worksheet for Grade 12 covers essential statistical concepts including histograms, frequency polygons, ogives, variance, standard deviation, and the identification of outliers. It provides detailed instructions on how to create histograms, calculate standard deviation using a scientific calculator, and interpret box and whisker plots. Additionally, it includes worked examples to illustrate the application of these concepts in analyzing data sets.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

GRADE 12 – WORKSHEET 23

LESSON 1 - STATISTICS
OUTCOMES:
1. Histograms
2. Frequency polygons
3. Ogives (cumulative frequency curves)
4. Variance and standard deviation of ungrouped data
5. Symmetric and skewed data
6. Identification of outliers.

INTRODUCTION - OVERVIEW OF BASIC CONCEPTS


HISTOGRAMS AND FREQUENCY POLYGONS
A histogram gives us a visual interpretation of data. It looks very similar to a bar graph, but
there are definite differences between them.

HISTOGRAM BAR GRAPH


 It is a representation of grouped data.  It is a representation of ungrouped
 There is no gap between the bars. data that does not have to be
numerical.
 Presents CONTINUOUS data.
 There is generally a gap between the
 For example, you draw a bars.
HISTOGRAM to show the number of
people whose heights (h) lie in the  Displays CATEGORICAL data
following intervals (measured in cm):  For example you draw a BAR
150 ≤ h < 160; 160 ≤ h < 170; etc GRAPH to show the number of
learners in a class who wear glasses
and the number who do not wear
glasses.

A HISTOGRAM SHOULD HAVE THE FOLLOWING:


 A title which describes the information that is contained in the histogram.
 A horizontal axis with a label which shows the scale of values into which the data fit
(grouped data intervals)
 A vertical axis with a label which shows the number of times the data within the interval
occurred (frequency)
 Adjacent bars (i.e. there are no gaps between the bars).

PORTLAND HS Page 1 of 15
GRADE 12 – WORKSHEET 23
HOW TO COMPLETE A TALLY TABLE (FREQUENCY TABLE)
 A tally is a way of collecting information by making an appropriate mark for each item.
 A line is drawn for each item counted:
 Every fifth line is drawn across the other four : . This makes it easy to add up the
number of items checked.
HOW TO DRAW A HISTOGRAM
 Label the horizontal and vertical axes.
 Represent the frequency on the vertical axis and the classes on the horizontal axis.
 Use the frequencies (number of learners) as the height of the vertical bars for each class.
OGIVES (CUMULATIVE FREQUENCY CURVES)
 Cumulative frequency gives us a running total of the frequency, so, we keep adding onto
the frequency from the first interval to the last interval.
 We can show this results in a cumulative frequency table.
 We can represent the cumulative results from a cumulative frequency table with a
cumulative frequency graph or ogive.
 This graph always starts on the 𝒙-axis and usually forms an 𝑆-shaped curve, ending with
the cumulative frequency (𝑦-value)The endpoint of each interval is plotted against the
cumulative frequency.
VARIANCE AND STANDARD DEVIATION
Sometimes the mean is a more useful measure of central tendency than the median. The
measure of dispersion (spread) around the mean are called the VARIANCE and the
STANDARD DEVIATION. The standard deviation is the measure of spread most
commonly used in statistical practice when the mean is used to calculate central tendency.
The standard deviation can be difficult to interpret as a single number on its own. Basically, a
small standard deviation means that the values in a statistical data set are on average close to
the mean data set. A large standard deviation means that the value in the data set is on
average further away from the mean.
The standard deviation measures how concentrated that data are around the mean – the more
concentrated, the smaller the standard deviation.
STEPS FOR CALCULATING THE STANDARD DEVIATION WITH A
SCIENTIFIC CALCULATOR:
Using a CASIO fx-82 ES PLUS calculator:

PORTLAND HS Page 2 of 15
GRADE 12 – WORKSHEET 23
 Press MODE then STAT then 1 – VAR
 Enter all data one at a time pressing = after each entry.
 Press the orange AC button.
 Press SHIFT then 1
 Press 4 for VAR.
 In order to calculate the mean press 2:𝑥̂ and press =
 Once all these steps have been completed, simply press AC SHIFT STAT then VAR.
 Now press 𝟑: 𝝈 to calculate the standard deviation.
The INTERQUARTILE RANGE measures a spread round the MEAN, so it has to do with
the positions of data and NOT their actual values.
The STANDARD DEVIATION measures a spread around the MEAN, using the actual
valus of the data and NOT JUST THERE POSITIONS.
SYMMETRIC AND SKEWED DATA (BOX AND WHISKER PLOT)
Average (Mean): Add up all the numbers and divide by the amount of numbers.
Range: Highest number minus lowest number.
Mode (Modal): The MOST COMMON VALUE in a data set. We find the MODE by
looking for the values that are repeated. It is possible to have:
 One mode
 Two modes
 More than two modes
 No mode
Median: When the data is arranged in ascending order, the MEDIAN is the MIDDLE
VALUE. If there are TWO MIDDLE VALUES, the median is half way between the two
middle numbers (add the two numbers and divide by 2). The median divides the data into two
halves.

 Quartiles are the three values Q1, Q2 and Q3 that divide a data set into four
approximately equal parts. Each part consists of approximately 25% of the elements of
the data set.
 Q1 is the lower quartile; Q2 is the middle quartile or median and Q3 is the upper
quartile.

PORTLAND HS Page 3 of 15
GRADE 12 – WORKSHEET 23
 The median divides an ordered data set into two halves.
 The median is also the 2nd quartile (M or Q2).
HOW TO FIND THE QUARTILES:
1. Put the data items in order (MUST BE ASCENDING ORDER) and find the median.
2. Find the midpoint of the data items to the left of the median. This is the lower quartile
(Q1).
3. Find the midpoint of the data items to the right of the median. This is the upper
quartile (Q3).
As you know quartiles divide data into four equal sets of data. The longer whisker of the box
and whisker diagram means that the data is more spread than the shorter whisker.

A BOX AND WHISKER PLOT can show whether a data set is symmetrical, positively
skewed of negatively skewed. This box and whisker plot is NOT symmetrical because the
whiskers are not the same length and the median is not in the centre of the box. The whisker
on the left is longer than the whisker on the right, which shows that the data on the left of the
box is more spread out. The box is longer to the right of the median than the left of the
median. We say that the data is POSITIVELY SKEWED or SKEWED to the RIGHT.

IDENTIFICATION OF OUTLIERS.
In statistics, an outlier is a data point that differs significantly from other observations.
Inter-quartile range = 𝑄3 − 𝑄1
How to determine outliers:

PORTLAND HS Page 4 of 15
GRADE 12 – WORKSHEET 23
First determine the interquartile range
Determine: 𝑄1 − 1,5 × 𝐼𝑄𝑅
If the minimum is less than the value of 𝑸𝟏 − 𝟏, 𝟓 × 𝑰𝑸𝑹, then it is an outlier
Determine: 𝑄3 + 1,5 × 𝐼𝑄𝑅
If the maximum is more than the value of 𝑸𝟑 + 𝟏, 𝟓 × 𝑰𝑸𝑹, then it is an outlier
WORKED EXAMPLE 1: FREQUENCY TABLE AND HISTOGRAM
The following table lists the marks (given as percentage) obtained by the Grade 11
learners of Musi High School in their mathematics test:

24 70 50 22 63 45 48 52 56 38

65 68 65 17 32 60 62 53 63 45

49 44 56 12 55 83 54 22 67 54

34 77 46 50 58 80 81 39 84 75

55 76 73 80 66 71 62 40 23 76

a) Organise the data using a grouped frequency table


Tally Frequency
Percentages (Number of
(t) learners)

10≤ t < 20 2
20 ≤ t < 30 / 4
30 ≤ t < 40 4
40 ≤ t < 50 7
50 ≤ t < 60 11
60 ≤ t < 70 10
70 ≤ t < 80 / 7
80 ≤ t < 90 5
TOTAL 50

PORTLAND HS Page 5 of 15
GRADE 12 – WORKSHEET 23
b) Draw a histogram to illustrate the data

Mathematics test marks


12

10
Number of learners

0
10 ≤ t < 20 20 ≤ t < 30 30 ≤ t < 40 40 ≤ t < 50 50 ≤ t < 60 60 ≤ t < 70 70 ≤ t < 80 80 ≤ t < 90
Percentages

WORKED EXAMPLE 2: CUMULATIVE FREQUENCY GRAPH (OGIVE)


In an English class, 30 learners completed a test out of 20 marks. Here is a list of their
results.

14 10 11 19 15 11 13 11 9 11 12 17 10 14 13

17 13 13 9 12 16 6 9 11 11 13 20 7 14 17

Mark out Frequency Cumulative


Tally
of 𝟐𝟎 (number of learners) frequency
6 1 1
7 1 1+1=2
8 0 2+0=2
9 3 2+3=5
10 2
11 6
12 2
13 5
14 3
15 1
16 1
17 3
18 0
PORTLAND HS Page 6 of 15
GRADE 12 – WORKSHEET 23
19 1
20 1 29+1=30
NOTES:
 Keep adding onto frequency from row before. For example, 7 + 6 = 13
 The last number is the same as total number of learners, in this case 30.
With this data set, it would be more useful to group the data. We can use intervals of 5 and
make a cumulative frequency table for grouped data.

Frequency
Class interval Cumulative frequency
(how many learners)
0<𝑥≤5 0 0
5 < 𝑥 ≤ 10 7 7
10 < 𝑥 ≤ 15 17 24
15 < 𝑥 ≤ 20 6 30
Represent the data in the cumulative frequency table of grouped data with a cumulative
frequancy graph (OGIVE)
 The 𝑥-axis needs the points 5; 10; 15 and 20 to mark the end of each interval.
 The 𝑦-axis represents the cumulative frequency from 0 to 30.
 For plotting the points, use the end of each class interval on the 𝑥-axis and the
cumulative frequancy on the 𝑦-axis. You need to plot these following points:
(0 ; 0) , (5 ; 0) , (10; 7) , (15 ; 24) , (20 ; 30)
 Join the plotted points.

PORTLAND HS Page 7 of 15
GRADE 12 – WORKSHEET 23
NOTES:
 𝒙-coordinate – use upper limit of each interval.
 𝒚-coordinate – cumulative frequency.
 If the frequency of the first interval is not 𝟎, then include an interval before the
given one and make use of 𝟎 as its frequency.
WORKED EXAMPLE 3: VARIANCE AND STANDARD DEVIATION
These are the results of a mathematics test for a Grade 11 class of 20 students.

52 44 62 66 60 57 95 78 71 62

100 69 62 72 73 55 32 83 78 80

1. Calculate the mean mark for the class.


Mean = 67,55
2. Calculate the standard deviation of the marks
Standard deviation = 15,75
3. What percentage of the students scored within one standard deviation of the mean?

𝟓𝟏, 𝟖 𝟔𝟕, 𝟓𝟓 𝟖𝟑, 𝟑


16 scores lie in the interval (51,8 ; 83,3)
16
16 out of 20 of the marks lie within one standard deviation of the mean; × 100 =
20
80%

WORKED EXAMPLE 4: SKEWED DATA AND OUTLIERS


These are the scores of students in a Science test:

90 85 10 75 70 60 78 80 82 80 55 84

a) Draw a box and whisker diagram for the given data.


Work out the FIVE NUMBER SUMMARY:
10 ; 55 ; 60 ; 70 ; 75 ; 78 ; 80 ; 80 ; 82 ; 84 ; 85 ; 90
Minimum number: 10
Maximum number: 90
78+80
Median (𝑄2 ): = 79
2
60+70
Lower quartile(𝑄1 ): = 65
2
82+84
Upper quartile(𝑄3 ): = 83
2

PORTLAND HS Page 8 of 15
GRADE 12 – WORKSHEET 23

b) Determine the interquartile range.


Interquartile range = upper quartile – lower quartile
IQR = 83 – 65
IQR = 18

c) State whether the data is skewed or not.


The data is skewed to the left (negatively skewed)

d) State whether 10 is an outlier or not.


IQR = 18
Determine: 𝑄1 − 1,5 × 𝐼𝑄𝑅
Determine: 65 − 1,5 × 18
Determine: 38
10 is smaller than 38 ∴ 10 is an outlier

HOMEWORK ACTIVITY
1 The data below shows the number of people visiting a local clinic per day to be
vaccinated against measles.

5 12 19 29 35 23 15 33 37 21

26 18 23 18 13 21 18 22 20

1.1 Determine the mean of the given data. 𝟐𝟏, 𝟒𝟕

1.2 Calculate the standard deviation of the data. 𝟕, 𝟖𝟏

1.3 Determine the number of people vaccinated against measles that lies within ONE
standard deviation of the mean. 𝟏𝟑

1.4 Determine the interquartile range for the data. 𝟖

1.5 Draw a box and whisker diagram to represent the data and comment on the
skewness of the data. Skewed to the right

PORTLAND HS Page 9 of 15
GRADE 12 – WORKSHEET 23
1.6 Identify any outliers in the data set. Substantiate your answer. YES, 𝟓 is an outlier

2 15 households were surveyed in suburb A to find out how much each one spent on
electricity for a ten-day period. The results in rand are:

90 102 50 125 141 220 196 78 137 142

123 157 118 165 121

2.1 Determine the median for the above data? 𝒎𝒆𝒅𝒊𝒂𝒏 = 𝟏𝟐𝟓

2.2 Determine the upper and lower quartiles. 𝑸𝟏 = 𝟏𝟎𝟐 and 𝑸𝟑 = 𝟏𝟓𝟕

2.3 Draw a box and whisker diagram for the data and comment on the skewness of the
data. Skewed to the right

2.4 Determine the inter quartile range. 𝑰𝑸𝑹 = 𝟓𝟓

2.5 Use a calculator to determine the standard deviation for this data. 𝑺𝑫 = 𝟒𝟐, 𝟎𝟑

2.6 How many learners fall within one standard deviation from the mean. 11

2.7 Determine whether the data set contains any outliers. Substantiate your answer.
No outliers

3 Below are the percentage scores that 15 learners obtained in a Physical Science
Examination
72 57 63 81 60 51 96 66 78 54 39 69 90
30 39

3.1 Determine the median for the above data?

3.2 Determine the upper and lower quartiles.

3.3 Draw a box and whisker diagram for the data and comment on the skewness of the
data.

3.4 Determine the inter quartile range.

3.5 Use a calculator to determine the standard deviation for this data.

3.6 How many learners fall within one standard deviation from the mean.

3.7 Determine whether the data set contains any outliers.

PORTLAND HS Page 10 of 15
GRADE 12 – WORKSHEET 23
4 The table below shows the distances (in kilometres) travelled daily by a sales
representative for 21 working days in a certain month.

131 132 140 140 141 144 146

147 149 150 151 159 167 169

169 172 174 175 178 187 189

4.1 Calculate the mean distance travelled by the sales representatives.

4.2 Write down the five-number summary for this set of data

4.3 Draw a box and whisker diagram for the data and comment on the skewness of the
data.

4.4 Calculate the standard deviation of the distance travelled.

5 The speeds of 55 cars passing through a certain section of a road are monitored for
one hour. The speed limit on this section of the road is 60 km per hour. A
histogram is drawn to represent the data.

5.1 Identify the modal class of data. between 𝟓𝟎 and 𝟔𝟎

5.2 Use the histogram to:

5.2.1 Complete a cumulative frequency table.

PORTLAND HS Page 11 of 15
GRADE 12 – WORKSHEET 23

5.2.2 Draw an ogive (cumulative frequency graph) of the above data.

PORTLAND HS Page 12 of 15
GRADE 12 – WORKSHEET 23
6 The table below shows the amount of time (in hours) that learners aged between
14 and 18 spent watching television during 3 weeks of the holiday.

6.1 Draw an ogive (cumulative frequency curve) to represent the above data.

6.2 Write down the modal class of the data. 𝟒𝟎 ≤ 𝒕 < 𝟔𝟎

6.3 Use the ogive to estimate the number of learners who watched television more
than 80% of the time. ∴ 𝟏𝟕𝟐 − 𝟏𝟔𝟒 = 𝟖 Learners

PORTLAND HS Page 13 of 15
GRADE 12 – WORKSHEET 23
6.4 Estimate the mean time (in hours) that learners spent watching television during 3
weeks of the holiday. 𝟒𝟔, 𝟓𝟏 hours

7 A company recorded the number of messages sent by e-mail; over a period of 60


working days. The data is shown in the table below.

Number of messages Number of days


10 < 𝑥 ≤ 20 2
20 < 𝑥 ≤ 30 8
30 < 𝑥 ≤ 40 5
40 < 𝑥 ≤ 50 10
50 < 𝑥 ≤ 60 12
60 < 𝑥 ≤ 70 18
70 < 𝑥 ≤ 80 3
80 < 𝑥 ≤ 90 2
7.1 Estimate the mean number of messages sent per day, rounded off to TWO decimal
places. 𝟓𝟏, 𝟑𝟑 messages per day

7.2 Draw a cumulative frequency graph (ogive) of the data.

PORTLAND HS Page 14 of 15
GRADE 12 – WORKSHEET 23
7.3 Hence, estimate the number of days on which 65 or more messages were sent.
𝟏𝟒 days

PORTLAND HS Page 15 of 15

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy