Summary Statistics Q & A
STANDARD 2
Statistical Analysis (Std 2), S1 Data Analysis (Y11)
Summary Statistics - No Graph (Std 2)
Summary Statistics - Box Plots (Std 2)
Teacher: Ciara Duffy
Exam Equivalent Time: 82.5 minutes (based on allocation of 1.5 minutes per mark)
IMPORTANT FEATURES AND TIPS FROM EXAM HISTORY
MS-S1 Data Analysis has contributed 6.1% per new syllabus Std2 exam since its introduction in 2019.
We have split this area into four categories for the purposes of analysis: 1-Bar Charts and Histograms (0.8%), 2-Other Charts (1.5%), 3-Summary Statistics - Box Plots (1.3%) and 4-Summary Statistics - No Graph (2.5%).
This analysis looks at the sub-topic Summary Statistics - No Graph (2.5%).
ANALYSIS - What to Expect and Common pitfalls
Summary Statistics - No Graph questions require students to understand and calculate statistics such as median, mean, standard deviation (by calculator) and five-number summaries, given a simple data set.
A deep understanding of mean calculations is a must, particularly where data sets are "adjusted" and related mean calculations are required. This question type is regularly asked (most recently in 2022) and consistently produces sub-50% mean marks (review 2020 Std2 28, 2016 Std2 27b).
Questions on outliers/IQRs have been asked in 5 of the last 8 years, most recently in 2021. We highlight 2018 Q26e for revision attention, which required calculating an IQR from a cumulative frequency table and was poorly understood.
Standard deviation questions have seen an uptick in recent years and students must be able to find the std dev of a small data set by calculator (review 2017 Std2 27a) and also explain the measure conceptually (review 2020 Std2 24).
Marker Comments have highlighted past issues for many students in finding the mean of grouped data, where they must use the "class centres" for their calculations. This caused major issues in both 2014 and 2006 and deserves attention.
1. Statistics, STD2 S1 2005 HSC 1 MC
What is the mean of the set of scores?
(A) 6
(B) 7
(C) 8
(D) 9
2. Statistics, STD2 S1 2017 HSC 1 MC
[Figure not reproduced: axis marked from 10 to 55 in steps of 5]
What is the median of this set of data?
A. 15
B. 20
C. 30
D. 35
3. Statistics, STD2 S1 2009 HSC 3 MC
The eye colours of a sample of children were recorded.
When analysing this data, which of the following could be found?
(A) Mean
(B) Median
(C) Mode
(D) Range
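
The marker comments in the analysis notes stress that the mean of grouped data must be calculated from class centres. A minimal sketch of that calculation follows. The classes and frequencies are hypothetical (they are not taken from any HSC question), and `grouped_mean` is an illustrative helper name, not part of any syllabus or library.

```python
# Hypothetical grouped frequency table: (class lower limit, class upper limit, frequency).
# These values are made up purely for illustration.
classes = [
    (1, 5, 2),
    (6, 10, 5),
    (11, 15, 3),
]


def grouped_mean(table):
    """Estimate the mean of grouped data using class centres."""
    total_fx = 0.0
    total_f = 0
    for lower, upper, freq in table:
        centre = (lower + upper) / 2  # class centre, not the lower or upper limit
        total_fx += centre * freq
        total_f += freq
    return total_fx / total_f


print(grouped_mean(classes))  # class centres are 3, 8 and 13
```

The usual error is to multiply each frequency by a class limit instead of the class centre, which shifts every product and therefore the estimated mean.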
4. Statistics, STD2 S1 2004 HSC 12 MC
This box-and-whisker plot represents a set of scores.
7. Statistics, STD2 S1 2008 HSC 10 MC
The marks for a Science test and a Mathematics test are presented in box-and-whisker plots.
If the range of the dataset is 8, what is the minimum value of the dataset?
A. 2
B. 3
C. 4
D. 7
In History, 112 students completed the test. The number of students who scored above 30 marks
was the same for the History test and the Geography test.
How many students completed the Geography test?
(A) 8
(B) 50
(C) 56
(D) 112
What percentage of these students spend between 40 minutes and 60 minutes per day on exercise?
(A) 17%
(B) 20%
(C) 25%
(D) 50%
12. Statistics, STD2 S1 2018 HSC 11 MC
RAP Data - Bottom 10%: School result (63%) was 6% below state average (69%)
14. Statistics, STD2 S1 2011 HSC 7 MC
A set of data is displayed in this box-and-whisker plot.
Write down a set of six data values that has a range of 12, a mode of 12 and a minimum value of 12.
(2 marks)
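
One way to test a proposed answer to the six-value question is sketched below. A minimum of 12 with a range of 12 forces a maximum of 24, and 12 must occur more often than any other value. The candidate list is just one possible answer chosen for illustration, and `meets_conditions` is a hypothetical helper name.

```python
from statistics import multimode


def meets_conditions(values):
    """Check for six values with range 12, a single mode of 12 and minimum 12."""
    return (
        len(values) == 6
        and min(values) == 12
        and max(values) - min(values) == 12
        and multimode(values) == [12]  # 12 is the only most-frequent value
    )


candidate = [12, 12, 12, 15, 20, 24]  # illustrative answer, not the official one
print(meets_conditions(candidate))
```

Many other sets satisfy the same conditions; the check only confirms that a given set does.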
b. The data from the table are shown in the following Pareto chart.
Part i: RAP Data - Bottom 14%: School result (70%) was 5% below state average (75%)
Part ii: RAP Data - Bottom 17%: School result (35%) was 4% below state average (39%)
Part iii: RAP Data - Bottom 23%: School result (27%) was 2% below state average (29%)
Terry and Kim each sat twenty class tests. Terry’s results on the tests are displayed in the box-and-whisker plot shown in part (i).
ii. Kim’s 5-number summary for the tests is 67, 69, 71, 73, 75.
Draw a box-and-whisker plot to display Kim’s results below Terry’s results. (1 mark)
iii. Terry claims that his results were better than Kim’s. Is he correct?
Justify your answer by referring to the summary statistics and the skewness of the distributions.
(4 marks)
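
As a rough sketch of the summary statistics this comparison relies on, the snippet below derives the range, the interquartile range and a simple symmetry check from Kim's 5-number summary. Terry's values are not reproduced here, so only Kim's figures are used. The `describe` helper and its symmetry test are illustrative assumptions, and equal spacing about the median is only a crude indicator that a distribution is not skewed.

```python
def describe(five_number_summary):
    """Derive range, IQR and a crude symmetry flag from a 5-number summary."""
    lowest, q1, median, q3, highest = five_number_summary
    return {
        "range": highest - lowest,
        "iqr": q3 - q1,
        # Symmetric if both whiskers and both box halves are equally long.
        "symmetric": (median - lowest == highest - median)
        and (median - q1 == q3 - median),
    }


kim = (67, 69, 71, 73, 75)  # Kim's 5-number summary from the question
print(describe(kim))
```

For Kim this gives a range of 8 and an IQR of 4, with the median centred in both the box and the whiskers.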
24. Statistics, STD2 S5 2013 HSC 29b
Part ii: RAP Data - Bottom 6%: School result (12%) was 9% below state average (21%)
Ali’s class sits two Geography tests. The results of her class on the first Geography test are shown.
27. Statistics, STD1 S1 2020 HSC 24
a. The ages, in years, of ten people at the local cinema last Saturday afternoon are shown.
The data for Team B was analysed to create the box-plot shown.
Compare the distributions of the number of goals scored by the two teams. Support your answer with
the construction of a box-plot for the data for Team A. (5 marks)
29. Statistics, STD2 S2 2020 HSC 28
Consider the following dataset.
Suppose a new value, , is added to this dataset, giving the following.
It is known that is greater than 15. It is also known that the difference between the means of the two datasets is equal to ten times the difference between the medians of the two datasets.
Calculate the value of . (4 marks)
Copyright © 2004-22 The State of New South Wales (Board of Studies, Teaching and Educational Standards NSW)
Worked Solutions
1. Statistics, STD2 S1 2005 HSC 1 MC
2. Statistics, STD2 S1 2017 HSC 1 MC
ii.
i.
iii.
23. Statistics, STD2 S1 2022 HSC 19
26. Statistics, STD2 S1 2018 HSC 26e
a.
b.
♦ Mean mark part (b) 41%.
27. Statistics, STD1 S1 2020 HSC 24
29. Statistics, STD2 S2 2020 HSC 28
a.
Mean mark 53%.
♦ Mean mark part (a) 39%.
b.