Statistics Question Bank
Statistics Question Bank
By I. Mudzingwa
Powered by VaChingwere
With Answers
With A……….s
Given that the probability of at least one delay occurring in a period of n weeks is
greater than 0,875, find the least possible value of n. JUN 2008 [4]
12.The owners of a motel in Mutare have noticed that in a long run 40% of the people
who stop and inquire about a room for the night, actually book a room. How many
inquiries must the owners answer to be 99% sure of at least one booking? [5]
NOV 2009
13.Data from the Consumer Council of Zimbabwe shows that 42% of Zimbabweans
eat breakfast every day. Find the probability that in a random sample of 300
Zimbabweans, the number who eat breakfast every day is
i) at most 100 [6]
ii) from 130 to 140 [4]
NOV 2009
14.Three players A, B and C, in that order, throw a fair cubical die. The first to throw
a 6 wins. The game is continued until one of the players wins.
(a) Find the probability that A wins
(i) on his first throw, [1]
ii) on his second throw, [2]
iii) the game. [3]
30
(b) Given that the probability that B wins is , find the probability that C
91
wins. [2]
NOV 2010
15.In a chemical industry workmen had a 20 % chance of suffering from an
occupational disease. Find the number of workmen who could have been selected
at random before the probability that at least one of them contracted the disease,
become greater than 0,9. [5]
NOV2011
16.a) State the condition under which a normal distribution be used to approximate a
binomial distribution. [1]
b) It is estimated that 20% of people undergoing medical review are men. If a
random sample of 100 people is undergoing medical review, find the probability
that more than 30 are men. [5]
NOV 2011
17.The number of patients admitted at a medical centre each day is found to have a
Poisson distribution with mean 2.
a) Evaluate the probability that on a particular day there will be no admission [2]
b) At the beginning of one day, the hospital have 5 beds available. Calculate the
probability that this will be an insufficient number for the day. [5]
c) Calculate the probability that there will be exactly three admissions altogether on
two consecutive days. [3]
d) 150 patients are attended to, at the centre on one particular day the probability
that a patient will be admitted is 0,02. Using a suitable approximation, find the
probability that exactly 4 patients are admitted. [3]
NOV 2011
18.In an Olympiad Quiz Examination paper, there are 100 questions. Each question
has 5 suggested answers and a candidate has to choose the correct one. Given that
Mary is equally likely to choose any of the 5 answers in each question since she
was guessing, use a suitable approximation to find the probability that she get at
least 27 correct answers. [4]
NOV 2013
19.In a certain factory, there are two machines producing the same brand of fuses. The
first machine produces 10 % and the second machine produces 90 % of the fuses. It
is known that the probability that the first machine produces a defective fuse is 1 %
and the probability that the second machine produces a defective fuse is 5 %.
i) Find the probability that a fuse drawn at random from the production line is
defective. [2]
ii) Given that fuse is defective, find the probability that it was produced by the first
machine. [3]
NOV 2013
20.The number of people joining a queue in a supermarket between 6.30 am and 7.00
am on a week day follows a Poisson distribution with mean of 2 people joining the
queue per minute. Find the probability that
a) Five people join the queue in one minute, [2]
b) more than four people join the queue in one minute [3]
c) less than four people join the queue in a 2 minute interval. [4]
NOV 2013
21.Every year a local cellular network provider holds a competition. The proportion
that a dollar spent on airtime wins a prize is 1 in 110.
a) Show that the probability that a subscriber who spent $50 on airtime wins at
least one price is 0,365 correct to 3 significant figures. [3]
b) Find the probability that in a group of
i) 10 subscribers each spending $50 on airtime, 3 or more win at least one prize,
ii) 100 subscribers each spending $50 on airtime, 40 or more win at least one prize.
NOV 2013 [8]
22.The probability that a learner driver passes his/her test at Vehicle Inspection
Department is. A learner counts the number of attempts, n until he/she passes the
driving test,
a) State a suitable statistical distribution which can be used to model the above
situation. [1]
b) Find the mean and variance of the distribution. [3]
c) Find the smallest value of n, for which there is a probability of at least 0.7, that
the learner need only n or fewer trials to pass the test. [4]
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
5
Vachingwere. With God everything is possible
JUN 2013
23.Mangoes are boxed into cartons, each containing 500 mangoes. The probability
that a mango is rotten is 0,002. Buyers of these cartons of mangoes will return any
carton that contain 4 or more rotten mangoes.
a) i) Find the expected number of rotten mangoes per carton. [1]
ii) State, giving a reason, the most appropriate statistical distribution which can
be used to model the above situation. [2]
b) Find the probability that i) a carton of mangoes is not returned, ii) if two cartons
of mangoes are chosen at random, they contain at least three rotten mangoes. [7]
JUN 2013
24.The probability that a boy hits a target is 0.8. Assuming that shots are independent
of each other and suppose that during each practice period, the boy fires shots until
he hits the target.
(i) Find the mean and standard deviation of the number of shots fired per practice
period. [3]
ii) Find the probability that the boy will need to take at least five shots to hit the
target. [2]
JUN 2014
25.The probability that a seed is grown under specified conditions will germinate and
produce a plant is 0.8. The minimum number of seeds, n, are to be planted under
three conditions to ensure that a probability of at least 0.9 that 60 or more seeds
will germinate and produce a plant.
(i) Using a suitable approximation show that 𝑛2 − 149.2 + 5531.6 ≥ 0 [4]
ii) 1. Solve the inequality in part i). 2. Hence or otherwise find the minimum value
of n, the number of seeds to be planted. [3]
JUN 2014
26.An insurance company receives on average 3 claims on any given week. Find the
probability that the company receives
(a) at least 2 claims in any given week, [3]
(b) one claim a day, assuming that the company works 5 days in a week, [3]
(c) a total of 2 claims during 3 consecutive weeks, [3]
(d) at least 2 claims in exactly one of the 3 consecutive weeks. [3]
JUN 2015
27.The number of passengers arriving at a taxi rank per hour was found to have a
Poisson distribution with mean 2.
a) Calculate the probability that in a particular hour there will be no passenger
arriving. [2]
b) At the beginning of an hour there will be 4 taxis available for hire. Calculate the
probability that this will be an insufficient for the hour assuming that each taxi
allows only one passenger. [3]
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
6
Vachingwere. With God everything is possible
the seeds in a packet are rotten. A packet containing 5 or more rotten potatoes is
said to be substandard.
i) Calculate the probability that a packet of potato seeds is substandard. [3]
ii) A load consist of 20 randomly chosen packets of potato seeds. Find the
probability that the load will consist of exactly 2 packets which are substandard. [3]
NOV 2018
34.In a certain school, 90% of the learners are right handed. Find the probability that
in a random sample of
i) 8 learners, exactly 6 will be right handed. [3]
ii) 20 learners , fewer than 18 will be right handed. [4]
iii) 200 learners, at most 182 will be right handed. [4]
NOV 2018
35.A company receives on average 6 orders per day. Find the probability that
i) no more than 2 orders will be received on a given day. [3]
ii) on a given half day, no orders will be received. [3]
NOV 2018
36.70% of all the cellphones sold by an electrical shop have a certain application.
(a) Find the probability that out 15 customers who buy a cellphone, less than 13
chose one with that application. [3]
(b) Use a suitable approximation to find the probability that, out of 60 customers
who buy cellphones, more than 45 choose one with that application. [5]
NOV 2019
37.The number of people who use a lift in a multi-storey building follows a Poisson
distribution with mean of 2 in a minute. Find the probability that
a) Exactly 3 people use the lift in a minute, [2]
b) less than 4 people use a lift in a period of 2 minutes, [3]
c) more than 2 people use a lift in 3 minute period. [3]
NOV 2019
38.a) The probability that a form 3 learner passes a given test at a particular school is
0.6.
i) In a class of 15 form 3 learners find the probability that 1. Exactly 4 learners pass
the test, 2. Less than 13 learners pass the test. [6]
ii) In a stream of 200 form 3 learners, find the probability that more than 150 pass
the test. [6]
b) if 𝑋~𝐺𝑒𝑜(0.25), calculate
i) the variance of 𝑋, [2]
ii) 𝑃(𝑋 > 3). Nov 2019 [2]
39.A school has two photocopiers 𝑋 and 𝑌, the number of times per week that 𝑋
breaks down has a Poisson distribution with mean 0.3, while independently the
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
8
Vachingwere. With God everything is possible
number of times that 𝑌 breaks down in a week follows a Poisson distribution with
mean 0.2. Find the probability that in the next 4 weeks.
(i) 𝑋 will not breakdown at all. [4]
(ii) There will be a total of 3 breaks down. [3]
(iii) Each photocopier will breakdown exactly twice. [3]
SPMN P 1
40.An insurance company receives on average 3 claims on any given week. Find the
probability that the company receives
(a) at least 2 claims in any given week, [4]
(b) one claim in a day, assuming that the company works for 5 days in a week, [4]
(c) a total of 2 claims during 3 consecutive weeks, [4]
(d) at least 2 claims in exactly one of the 3 consecutive weeks. [4]
SPMN P2
Chi-Squared Test
1. A random sample of men and women indicated their views on adopting a national
dress as summarized below
In favour Opposed Undecided Total
Women 118 62 25 205
Men 84 78 37 199
Total 202 140 62 404
Vigour
Leaf colour Good Average Weak
Green 55 79 4
Yellow-green 11 60 15
Yellow 16 65 19
Test at the 5% level of significance whether vigour and leaf colour are related. [13]
Jun 2004
3. The table below shows the interruption of service per day due to a photocopying
machine breakdown
Interruptions 0 1 2 3 4 or more
per day
No of days 27 28 30 12 3
Test whether a Poisson distribution with parameter 𝜆 = 1is a suitable model at the
5 % significance level. [9]
Nov 2004
4. Most of the business done by an estate agent in Harare occurs during the months of
March to October. The records for 1994 showed that the number of houses whose
purchases were completed during these months were:
Month M A M J J A S O
Number of houses 15 18 20 24 20 22 21 20
purchased
In allocating staff to deal with the work for 1995 the management worked on the
assumption that no one of these is likely to be busier than another. At 5% level of
significance, carry out a test to determine whether the assumption is justified. [9]
Nov 2007
5. In a seed viability test, 600 seeds were planted in rows of 6.The number of seeds
that germinated in each row was counted and the results are shown in the table
below.
Number of seeds 0 1 2 3 4 5 6
germinating per
row
Observed 1 4 7 29 33 18 8
number of rows
(a) Calculate
(i) the mean of seeds germinating per row, [2]
(ii) the expected frequencies corresponding to these observed values for a
binomial distribution with the same mean as that in (i) [5]
(b) Carry out the appropriate 𝜒 2 – test, at the 5% level of significance, to determine
whether the observed values confirm that the number of seeds germinating follow a
binomial distribution. [10]
Nov 2008
6. The personnel department of a company in Chegutu is doing a study about job
satisfaction, classifying it as either high, medium or low. A random sample of 310
employees was given a test designed to diagnose the level of job satisfaction.
Results were recorded according to salary levels.
Job Number of Number of Number of
satisfaction employees earning employees employees
under $10 million earning $10- earning over
$20 million $20 million
High 20 20 10
Medium 100 65 35
Low 40 15 5
Use a 𝜒 2 - test to determine if salary and job satisfaction are independent at the 5%
level of significance. [13]
Jun 2008
7. A random sample of 400 students was asked to indicate their view on the infusion
of environmental issues in their college curriculum. The results are summarised in
the following table.
In favour Opposed Undecided
Females 115 60 36
Males 90 85 14
Breakdowns (X) 0 1 2 3 4 5 or
more
Frequency 15 25 30 21 9 0
Test the hypothesis that X has a Poisson distribution. [12]
Nov 2009
9. The number of electrical faults at a station as observed over a period of 160m days.
The following table gives the frequency distribution of the observations.
Number of electrical 0 1 2 3 4 5 6 7
faults
Number of days 25 35 35 25 20 10 7 3
Apply the 𝜒 2 -test at the 5 % level of significance to determine if the number of
electrical faults follows a Poisson distribution. [15]
Nov 2010
10.A sports director wants to know whether the interest distribution of form one
students in sporting disciplines is different from form two interest distribution. The
form two interest distribution is given in table 1
Table 1
Sporting discipline Percentage
Cricket 21.1
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
12
Vachingwere. With God everything is possible
Hockey 27.0
Rugby 33.9
Soccer 18.0
A random sample of 200 form ones was taken and gave the results in table 2
Table 2
Sporting Frequency
discipline
Cricket 42
Hockey 62
Rugby 64
Soccer 32
14.A policemen attending to accidents claims that the type of an accident depends on
the colour of car involved. The table below shows the results of 200 accidents
attended to.
Minor Serious Fatal Total
Black 15 23 22 60
White 35 24 11 70
Red 20 23 27 70
Total 70 70 60 200
Test at the 10 % level of significance whether the data supports the policemen’s
claim. [11]
Jun 2014
15.One hundred Electrical components are tested to see how many defects each has.
The results
are shown in the table.
Number of 0 1 2 3 4 5 6 ≥7
defects
Number of 11 22 26 24 9 5 3 0
components
(i) Calculate the mean of the distribution. [2]
(ii) Calculate the expected frequencies (correct to 1 d.p) of the associated
Poisson distribution having the same mean. [3]
2
(iii) Perform a 𝜒 goodness of fit test to determine whether or not the above data
come from Poisson distribution using 5 % level of significance. [9]
Jun 2014
16.The number of electrical faults at a station as observed over a period of 160m days.
The following table gives the frequency distribution of the observations.
Number of 0 1 2 3 4 5 6 7
electrical faults
Number of days 25 35 35 25 20 10 7 3
Age in years
17 18 19 20
Fail 21 33 25 10
Pass 24 28 50 9
Test at 5% significance level whether there is an association between the age of a
learner and passing at first attempt. [10]
Nov 2017
20.The discrete random variable X is distributed as shown in the table below
X 0 1 2 3 4
Frequency 46 44 20 8 2
(a) Calculate the mean value of X. [1]
(b) (i) Find the frequencies that would correspond to a Poisson model with the same
mean.
ii) Test at the 5% level of significance whether the data follows a Poisson
distribution with the same mean. [9]
Jun 2017
21.An agriculture class decided to test three new types of fertilizer, X, Y and Z on the
bean crop in the school garden. They applied the fertilisers to 75 beds of bean
plants. The yield per bed of beans was classified as high, medium or low. The
results are summarised in the table below.
Type of fertilizer
Yield X Y Z
High 12 15 3
Medium 8 8 8
Low 5 7 9
Test at the 1% level of significance whether there is association between type of
fertiliser and yield. [11]
Jun 2017
Data Representation
67 76 85 42 93 48 93 46 52 63
82 72 44 66 87 78 47 66 50 72
82 56 58
(a) Construct a stem and leaf diagram to represent this data. [2]
(b) Using a scale of 2 cm to represent 10 km draw a box and whisker plot to
represent this data. [4]
(c) Give one advantage of using
(i) a stem and leaf diagram. [1]
(ii) a box and whisker plot. [1]
Nov 2003
2. The heights of 100 plants in a garden measured to the nearest centimetre are
summarized in the following table.
(a) Draw a cumulative frequency curve for the distribution using 2cm to represent
50 cm on the horizontal axis and 2 cm to represent 10 plants on the vertical axis.
[4]
(b) The heights of the shortest and the tallest plants were 20 cm and 380 cm
respectively. Use this information and your answer in (a) above to draw a box
and whisker plot for the data. [5]
(c) Name a way of representing data so that the details of the original data are
retained. [1]
Jun 2004
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
17
Vachingwere. With God everything is possible
3. The masses in grams of 24 sweets in a bag are represented by the stem and leaf
diagram shown below. The leaves are not ordered.
Stem Leaf
0.7 2 3 9
0.8 0 8
0.9 1 9 1 8 4
1.0 3 8 6 1
1.1 3 3 9 3
1.2 1 2
1.3 9 3
1.4 4 5 Key: 0.7|2=0.72
(a) Find
(i) the median of this distribution, [1]
(ii) the mode of the distribution. [1]
(b) A sweet of mass more than 1.2g is classified as large. Calculate the mean of large
sweets that the bag contains. [2]
Jun 2008
4. The stem and leaf diagram below shows the pocket money received by a group of
girls in the year 1980
Stem Leaf
0 50 50 50 75
1 00 00 00 50 75
2 00 00 00 50 50
3 00 25 30 75
4 50
5 50 Key 3|30 = $3.30
Find the mean and standard deviation of the distribution of the pocket money
received by the girls. [3]
Nov 2008
5. The table shows the bus fares in thousands of dollars paid by 19 football fans
selected at random from a football crowd.
73 85 48 80 53 75 55
58 62 69 63 64 73 65
55 54 55 45 55
(a) Construct a stem and leaf diagram representing this data. [3]
(b) Calculate the median and inter-quartile range. [4]
Nov 2009
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
18
Vachingwere. With God everything is possible
6. The following table gives the frequency distribution of the number of computers
sold during the past months at all different computer stores in Harare.
1 3
2 6
3 1
4 1 3
5 0 2 6 8
6 1 2 2 2 7
7 0 3 4 5 5 8 9
8 0 3 4 4 8
9 2 7 7 8 Key 4|1 means 41%
1. A Domestic Workers Union claims that the average hourly rate paid to domestic
workers is $15.85. The house-wives league in this country wishes to test this claim.
They conducted a survey amongst a sample of 1 225 domestic workers throughout
the country. They found the sample mean hourly rate to be $16.03. Assume that the
population standard deviation of hourly rates paid to domestic workers is $2.87.
Test the hypothesis at the 2% significance level that the average hourly rate paid to
domestic workers in this country is more than $15.85. [6]
Nov 2003
2. It is given that about 95% of values of a standard normal distribution lie between a
and b. where a < b and P(a < x < b) = 0.95.
(i) Show that 𝑏 = 𝜇 + 1.96𝜎 and 𝑎 = 𝜇 − 1.96𝜎. [5]
𝑎+𝑏
(ii) Hence show that 𝜇 = [2]
2
Nov 2003
3. On any day, the amount of time measured in hours, that a viewer spends watching
television is a continuous random variable T, with a cumulative distribution
function given by
0 𝑡≤0
𝐹(𝑡) = {1 − 𝑘(15 − 𝑡) 2
0 ≤ 𝑡 ≤ 15
1 𝑡 ≥ 15
Where k is a constant.
1
(i) Show that 𝑘 = . [2]
225
(ii) Show that for 0 ≤ t ≤ 15, the probability density function of T is given by
2 2
𝑓(𝑡) = – [2]
15 225
(iii) Find the median of T. [3]
Nov 2003
4. Biscuits are produced with weight W grams where W – N (10:4) and are packed at
random into boxes consisting of 25 biscuits. Find the probability that
(a) a biscuit chosen at random weights less than 9.5g. [2]
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
22
Vachingwere. With God everything is possible
(b) the contents of a box weigh between 247g and 253 g. [3]
(c) the mean weight of the biscuits in the box is greater than 10.2g. [3]
Nov 2003
5. The difference between the actual and the scheduled time arrival for a commuter
train is normally distributed with a mean of 5 minutes (ie on average it is 5 minutes
late) and standard deviation of 11 minutes. On a randomly chosen day, calculate
the probability that the train will be
(i) more than 5 minutes late [2]
(ii) late [3]
(iii) at least 10 minutes late [2]
Jun 2004
6. the probability density function of a life time, X hours of a bulb is given by
𝜋
𝑘𝐶𝑜𝑥 ( )
𝑓(𝑥) = { 200 , 𝑓𝑜𝑟 0 ≤ 𝑥 ≤ 100
0 𝑒𝑙𝑠𝑒𝑤ℎ𝑒𝑟𝑒
𝜋
(i) show that k=( ) [2]
200
(ii) find E(X) [3]
(iii) Find the probability that a bulb chosen at random will have a lifetime
exceeding 80 hours. [3]
Jun 2004
7. A number X is randomly selected from the interval (-л, л). Find the cumulative
distribution function of X. [4]
Nov 2004
2
, 0 ≤ x ≤ 1
8. The function 𝑓(𝑥) = {3x(2 – x)
0, otherwise.
(i) Verify that f(x) is a probability density function. [2]
(ii) Find 𝑃(𝑋 < ½) [1]
(iii) Calculate the probability that 2 of 3 independent values of X observed will
be less than ½ [3]
Nov 2004
9. (a) Mercy travels from her Harare office to her home by commuter omnibus from
station A to station B. her walking times to station A from the office and from
station B to her home add up to 20 minutes. The variable factors measured in
minutes are as shown in the table below
Mean Standard deviation
Waiting time 30 54
Bus journey 50 25
Assuming that these two factors are independent and normally distributed, find the
probability that the whole journey takes
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
23
Vachingwere. With God everything is possible
Nov 2009
14.A continuous random variable has a probability density function,
𝑘𝑥, 0≤𝑥≤3
𝑓(𝑥) = {3𝑘(4 − 𝑥), 3 ≤ 𝑥 ≤ 4
0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
where k is a constant.
a) Find the value of k, and sketch the graph of f(x). [4]
b) Find the probability that x > 2. [3]
Nov 2010
15.After some rain the depth of moisture, X meters, in Arda Gardens can be taken as a
continuous random variable with a probability density function
12𝑥
(𝑏 − 𝑥), 0 ≤ 𝑥 ≤ 1
𝑓(𝑥) = { 5
0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
a) Find the value of b. [3]
b) Calculate the probability that the depth of moisture exceeds 0,9. [3]
Nov 2014
16.The probability density function of the lifespan, X months, of a bulb is given by
𝑘
, 1≤𝑥≤3
𝑓(𝑥) = { 𝑥(4−𝑥)
0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
a) Find the value of k. [3]
4
b) Given that E(X) = 2, show that Var(X) = 4− [3]
1𝑛3
c) Find the probability that a bulb chosen at random will have a lifespan exceeding
2 months. [2]
Nov 2013
17.A continuous random variable has a probability density function f(x) given below
1
, 0 ≤ 𝑥 ≤ 0.5
2
𝑓(𝑥) = {1
(3 − 𝑥), 0.5 ≤ 𝑥 ≤ 3
5
0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
a) Sketch the graph of f(x). [2]
b) Find the median. [3]
c) Evaluate P(x<1.2). [3]
Jun 2013
18.A continuous random variable, X, has a probability density function defined as
0.1𝑥 + 𝑘, 4≤𝑥≤6
𝑓(𝑥) = { 0.3, 6≤𝑥≤8
0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
Find
2
a) Show that = . [2]
9
b)Construct the probability density function of X. [2]
Jun 2016
21.A continuous random variable X has probability density function f(x) given by
𝑎−𝑥
2( )
𝑎2
𝑓(𝑥) = { , 0≤𝑥≤𝑎 where a is a constant
0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
a) Find E(X) in terms of a. [2]
2 2
b) Show that the expression for the median reduces to 2m -4am+a =0 where m is
the median. [3]
Nov 2017
22.The random variable X is normally distributed with mean 𝜇 and standard
deviation 𝜎. Given that 𝑃(𝑋 > 3.6) = 0.5 and 𝑃(𝑋 > 2.8) = 0.6554, find the
value of 𝜇 and the value of 𝜎. [5]
Nov 2017
23.The continuous random variable X has a probability density function given by
2𝑒 −𝑘𝑥 , 𝑥 ≥ 0
𝑓(𝑥) = { where k is an integer,
0, 𝑥 < 0
a) Show that k = 2. [2]
b) Find the i) cumulative function of X, [2]
ii) exact value of the median. [2]
Jun 2017
24.The random variable X is normally distributed with mean 𝜇 and variance 𝜎2. Given
that 𝑃(𝑋 > 65) = 0.01 and P(X<20) = 0.02, find 𝜇 and 𝜎. [7]
Jun 2017
1
(b) Find the value m such that 𝑃(𝐵1 + 𝐵2 < 𝑚) = where B1 and B2 are
4
independent observations. [7]
Nov 2014
5. The length and height of a brick are independent normal variables with means and
standard deviations as shown in the table.
8. A manufacturer of vehicles sells two types of vehicles, heavy and light vehicles.
The cost of each type of vehicle in thousands dollars are shown in the table below.
Mean Standard
cost deviation
Light 252 2
Heavy 1 012 5
(i) A vehicle of each type is selected at random. Find the probability that the heavy
vehicle costs less than 4 times the light vehicle. [5]
(ii) One heavy and four light vehicles are selected at random. Find the probability
that the cost of a heavy vehicle is less than th total cost of four light vehicles.[5]
Jun 2014
9. The mass, mg, of a randomly chosen key-holder is known to follow a normal
distribution with mean 20g and a standard deviation of 4g. The mass M grams of a
randomly chosen key-holder is also known to follow a normal distribution with
mean of 12 g and standard deviation of 9 grams.
(a) Find the probability that the combined mass of
(i) 2 randomly chosen key-holders and 3 randomly chosen keys is greater
than 78g,
(ii) 3 key-holders is greater than the combined mass of 6 keys. [8]
(b) Determine the probability that a randomly chosen key-holder is more than twice
the mass of a randomly chosen key. [5]
Jun 2015
10.The masses, in grams, of the contents and packaging of a randomly chosen packet
of powdered milk of brand M may be taken to have a normal distribution with
mean and standard deviation given in the table.
Mean Standard deviation
Contents 500 8
Packaging 20 2
packets brand N weighs more than the contents of four randomly chosen
packets of brand M. [5]
Jun 2016
11.(a) State any one advantage and any one disadvantage of using a stem and leaf
diagram as a method of representing data. [2]
(c) A group of 30 students had their heights measured correct to the nearest
centimeter. The results are shown below.
167 174 156 180 162 169 177 154 165 174
160 184 169 179 151 163 173 148 171 168
158 158 167 166 149 153 171 162 182 162
(i) Using five stems, plot a stem and leaf diagram for the above
information. [3]
(ii) Find the median. [1]
(iii) Find the inter-quartile range. [3]
Nov 2017
12.The random variables, R and S, are normally distributed . Given that 𝑅~𝑁(54,36)
and 𝑆~𝑁(48,25)
(a) Find
(i) the value of r and s such that 𝑃(𝑅 ≤ 𝑟) = 𝑃(𝑆 ≥ 𝑠) = 0.484. [7]
(ii) 𝑃(𝑅 ≥ 𝑆) [2]
(b) Six independent observations of R are taken. Find the probability that the sum
of six observations is less than 300. [5]
Jun 2017
Number of 4 5 6 7 8 9
people
Number of 2 3 7 6 4 2
visits
Calculate the mean and standard deviation of the people in the queue [3]
Nov 2004
5. Three flower vendors X, Y and Z have equal chances of selling their flowers. X has
80 red and 20 white, Y has 30 red and 40 white and Z has 10 red and 60 white
flowers. On Valentine’s Day, Kudzai wants to buy a flower.
(i) Find the probability that she picks a red flower [3]
(ii) Given that she bought a red flower find the probability that it was from Y [3]
Nov 2004
6. An unbiased six sided die is thrown three times.
Calculate the probability that
(i) The total score is an even number, [2]
(ii) The total score is an even number given that a 5 appears at the first throw.[4]
Jun 2005
7. A loaded die is such that the probability of the face turning up is proportional to the
number of X on the face. The prbability distribution of the discrete random
variable X is given in the table below.
𝑥 1 2 3 4 5 6
𝑃(𝑋 = 𝑥) 𝑘 2𝑘 3𝑘 4𝑘 5𝑘 6𝑘
2
(a) verify that 𝑃(𝑋 = 𝑥) = [2]
21
13
(b) given that 𝐸(𝑋) = , 𝑓𝑖𝑛𝑑 𝑉𝑎𝑟(𝑋) [2]
3
Jun 2006
8. A biased die produces a score, Y, for which the probability distribution is given in
the table below.
𝑦 1 2 3 4 5 6
𝑃(𝑌 = 𝑦) 𝑥 2𝑥 3𝑥 4𝑥 5𝑥 6𝑥
9. A bag contains 5 white balls and 3 red balls. Two players, A and B, take turns at
drawing one bag from the bag at random, and balls are not replaced. The player
who first gets two red balls is the winner, and the drawing stops as soon as either
player has drawn two red balls. Player A draws first. Find the probability that
player A is the winner given that the winning player wins on his second draw. [5]
Nov 2006
10. A building society gives both adjustable-rate mortgage and fixed-rate mortgages on
residential property. It breaks residential property into 3 categories: low density
houses, high density houses and blocks of flats. The following table gives
probabilities appropriate to this situation.
13. A roulette wheel contains 38 numbers of which 18 are red, 18 are black and 2 are
green. When a roulette is spun, it is equally likely to land on any of the 38
numbers. In two plays at the wheel, find the probablility that
(a) The ball lands on red both times [2]
(b) The ball lands on green the first time and on black the second time. [2]
Nov 2011
14. The meteorological department of a certain country adopts a simple model of the
weather in which each day is classified as either fine or rainy. The probability that
a fine day is followed by another fine day is 0.8. The probability that a rainy day is
followed by a fine day is 0.4. The probability that 1 February is fine is 0.75.
Using a tree diagram or otherwise, find the probability that
(a) The 3rd of February is fine, [3]
st rd
(b) The 1 February was rainy given that 3 February is fine [3]
Nov 2011
15. An unbiased tetrahedral die has the number 1 written on one face, the number 2 on
the other face and the number 3 on the remaining two faces. The die is thrown
twice and X is the product of the scores obtained from the two throws. Find
(a) The probability distribution of X [4]
(b) 𝐸(𝑋) and 𝑉𝑎𝑟(𝑋) [4]
Nov 2011
16. In a certain court, there are only two verdicts on passing judgement, namely
“convitded” or “discharged”. Of all the cases that have been tried by this court,
80% of the verdicts were convictions. Suppose that when the court’s verdict is
“convicted” or “discharged”, the respective probabilities of the accused person
being innocent are 0.07 and 0.4 respectively.
By the use of a tree diagram, find
(a) The probability that a person tried by this court is innocent, [3]
(b) The conditional probability that an innocent person tried by this court is
convicted. [3]
Jun 2012
17. A die is weighed in such a way that the probability of each face coming up is
proportional to the face value, x.
(a) Construct the probability distribution of X [2]
(b) Calculate E(X) [2]
Jun 2018
18. A school selects 55% of its lower sixth pupils from its own O level pupils and the
remainder comes from other schools. It is established that 90% of accepted A-level
students who did their O-level outside the school pass their A-level studies, and
that 70% of those who did their O-level studies at the school pass their A-level
studies. A pupil is selected at random from the recent A-level graduate of the
school. Find the probability that the pupil
(i) Passes A-level studies. [4]
(ii) Did O-level outside the school, given that the pupil passes A-level studies[2]
Nov 2008
19. A fair die is tossed three times. Find the probability that
(i) Exactly one six is obtained. [2]
(ii) The first score is even, the second is odd and the third is either a one or a
two. [3]
Jun 2008
20. A discrete random variable X takes the values 0, 1 and 2 only, with probabilities
4
P0, P1 and P2 respectively. Find the values of P0, P1 and P2 given E(X) = and
3
5
Var(X) = . [8]
9
Jun 2008
21. Three tickets for a musical show are sent to a high school musical club. Fifteen
girls and ten boys would like a ticket. If the three people to receive a ticket are
chosen at random, find the probability that they will be
(i) exactly 2 boys, [3]
(ii) at least 2 girls. [3]
Nov 2009
22. Three tickets for a musical show are sent to a high school musical club. Fifteen
girls and ten boys would like a ticket. If the three people to receive a ticket are
chosen at random, find the probability that they will be
(i) exactly 2 boys, [3]
(ii) at least 2 girls. [3]
Nov 2009
23. The diagram above shows a triangular prism with two equilateral triangular faces
and three rectangular faces. The rectangular faces are numbered 1, 2 and 3 whilst
the triangular faces are numbered 4 and 5. When the prism is tossed the probability
that it lands on each rectangular face is 2k and the probability that it lands on each
triangular face is k.
27. The distribution table shows prizes corresponding to six values on a fair spinner
used in a game. The spinner lands only on one of the six values.
Value 1 2 3 4 5 6
Prize in 2 2 6 4 10 6
$
(a) Find the probability of the spinner landing on
(i) a prime number,
(ii) a value that gives a prize of not less than $4. [2]
(b) Calculate the expected prize for a single game. [2]
Jun 2015
28. Two tetrahedral dice with faces marked 0, 1, 2, 3 are thrown and the number on
which each lands on is noted. The score is the sum of the 2 numbers. By means of
an outcome table or otherwise, find the probability that
(i) the score is a prime number, [3]
(ii) one die lands on a 3 given that the score is a prime number. [3]
Jun 2015
29. Bag A contains 3 red balls and 2 white balls. Bag B contains 2 red balls and 3
white balls. A bag is selected at random and the two balls are drawn from it, one
after the other without replacement.
(a) Find the probability that the two balls drawn are red. [2]
(b) Given that the two balls are red, find the probability that they are from bag [3]
Jun 2016
30. A and B play against each other in a game. Each result is either a win for A or a
win for B. the probability of A winning the first game is 0.6. If A wins a particular
game, the probability of winning the next game is 0.7. If A loses a particular game,
the probability of winning the next game is 0.4. Find the probability that
(i) A loses the second game, [2]
(ii) A wins the first game, given that A loses the second game. [3]
Nov 2017
31. A random variable X, has the probability distribution given below.
X 0 1 2 3
P(X=x) 0.35 0.2 p q
Given that E(X2) = 3,
(i) find the value of p and the value of q,
(ii) Calculate the Var(X), correct to 2 decimal places. [7]
Nov 2017
Regression:Bivariate Data
Zimsec past exam papers
1. The pressure P and volume V of a fixed mass of gas are related by an equation of
the form
𝑃𝑉 𝑎 = 𝑘, where k and a are constants.
From this equation obtain a linear equation, 𝑦 = 𝑚𝑥 + 𝑐, where 𝑥 = 𝐼𝑛𝑃 and
𝑦 = 𝐼𝑛𝑉. [2]
In six experiments of the fixed mass of a gas, in each of which P was controlled
and V measured. The results satisfied
𝛴𝑥 = 2.420, 𝛴𝑦 = −1.708,
𝛴𝑥2 = 3.171, 𝛴𝑦2 = 1.561,
𝛴𝑥𝑦 = −2.224.
2. The IQs of a group of 6 students who sat for a mathematical examination were
measured. Their IQs and examination marks were recorded
4. For this question answers must be given correct to 3 significant figures where
appropriate.
The yield per hectare of a crop depends on the amount of rainfall in the growing
season. The value of the yield, X, in tones per hectare and the rainfall, Y, in
centimeters per nine successive growing seasons are given in the table below
X 8 10 15 6 11 12 13 11 9
Y 14 10 18 13 14 13 16 11 12
(c) Calculate the regression line of P on N in the form P = a + bN. Draw this line on
your graph and use it to estimate P when N = 35. [7]
(d) (i) Calculate the product moment correlation coefficient for the given data. [3]
ii) Interpret the result of this calculation in terms of your scatter diagram. [2]
Nov 2008
7. Participants to a ZIMSEC workshop on syllabus interpretation were asked to report
the distance d, they drove in kilometers and the time t, taken in minutes. The table
below gives a random sample of the values reported.
d(km) 263 211 290 580 473 377
t(min) 180 210 240 420 390 330
∑(𝑑 − 300) = 394, ∑(𝑑 − 300)2 = 123 648, ∑(𝑡 − 200) = 570,
a) Plot a scatter diagram showing study time T, against the mark, M. [3]
b) Calculate the equation of regression line M = a + bT where a and b are constants
to be determined. [5]
c) Draw the regression line on the graph and use it to estimate the study in hours
and minutes for a student who scored 41 marks. [4]
d) Find the product moment correlation coefficient and comment on the
relationship between study times and test marks. [4]
Nov 2011
11.The values of y, length of a spring in cm, were measured for preselected values of
x, the load in Newtons, and are shown in the table:
x newtons 1 2 3 4 5 6 7 8 9 10
y cm 10.7 11.3 12.0 12.4 13.0 13.7 14.5 15.1 15.6 16.0
70 8
80 9
140 17
95 10
(a) Find the equation of the regression line of the amount of fuel used (Y) on the speed
(X) [4]
(b) Use your equation, in a) , to estimate where possible, the amount of fuel likely to
be used when travelling at
(i) 105 km/hr
(ii) 50 km/hr [4]
(c) Find the product moment correlation coefficient and comment on the relationship
between the speed and the amount of fuel used. [4]
Jun 2015
15.A taxi operator keeps records of the performance of is cars. The total distance
travelled by a car since it was purchased as new is denoted by m and the distance it
can travel with 1 litre of petrol is denoted by d. For 7 cars of the same make and
model, the values of d against m for each car are shown in the table below.
(c) Use the equation of the regression line to estimate the height , in cm, of a boy
whose weight is 40kg. [2]
(d) (i) Find the product moment correlation coefficient. [2]
(ii) Comment the relationship between the weights and the heights of the boys. [2]
Nov 2017
17.Marks X, and Y obtained by each of ten candidates in Mathematics are given in the
table below. X is the mark for paper 1 and Y is the mark for paper 2.
X 86 93 73 66 88 96 80 70 95 63
Y 71 76 61 52 75 94 71 60 85 55
(b) The company replaces all its bolt producing machines causing the proportion of
defective bolts to drop to 0.5%. It now accepts batches only if there are no
defective bolts in a sample of 8. Calculate the change in proportion after the
replacement of the machines. [5]
Nov 2003
3. In 1995, a newspaper reported that for families residing in its circulating area, the
distribution of the daily expenditure for food consumed away from home had an
average of $814.11 and a standard deviation of $20.58. in order to check this claim
an Economist randomly sampled 100 families residing in the area.
Assuming that the newspaper claim was true
(i) Describe the sampling distribution of the mean daily expenditure. [2]
(ii) Calculate the probability that the sample mean daily expenditure for food
purchased away from home was at most $820.00. [3]
Jun 2004
4. (a) Much emphasis has recently been placed on preventive behaviour because of
the AIDS pandemic. In one study at an AIDS awareness campaign conference, 100
questionnaires were issued out randomly. Assuming that the population mean and
standard deviation of the questionnaire scores are 38 and 5 respectively.
(i) state the sampling distribution of the sample mean questionnaire score, [1]
(ii) calculate the probability that the sample mean score exceeds 39.1. [4]
Given that the questionnaire mean score was 39.1, state and explain the nature of
the sample. [2]
(b) A population of locusts has mean mass μg and standard deviation 6g. A random
sample of size 100 is taken. State the distribution of the sample mean mass. [2]
Given that the actual masses in the sample are summarized by
𝛴(𝑥 – 50) = 270 and 𝛴(𝑥 – 50)2 = 2540, where x g is the mass of a locust,
find
(i) unbiased estimates of μ and δ, [3]
(ii) a 95% confidence interval for the population mean mass. [3]
Twenty different random samples are taken and a 95% confidence interval for 𝜇 is
calculated for each sample.
State the expectation of the number of these confidence intervals that will contain
μ. [1]
Nov 2004
5. (a) A large number of samples of size n are taken from 𝑁(100,225),. Given that
95% of the sample means are less than 105, estimate the value of n. [5]
(b) The random variables X and Y are independent and normally distributed, X
being 𝑁( 4,9) and Y being 𝑁(5,16). Given that a sample of 20 observations is
taken from the distribution of X and a sample of 25 from the distribution of Y, find
𝑃(𝑌̅ > 𝑋̅). [5]
Nov 2008
6. The diameters of 25 steel rods are found to have a mean of 0,980 cm and a standard
deviation of 0,015 cm. Assuming that the diameters of the steel rods are normally
distributed with the same variance, find 99% confidence limits for the population
mean. [4]
Jun 2008
7. A random sample of size 40 is selected from a particular population of fish in a
fishery pond. The random variable X denotes the length of fish in centimetres.
Given that the actual length in the sample are summarised by ∑(𝑥 − 20) =
19 ∑(𝑥 − 20)2 = 69 , find the unbiased estimates of
(a) (i) the population mean. [1]
(ii) the population variance. [2]
(b) The 95% confidence interval for the mean life of light bulbs constructed from a
sample of size 36 is (1023,3hrs; 1161,7hrs) Assuming that the life of light bulbs
is normally distributed find the 99% confidence interval for the mean life of this
brand of bulbs. [6]
Nov 2009
8. The following data have been collected for a sample from a population that is
normally distributed.
5, 10, 8, 11, 12, 6, 15, 13
(a) Calculate the unbiased estimate of the population mean, and the standard
deviation. [3]
(b) Find a 95% confidence interval for the population mean. [5]
Nov 2010
9. The ice-cream vendor records his daily takings ($x) over a period of 30 days. The
results were summarised by ∑ 𝑥 = 900 and ∑ 𝑥 2 = 34 000.
(a) Find the unbiased estimates of
(i) the population mean,
(ii) the population variance. [3]
(b) Calculate at 95 % confidence interval the mean amount he receives assuming
that his daily takings are normally distributed. [3]
Nov 2013
10.A random variable X is normally distributed with mean 15 and standard deviation
6. If a random sample of 40 is chosen and found to have a mean 𝑋̅, find
(i) P(𝑋̅ > 16) [4]
(ii) the sample size n such that P (𝑋̅ > 15.5)=0.05. [5]
Jun 2014
11.The masses of letters posted by a certain school are normally distributed with mean
15 g. It is found that the masses of 95% of the letters are within 10 g of the mean.
Find,
Nov 2017
17. The random variables, R and S, are normally distributed . Given that R~N(54,36)
and S~N(48,25)
(a) Find
(i) the value of r and s such that 𝑃(𝑅 ≤ 𝑟) = 𝑃(𝑆 ≥ 𝑠) = 0.48 [7]
(ii) 𝑃(𝑅 ≥ 𝑆) [2]
(b) Six independent observations of R are taken. Find the probability that the sum
of six observations is less than 300. [5]
Jun 2017
Significance testing
Zimsec past exam papers
1. (a) The distribution of a population is known to have mean 9.27 and standard
deviation 1.40. A sample of 36 was taken from this population and it gave a mean
of 8.39. Test whether there is evidence at the 1 % level that the mean has
decreased. [6]
(b)An animal breeder claims that the length of a certain species of animals is
distributed normally with mean 44cm. In order to test the truth of this claim, a
sample of 21 such animals was taken and it was found that 𝑥̅ = 42 cm and 𝑠 =
6 𝑐𝑚. Is there evidence at 5% level to refuse the breeder’s claim? [6]
Nov 2008
2. The diameters of 25 steel rods are found to have a mean of 0,980 cm and a standard
deviation of 0,015 cm. Assuming that the diameters of the steel rods are normally
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
49
Vachingwere. With God everything is possible
distributed with the same variance, find 99% confidence limits for the population
mean. [4]
Jun 2008
3. A manufacture of an item used for the production of metal rods claims that new
machine that he has acquired has resulted in an improved product. The old machine
is known to have given 20% defectives per output. Test at 5% significance level the
validity of the claim if out of a sample of 20 items 2 were found to be defective.
Use the binomial test. [7]
Nov 2009
4. Prior to the institution of a new safety program, that average number of on-the-job
accidents per day at a factory was 4.5. To determine if the safety program has been
effective in reducing the average number of accidents per day, a random sample of
30 days is taken after the new safety program. The number of accidents per day is
recorded. The sample mean and standard deviation were computed as follows.
𝑥 = 3.7 and 𝑠 = 1.85
Is there sufficient evidence to conclude at 1% significance level that the average
number of on-the-job- accidents per day at factory has decreased since instituting
the safety program? [7]
Jun 2004
5. The Zimbabwe consumer report (1999) states that the mean retail cost of Nokia
5110 cellular phone was $600.00. A random sample of 10 stores in Harare, gave
the following prices for this model,
593 621 545 561 609 555 588 575 619 599
(a) Calculate the mean and standard deviation of the above data. [3]
(b) Assuming that the retail costs of these cellular phones are normally distributed,
test at 10% level of significance whether this information indicates that the
population mean of the cost of the cellular phones is less than $600, 00. [6]
Nov 2007
3. (a) In an election held in 2007, 60% of the voters voted for Party A. In a poll of
opinion conducted last week, 250 potential voters were asked how they would vote
if there was an election now. 135 of the voters said they would vote for Party A.
Investigate at 5% level of significance whether the proportion of the voters in
favour of A has decreased significantly. [6]
(b) Ambulance Services claims that it takes an average of 8,9 minutes to respond to
emergency calls. To verify this claim, the Agency which licences ambulance
services timed 50 responses to emergency calls. The observed data gave a mean of
9,3 minutes and standard deviation of 1,8 minutes. Test at 5% significance level
whether there is evidence to justify Ambulance service’s claim. [8]
Nov 2010
4. A milling company found that the bag of flour it packs weighs 10kg each on
average. A random sample of 50 bags is examined and the mass, x kg, of the
contents of each bag is recorded. It is found that ∑(𝑥 − 10) = −12.3 and
∑(𝑥 − 10)2 = 37.7
(a) Estimate the population mean and variance of the mass of the contents of a bag.
[4]
(b) Test at 10 % level of significance, whether the milling company is overstating
the average mass of the contents of each bag. [6]
Nov 2013
5. A sample of 10 items are taken from a production line to check if the machine is
functioning properly. The components produced by the machine are set to have a
mean diameter of 2 cm and a standard deviation of 0.03 cm. The ten items had their
diameters measured and the results were:
2.17 1.93 2.02 1.97 2.00 1.02 2.02 1.89 1.99 2.01
Test at 5 % level of significance whether the components produced by the machine
are of the required standard. [9]
Jun 2013
6. (a) Distinguish between a 1-tailed and a 2-tailed test. [2]
(b) A political party claims that it commands 60 % of the voters. To test this, a
random sample of 300 potential voters was asked which party they would vote for.
160 confirmed that they would vote for that party. Establish whether this sample
supports the claim by the party. Test at 10 % level of significance. [7]
Jun 2014
7. The following are television prices in dollars taken in 40 different shops
40 130 170 240 360 520 170 130
240 360 520 120 220 170 330 480
160 290 200 120 480 160 210 330
70 140 180 260 370 90 150 200
280 450 80 420 190 140 270 120
(i) Construct a stem and leaf diagram for the data. [3]
(ii) Find 1. the median,
2. the quartiles. [2]
(iii) Draw a box and whisker plot. [2]
Jun 2014
8. (a) Distinguish between 1 tailed and 2 tailed test. [2]
(b) It is claimed that rural secondary school pupils travel a distance of more than
12km to school. To test this claim, a random sample of 100 pupils were asked to
keep a record of the distances they travel to school. The random sample showed an
average distance of 14,5 km with a standard deviation of 4.8km. Test at 0.05 level
of significance whether the claim is true. [6]
Making statistics more enjoyable. inomudzingwa@gmail.com 0773748536
51
Vachingwere. With God everything is possible
Jun 2015
9. A manufacturer of orange juice claims that the volumes of packets which the firm
produces are normally distributed with mean 1 000ml and variance 16. A consumer
right inspector tests a sample of 20 packets and finds that the average volume is
997.5 ml. Test at 1% significance level to establish whether or not the manufacturer
is overstating the volume of the contents. [5]
Nov 2017