ST107 Exam Paper 2012
ST107 Exam Paper 2012
ST107
Quantitative Methods (Statistics)
Instructions to candidates
Answer all three questions in Section A. All questions in Section B will be given equal weight
(20% each). Section B contains five questions, of which you may attempt as many questions
as you wish, but only your best three questions will count towards your final mark.
Full working must be shown to gain all marks for each question.
(a) Maria has an important job interview in the morning. To ensure she wakes up in time,
she sets two alarm clocks which ring with probabilities 0.97 and 0.99 respectively.
What is the probability that at least one of the alarm clocks will wake her up?
(b) Suppose that the distribution of a set of values has a population mean of 50 and a
population variance of 6. If 4 is subtracted from each score and then each score is
divided by 2, what will be the new mean and variance of the set of values?
(c) If X ∼ Bin(n, π) state, in terms of n and π, what the approximating Poisson and
normal distributions would be.
(e) When and why can a standard normal distribution be used to approximate a
Student’s t distribution?
(f) In a hypothesis test for a single proportion using a normal distribution, we have
H0 : π = 0.7 and H1 : π < 0.7. The test statistic is computed to be z = −1.676.
For α = 0.10, α = 0.05 and α = 0.01, state whether or not the null hypothesis is
rejected.
yb = 51.7 + 3.47x
where x is the number of hours studied and y is examination score. Based on this
predicted relationship, approximately how many hours should a student study to get
at least 85 on the examination?
Find:
3. A well-known fast food retailer forecasts its percentage sales growth for its next financial
year for each of 32 sales regions. The forecasts are shown in the following table:
(a) Construct and label carefully a stem-and-leaf diagram for this dataset. Use your
diagram to find the median and lower and upper quartiles.
(8 marks)
(b) From the shape of the stem-and-leaf diagram, state whether the distribution of the
sample is skewed and, if so, give the direction of the skewness.
(2 marks)
(c) Without making any further calculations, state whether you would expect the sample
mean to be the same as, smaller than, or larger than the sample median and give a
brief reason for this.
(2 marks)
You may attempt as many questions as you wish from this section, but only your best three
questions will count towards your final mark. Each question carries 20 marks.
4. (a) 20% of men show early signs of losing their hair. 2% of men carry a gene that is
related to hair loss. 80% of men who carry the gene experience early hair loss.
i. What is the probability that a man carries the gene and experiences early hair
loss?
ii. What is the probability that a man carries the gene, given that he experiences
early hair loss?
(5 marks)
(b) Given two events A and B, state whether each of the following is true or false.
Use formulae or words to justify your answer. (No marks will be awarded without a
justification.)
i. If A and B are independent such that P (A) > 0 and P (B) > 0, then:
(b) The local fair has a game for children. Children pay $0.25 to draw a marble at random
from a covered jar which contains 100 marbles, of which 4 are silver, 1 is gold, and
the remaining marbles are white. If a silver marble is drawn, the child wins $1. If a
gold marble is drawn, the child wins $10. If a white marble is drawn, the child wins
nothing. What is the expected amount of winnings for this game, per play?
(4 marks)
(c) If X ∼ N (3, 2), Y ∼ N (2, 3) and X and Y are independent, calculate P (X < Y ).
(4 marks)
(d) In the general population 38% have type O+ blood. Use a suitable normal
approximation to find the probability of finding more than 100 people with type O+
blood in a random sample of 300 people.
(6 marks)
(b) A random variable X can take the values 0, 1 and 2. We know that:
3α α α
P (X = 0) = 1 − , P (X = 1) = and P (X = 2) = .
4 2 4
One observation is taken and we want to estimate α, such that 0 < α < 4/3.
Consider the estimators T1 = X and T2 = 2X(X − 1).
i. Show that they are both unbiased.
ii. Would you prefer estimator T1 or T2 ? Justify your choice.
(10 marks)
(c) Briefly explain the difference between an estimator and a point estimate.
(3 marks)
2 ) and N (µ , σ 2 ),
(d) Independent samples of size m and n are drawn from N (µX , σX Y Y
respectively. Give the sampling distribution of X̄ − Ȳ .
(3 marks)
Store A Store B
x̄ 34.3 days 38.6 days
s 2.5 days 3.4 days
n 41 31
(b) A UK government agency carries out a large-scale random survey of public attitudes
towards the recession. 83 of the 500 workers surveyed indicated they were worried
about losing their job. Newspaper reports claim 20% of workers fear losing their
job. Is such a high percentage claim justified? State and carry out an appropriate
hypothesis test at two levels and explain your results.
(7 marks)
(b) It is assumed that there is a linear relationship between the yield obtained from apple
trees and the amount of fertiliser supplied to them. In order to test this assumption,
nine apple trees of the same type were randomly selected and supplied weekly with
a fixed quantity (x grams) of fertiliser. The yield of each apple tree (y kilograms) was
recorded.
Tree 1 2 3 4 5 6 7 8 9
x 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
y 3.9 4.3 5.5 6.4 6.9 7.1 7.3 7.7 8.0
1. Sample statistics
• Sample variance:
n
2 1 X Sxx
s = (xi − x̄)2 = .
n−1 i=1
n−1
2. Probability theory
P (A ∩ B)
P (A | B) =
P (B)
or:
P (A | B) P (B) = P (A ∩ B) = P (B | A) P (A).
P (A) = P (A | B) P (B) + P (A | B c ) P (B c )
or:
x−1
Negative binomial π r (1 − π)x−r for x = r, r + 1, r + 2, . . . r/π r(1 − π)/π 2
r−1
(x − µ)2
1
Normal √ exp − for −∞ < x < ∞ µ σ2
2πσ 2 2σ 2
n
P √
µ (σ known) x̄ = xi /n σ/ n
i=1
n
P √
µ (σ unknown) x̄ = xi /n s/ n
i=1
p
π p = r/n C.I.: p(1 − p)/n
p
H.T.: π0 (1 − π0 )/n
(n1 −1)s2 2
1 +(n2 −1)s2
σ12 = σ22 s2p = n1 +n2 −2
—
p
π1 − π2 p1 − p2 = r1 /n1 − r2 /n2 C.I.: p1 (1 − p1 )/n1 + p2 (1 − p2 )/n2
p
H.T.: p(1 − p)(1/n1 + 1/n2 )
1
Residual variance s2 = 2
(Syy − Sxy /Sxx ) σ2
n−2
1
or s2 = (Syy − βb12 Sxx )
n−2
√ √
Under H0 : ρ = 0, the test statistic r n − 2/ 1 − r 2 ∼ tn−2 .
[END OF PAPER]