MilesK MAS183 Assessment2
MilesK MAS183 Assessment2
1.
(a)
(b)
P(had malaria/had sickle cell total)
P(0.06/0.23) = 0.26
(c)
P(no malaria/sickle cell total)
P(0.17/0.23) = 0.74
(d)
Statistical independence happens when one of the outcomes does not alter the chance of the
other. Information from the question states that ‘sickle cell appears to protect against
malaria’. To test for statistical independence, we use:
- P(A and B) = P(A) x P(B | A) and P(A | B) = P(A)
OR
- P(B) x P (A | B) and P(B | A) = P(B)
The probability rule of independence P(A and B) = P(A) x P(B) can be used
Malaria Present and Sickle Cell Positive = 0.06
0.0805 is not equal to 0.06 and therefore having malaria is not statistically independent of
having sickle cell
2.
For an observation to be a binomial distribution, five conditions need to be considered
- A definitive number of trials, n
- Only two possible outcomes: ‘success’ and ‘failure’
- The probability of success, p, is the same for each trial
- Trials succeed or fail independently of each other
- The variable under consideration, X, is the number of successes out of the n trials
3.
(a) what is the probability distribution of X?
X follows a binomial distribution as the n trials are independent of each other and have the
same probability of success.
Therefore, X ~ Bin(n,p)
= X ~ Bin(20, 0.70)
(b) calculate
i. The mean and standard deviation
Mean
µ = np
µ = 20 x 0.70
µ = 14
Standard deviation
σ = √ np (1 -p)
σ = √ (20 x 0.70) (1 -0.70)
σ = √ (14) (0.30)
σ = √ 4.2
σ = 2.0494
σ = √ np (1 -p)
σ = √ (20 x 0.70) (1 -0.70)
σ = √ (14) (0.30)
σ = √ 4.2
σ = 2.0494
(12,20,0.70)
[1] 0.2277282
P (X < 12) = 1 – P (X ≥ 12)
= 1 – 0.2277282
= 0.7723
P (X < 12) = 0.7723
It would not be unusual as the probability is relatively high, meaning it is likely to happen. It
is also within 2 standard deviations of the mean making it a likely event
4.
(a) What proportion of healthy adults have blood arsenic concentrations between 2 and
4 μg/dL?
Z=x-µ/σ
X ~ N (µ = 3.72, σ = 1.18)
Z = 2 – 3.72 / 1.18 = - 1.458
Z = 4 – 3.72 / 1.18 = 0.237
P (Z ≤ 1.4576) = 0.0721
P (Z ≤ 0.2372) = 0.5948
0.5948 - 0.0721 = 0.5227
0.5227 of healthy adults have blood arsenic concentrations between 2 and 4 μg/dL
(b) Choosing a healthy adult at random, what is the chance that their blood arsenic
concentration exceeds 6.2 μg/dL?
The chances of randomly selecting a healthy adult whose blood arsenic concentration exceeds
6.2 μg/dL is 0.0179
(c) What are the lower and upper limits of the middle 70% of blood arsenic
concentrations in healthy human adults?
Lower limit:
> qnorm(0.15, 3.72, 1.18)
[1] 2.497009
Upper limit:
> qnorm(0.15, 3.72, 1.18, FALSE)
[1] 4.942991
5.
(a)
Simple Random Samples (SRS) will be used where every case has the same chance of being
included in the sample and every sample has the chance of being selected.
It would be ideal to use a three-dimensional grid because it allows for measurements across
the surface as well as at 5 meters below. In addition, it ensures variance so that each square
has equal chances of being measured accurately. This also opens up the idea that sources of
nitrogen are not just from the farms on the lakeshore.
To ensure this validity of the method and the effectiveness of SRS, the lake should be divided
into 5x3 squares and numbered which assures the samples are selected at random. Then, the
numbers that corresponded to the grid on the lake would be picked using a random number
generator.
(b) As indicated above, two sampling methods are being considered. Which of these
should be preferred if the government agency is primarily concerned with –
(i) regulating fertiliser use on farms, or
Samples should be collected from the water sources mouths that feed into the lake
(ii) environmental management, including water quality in the lake.
Samples should be collected randomly from the lakes surface if the government
agency is concerned with environmental management
6. Calculate –
(a) the mean
µ = Σ P (X – x)
= 1.78
σ = √ σ²
σ² = ∑ x² P (X = x) - µ²
σ = √ σ²
σ = √ 0.4316²
= 0.6569627082
= 0.6570