Edexcel IAL Statistics 3
Edexcel IAL Statistics 3
le n> 73719, soyouneed n= 74 25. v25 © Avwidth of 15 + 1532 re15 From the table on page 135 you find that PZ < 15) = 09332 and so PZ > 15) = PZ <-15) 1- 09332 o.0ces, fe) ooces o.0ces aS 15 i 1 A random sample of size 9 is taken from a normal distribution with variance 36. The sample mean is 128 a Find a 95% confidence interval for the mean 1 of the distribution. b Find a 99% confidence interval for the mean 1 of the distribution. 2 A random sample of size 25 is taken from a normal distribution with standard deviation 4. The sample mean is 85. a Find a 90% confidence interval for the mean j of the distribution. b Find a 95% confidence interval for the mean 1 of the distribution. 3 A.95% confidence interval is given by (25.61, 27.19) Calculate a 99% confidence intervalEM arse} STUN aaa Ly 4A normal distribution has standard deviation 15. Estimate the sample size required if the following confidence intervals for the mean should have width of less than 2. a 90% b 95% © 99% ‘A railway company is studying the number of seconds that express trains are late to arrive. Previous surveys have shown that the times are normally distributed and that the standard deviation is 50. A random sample of 200 trains was selected and gave rise to a mean of 310 seconds late. a Find a 90% confidence interval for 1, the mean number of seconds that express trains are late (marks) Five different independent random samples of 200 trains are selected, and each sample is used to generate a different 90% confidence interval for b Find the probability that exactly three of these confidence ED se siabie cama intervals contain j @marks) — (BEEmiatvaetesd TaN 6 Amy is investigating the total distance travelled by vans in current use. The standard deviation can be assumed to be 15000 km. In a random sample, $0 vans were stopped and their mean distance travelled was found to be 75 872 km. Amy suspects that the population is normally distributed, but claims that she can still use the normal distribution to find a confidence interval for 1. Find a 90% confidence interval for the mean distance travelled by vans in current use. (marks) 7 Itis known that each year the standard deviation of the marks in a certain examination is 13.5 but the mean mark 1 will fluctuate, An examiner wants to estimate the mean mark of all the candidates on the examination but he only has the marks of a sample of 250 candidates, which gives a sample mean of 68.4 a What assumption about the candidates must the examiner make in order to use this sample mean to calculate a confidence interval for j? (1 mark) b Assuming that the above assumption is justified, calculate a 95% confidence interval for 4 (3 marks) Later, the examiner discovers that the actual value of « was 65.3, © What conclusions might the examiner draw about his sample? (2 marks) 8 A student calculated 95% and 99% confidence intervals for the mean of a certain population but failed to label them. The two intervals were (22.7, 27.3) and (23.2, 26.8). a State, with a reason, which interval is the 95% one. (1 mark) Estimate the standard error of the mean in this case, (marks) € What was the student’s unbiased estimate of the mean jin this case? (2 marks) © _% The director of a company has asked for a survey to estimate the mean expenditure of customers on electrical appliances. In a random sample, 100 people were questioned and the research team presented the director with a 95% confidence interval of ($128.14, $141.86). The director says that this interval is too wide and wants a confidence interval of total width S10. a Using the same value of X, find the confidence limits in this case, (3 marks) b Find the level of confidence for the interval in part a. (2 marks)ESTO gs ae COGaitoky ®w @®u ‘The managing director is still not happy and now wishes to know how large a sample would be required to obtain a 95% confidence interval of total width no greater than $10. ¢ Find the smallest size of sample that will satisfy this request. (3 marks) ‘A factory produces steel sheets whose masses are known to be normally distributed with a standard deviation of 2.4kg. A random sample of 36 sheets had a mean mass of 31.4kg. Find 99% confidence limits for the population mean, (Gmarks) A machine is set up to pour liquid into cartons in such a way that the amount of liquid poured on each occasion is normally distributed with a standard deviation of 20 ml Find 99% confidence limits for the mean amount of liquid poured if a random sample of 40 cartons had an average content of 266m (3 marks) a The error made when a certain instrument is used to measure the body length of a butterfly of a particular species is known to be normally distributed with mean 0 and standard deviation | mm, Calculate, to 3 decimal places, the probability that the size of the error made when the instrument is used once is less than 0.4mm, (marks) b Given that the body length of a butterfly is measured 9 times with the instrument, calculate, to 3 decimal places, the probability that the mean of the 9 readings will be within 0.5mm of the true length, (GB marks) ¢ Given that the mean of the 9 readings was 22.53mm, determine a 98% confidence interval for the true body length of the butterfly (3 marks) ®i1 The masses of bags of lentils, X’kg, have a normal distribution with unknown mean jkg and a known standard deviation akg. A random sample of 80 bags of lentils gave a 90% confidence interval for j of (0.4533, 0.5227). a Without carrying out any further calculations, use this confidence interval to test whether = 0.48, State your hypotheses clearly and write down the significance level you have used. (3 marks) A second random sample of 120 of these bags of lentils had a mean mass of 0.482 kg. b Calculate a 95% confidence interval for j« based on this second sample. (G marks) The lengths of the tails of mice in a pet shop are assumed to have unknown mean j1 and unknown standard deviation o. A random sample of 20 mice is taken and the length of their tails recorded. The sample is represented by Xi, X35... Xoo a State whether or not the following are statistics. Give reasons for your answers. | 2X, + Xy ~s3 (4 marks) b Find the mean and variance of (G marks)EC aie} STUN en aa) © _3 The breaking stresses of elastic bands are normally distributed. A company uses bands with a mean breaking stress of 46.50 N. ‘A new supplier claims that they can supply bands that are stronger, and provides a sample of 100 bands for the company to test. The company checked the breaking stress X for each of these 100 bands and the results are summarised as follows: n=100) | Dx=47l5 Dx? =222910 a Find an approximate 95% confidence interval for the mean breaking stress of these new rubber bands. (3 marks) b Do you agree with the new supplier, that they can supply bands that are stronger? (2 marks) 4 Oneach of 100 days, a scientist took a sample of 1 litre of water from a particular place along, a river, and measured the amount, Ymg, of chlorine in the sample, The results she obtained are shown in the table. ¥ 1 3[4[slel[7[s]9 Number of days | 4 | 8 | 20 [22 | 16) 13| 10 | 6 a Estimate the mean amount of chlorine present per litre of water, and estimate, to 3 decimal places, the standard error of this estimate, (3 marks) b Obtain approximate 98% confidence limits for the mean amount of chlorine present per litre of water. (3 marks) Given that measurements at the same point under the same conditions are taken for a further 100 days, € estimate, to 3 decimal places, the probability that the mean of these ‘measurements will be greater than 4.6 mg per litre of water. (3 marks) © § Theamount, to the nearest mg, of a certain chemical in particles inthe air at a weather station was measured each day for 300 days, The results are shown in the table Amount of chemical (mg) | 12 | 13 | 14 | 15 | 16. Number of days s | 42 [aio [a1 [12 Estimate the mean amount of this chemical in the air, and find, to 2 decimal places, the standard error of this estimate. (3 marks) @®) 6 Occasionally, a firm manufacturing furniture needs to check the mean distance between pairs of holes drilled by a machine in pieces of wood to ensure that no change has occurred. It is known from experience that the standard deviation of the distance is 0.43 mm. The firm intends to take a random sample of size n, and to calculate a 99% confidence interval for the mean of the population. The width of this interval must be no more than 0.60 mm. Calculate the minimum value of 2. (4 marks) © 7 The times taken by five-year-old children to complete a certain task are normally distributed with a standard deviation of 8.0 s. In a random sample, 25 five-year-old children from school 4 were given this task and their mean time was 44.25, a Find 95% confidence limits for the mean time taken by five-year-old cl school 4 to complete this task, (3 marks) ren fromESTO gs ae COGaitoky ADIPTIVE EARNING ‘The mean time for a random sample of 20 five-year-old children from school B was 40.95. ‘The headteacher of schoo! B concluded that the overall mean for school B must be less than that of school 4. Given that the two samples were independent, b test the headteacher’s conclusion using a 5% significance level. State your hypothes clearly. (6 marks) ‘The random variable X is normally distributed with mean p and variance 0°, a Write down the distribution of the sample mean ¥ of a random sample of size n. (1 mark) b State, with a reason, whether this distribution is exact or is an estimate. (1 mark) An efficiency expert wishes to determine the mean time taken to drill a fixed number of holes in a metal sheet. © Determine how large a random sample is needed so that the expert can be 95% certain that the sample mean time will differ from the true mean time by less than 15 seconds. ‘Assume that itis known from previous studies that o = 40 seconds, (4marks) ‘A man regularly uses a train service which should arrive in Zurich at 09:31. He decided to test this stated arrival time. Each weekday for a period of 4 weeks, he recorded the number of minutes that the train was late on arrival in Zurich. If the train arrived early then the value of X was negative. His results are summarised as follows: n=20) Yv=180 Dx? =103.21 a Calculate unbiased estimates of the mean and variance of the number of minutes, late of this train service. (Smarks) The random variable ¥ represents the number of minutes that the train is late on arriving in Zurich. Records kept by the railway company show that over fairly short periods, the standard deviation of is 2.5 minutes. The man made two assumptions about the distribution of Vand the values obtained in the sample and went on to calculate a 95% confidence interval for the mean arrival time of this train service, b State the two assumptions, (2 marks) ¢ Find the confidence interval. (3 marks) 4 Given that the assumptions are reasonable, comment on the stated arrival time of the service. (1 mark) The random variable ¥ is normally distributed with mean and variance a? a Write down the distribution of the sample mean ¥ of a random sample of size n. (1 mark) b Explain what you understand by a 95% confidence interval. (marks) A garage sells both leaded and unleaded fuel. The distribution of the values of sales for each type is normal, During 2010, the standard deviation of individual sales of each type of fuel was £3.25, The mean of the individual sales of leaded fuel during this time was £8.72. A random sample of 100 individual sales of unleaded fuel gave a mean of £9.71 Caleulate: € an interval within which 90% of the sales of leaded fuel will lie (3 marks) da 95% confidence interval for the mean sales of unleaded fuel. (GB marks) ‘The mean of the sales of unleaded fuel for 2009 was £9.10.Ce arse} STUN aaa Ly € Using a 5% significance level, investigate whether there is sufficient evidence to conclude that the mean of all the 2010 unleaded sales was greater than the mean of the 2009 sales. (Gmarks) {Find the size of the sample that should be taken so that the garage owner can be 95% certain that the sample mean of sales of unleaded fuel during 2010 will differ from the true mean by less than £0.50. (Amarks) G@®) 11 a Explain what is meant by a 98% confidence interval for a population mean, (marks) The lengths, in cm, of the leaves of oak trees are known to be normally distributed with variance 1.33. 0m?, ‘A sample of 40 oak tree leaves is found to have a mean of 10.20em. b Estimate, giving your answer to 3 decimal places, the standard error of the mean. (2 marks) © Use this value to estimate 95% confidence limits for the mean length of the population of oak tree leaves, giving your answer to 2 decimal places, (B marks) 4 Find the minimum size of the sample of leaves which must be taken if the width of the 98% confidence interval for the population mean is at most 1.50cm. (A marks) GP) 12 a Write down the mean and the variance of the distribution of the means of all possible samples of size n taken from an infinite population having mean js and variance o?, (2 marks) Describe the form of this distribution of sample means when: i nislarge the distribution of the population is normal, (2marks) The standard deviation of all the til receipts of a supermarket during 2014 was £4.25. € Given that the mean of a random sample of 100 of the till receipts is £18.50, obtain an approximate 95% confidence interval for the mean of all the till receipts during 2014. (3 marks) Find the size of sample that should be taken so that the management can be 95% confident that the sample mean will not differ from the true mean by more than £0.50. (GB marks) € The mean of all the till receipts of the supermarket during 2013 was £19.40. Using a 5% significance level, investigate whether the sample in part a provides sufficient evidence to conclude that the mean of all the 2014 till receipts is different from that in 2013. (6 marks) Records of the diameters of spherical metal balls produced on a certain machine show that, the diameters are normally distributed with mean 0.824em and standard deviation 0.046cm. ‘Two hundred samples are randomly chosen, each consisting of 100 metal balls. a Calculate the expected number of the 200 samples having a mean diameter less than 0.823em, (2 marks) Ona certain day, it was believed that the machine was faulty. It may be assumed that if the ‘machine is faulty, it will change the mean of the diameters without changing their standard deviation, On that day, a random sample of 100 metal balls had mean diameter 0.834 em, b Determine a 98% confidence interval for the mean diameter of the metal balls being produced that day. (3 marks) € Hence state whether or not you would conclude that the machine is faulty on that day given that the significance level is 2° (3 marks)CST Une gy Pa Ue CHAPTER @®) 14 A doctor claims that there is a higher mean heart rate in people who always drive to work compared to people who regularly walk to work. She measures the heart rates Y of 30 people who always drive to work and 36 people who regularly walk to work. Her results are summarised in the table below. Drivetowork | 30 | 52_| 60.2 Walktowork | 36 | 47 | S58 a Test, at the 5% level of significance, the doctor's claim. State your hypotheses clearly. (6 marks) b State any assumptions you have made in testing the doctor’s claim, (marks) The doctor decides to add another person who drives to work to her data. ‘She measures the person's heart rate and finds X = 55. ¢ Find an unbiased estimate of the variance for the sample of 31 people who drive to work. Give your answer to 3 significant figures (4marks) Cars ‘ADAPTIVE CERRWINS Bead a 1. If Xisa random variable, then a random sample of size m will consist of n observations of the random variable X, which are referred to as Xy, Xs, Ny, ..., X, where the X,: + are independent random variables + each have the same distribution as X. statistic T'is defined as a random variable consisting of any function of the X, that involves ‘no other quantities, such as unknown population parameters. 2 The sampling distribution of a statistic T'is the probability distribution of 7 3. Astatistic that is used to estimate a population parameter is called an estimator and the particular value of the estimator generated from the sample taken is called an estimate. aCeara STUN aaa Ly ‘4 Ifa statistic Tis used as an estimator for a population parameter # and E(7) = 4, then Tis an unbiased estimator for 0, 5 Ifa statistic Tis used as an estimator for a population parameter #, then the bias is E(7’) — 8. For an unbiased estimator, the bias is 0 6 An unbiased estimator for 0” is given by the sample variance S? where: wayai-¥e 7 The standard deviation of an estimator is called the standard error of the estimator. 8 When using the sample mean ¥, you can use the following result for the standard error: Standard error of ¥=-2 or van 9 Aconfidence interval for a population parameter @is a range of values defined so that there is a specific probability that the true value of the parameter lies within that range. 196 x7, ¥ +196 x 40 4.95% confidence interval forthe population mean jis wn a 11 The width of a confidence interval is the difference between the upper confidence limit and the lower confidence limit. This is 2 x z x 2, where z is the relevant percentage point from ai the standard normal distribution, for example 1.96, 1.6449, etc.Palast as Review exercise © 1 A researcher is hired by a cleaning company to survey the opinions of employees on a proposed pension scheme. The company employs 55 managers and 495 cleaners, a Explain what is meant by a census and give one disadvantage of using it in this contest. To collect data, the researcher decides, to give a questionnaire to the first 50 cleaners to leave at the end of the day. b State the sampling method used by the researcher. © Give two reasons why this method is likely to produce biased results. (2) 4 Explain briefly how the researcher could select a sample of 50 employees using: ia systematic sample a stratified sample, Q ‘€ Statistics 3 Sections 1, 1.2, 1.3, @ a 2. Describe one advantage and one disadvantage of: a quota sampling b simple random sampling ‘© Statistics 3 Sections 1.1, 14 3. Mrs Hilyard wants to select a sample of 50 of her students to fill in a questionnaire. The school has a record of all 500 students, listed alphabetically and numbered 1 to 500, Mrs Hilyard uses the same random ‘number table that appears on page 144 of this textbook. Starting with the top-left hand corner and working across, Mrs Hilyard chooses three random numbers. The first wo suitable numbers are 384 and 100. @s5 a What are the next two suitable sumbers? Mrs Hilyard decides to take a systematic sample instead, using the same list. b Explain why a systematic sample may not give a sample that represents the proportion of boys and girls in the school. © Which sampling technique should Mrs Hilyard use? € Statistics 3 Sections 1, 1.2, 1.3, A hotel has 320 rooms, of which 180 are classified as standard, 100 are classified as premier, and 40 are classified as executive. ‘The manager wants to obtain information about room usage in the hotel by taking a 10% sample of the rooms. Explain how the manager should obtain a stratified sample + Statistics 3 Section 1.2 At an amusement park, the duration R seconds of a ride on the rollercoaster has the normal distribution N(82, 3°). The duration F of a ride on the Ferris Wheel has the normal distribution N(238, 7°). Alice rides on the rollercoaster and the Ferris Wheel. a Find the probability that her ride on the Ferris Wheel is less than three times as long as her ride on the rollercoaster. © b State one assumption you have made and comment on its validity @ Paul rides on the rollercoaster three times in a row. The random variable D represents the total duration of the three rides. ¢ Find the distribution of D. 8 cc}Given that Alice starts a ride on the Ferris Wheel at the same time as Paul starts his three rides on the rollercoaster, 4 find the probability that Alice and Paul's rides finish within 10 seconds of one another, S) Statistics 3 Section 21 A workshop makes two types of electrical resistor. ‘The resistance, X ohms, of resistors of ‘Type A is such that ¥ ~ N(20, 4), The resistance, Y ohms, of resistors of Type Bis such that Y ~ N10, 0.84). When a resistor of each type is connected into a circuit, the resistance R ohms of the circuit is given by R= X'+ ¥, where Xand Yare independent. Find: a E(R) Q b Var(R) @ © P(28.90 < R< 32.64) 8 Statisties 3 Section 2.1 A simple random sample X,, X, X; is taken from a normal distribution with ‘mean and standard deviation o. yp kewe hs Given that ¥ =" an aha (HEME where k is a contant, find the value of k, giving your answer correct to 3 sf ‘© Statistics 3 Section 21 > ¥+ko)=02 Ina bag, there are five coins worth | RMB, three coins worth 0.5 RMB and two coins worth 0.1 RMB. Two coins are taken from the bag without replacement and the mean value calculated. Write down the sample distribution for the mean value, ‘© Statistics 3 Section 3.1 fala estat 9 The random variable Cis defined as C=2445B where 4 ~ N(IS, 1.5%) and B~ N(j, 22) and A and Bare independent Given that P(C < 83.5) = 0.9, find the value of 1, giving your answer to 2 decimal places. Statistics 3 Section 2.1 © 10 The random variables 4,, A>, As and Ay each have the same distribution as 4, where 4 ~ N(24, 42), The random variable X has distribution Y ~ N(20, 3°) ‘The random variable B is defined as KD A where X, A), Ao, Asand Ay are independent. Find P(B< 170| B> 156) est B istics 3 Section 2.4, 11 The random variable has a continuous uniform distribution over the interval [a 3, 5a-9] , where a is a constant. ‘The mean of a random sample of size n taken from this distribution is Y. a Show that is a biased estimator for cvand calculate the bias of ¥ when used as an estimator for a. b Given that ¥ = K+ 4 is an unbiased estimator, find the value of k. A random sample of 10 values of is taken and the results are as follows: 17 425 32.2 423 46 45 46.3 30.7 117 499 © Use the sample to estimate the maximum value that ¥’ can take. ‘€ Statistics 3 Sections 3.1, 3.2 © 12 The weights of adult men are normally distributed with a mean of 84kg and a standard deviation of 11 kg. a Find the probability that the total weight of 4 randomly chosen adult men is less than 350 kg, 8)Palast as ‘The weights of adult women are normally distributed with a mean of 62kg and a standard deviation of 10kg. b Find the probability that the weight of a randomly chosen adult man is less than one and a half times the weight of a randomly chosen adult woman. (4) + Statistics 3 Section 3.1 The random variable D is defined as D=A-3B+4C where A ~ N(5, 2%), B~ N(7, 33) and C~NO,4), and 4, Band Care independent. a Find PD < 44) ‘The random variables B,, By and By ate independent and each has the same distribution as B. @ ‘The random variable Wis defined as =4-SB,44AC b Find P(Y > 0) @ Statistics 3 Section 21 A manufacturer produces two flavours of soft drink: cola and lemonade. The weights, Cand L, in grams, of randomly selected cola and lemonade cans are such that C ~~ N(350, 8) and L ~ NG45, 17). a Find the probability that the weights of two randomly selected cans of cola will differ by more than 6g, @ ‘One can of each flavour is selected at random, b Find the probability that the can of cola weighs more than the can of lemonade, Cans are delivered to shops in boxes of 24 cans. The weights of empty boxes are normally distributed with mean 100g and standard deviation 2. @ 5 @v € Find the probability that a full box of cola cans weighs between 8.51 kg and 8.52kg. @ 4 State an assumption you made in your calculation in part e. @ + Statistics 3 Section 3.1 Ina trial of diet 4, a random sample of 80 participants was taken to record their weight loss, xkg, after their first week of using the diet. The results are summarised as follows: Yxs361.6 Ex = 1753.95 a Find unbiased estimates for the mean and variance of weight lost after the first week of using diet 4 The designers of dict A believe it can achieve a greater mean weight loss after the first week than an existing diet B. A random sample of 60 people used diet B. After the first week they had achieved a mean weight loss of 4.06 kg, with an unbiased estimate of variance of weight loss of 2.50 kg? b Test, at the 5% level of significance, whether or not the mean weight loss after the first week using diet 4 is greater than that using diet B. State your hypotheses clearly () ¢ Explain the significance of the central, limit theorem to the test in part b. (1) 4 State an assumption you have made in carrying out the test in partb. (1) + Statistics 3 Sections 3.1, 3.4 @ ‘A random sample of the daily sales (in Rand) of a small company is taken and, using tables of the normal distribution, 4.99% confidence interval for the mean daily sales is found to be (123.5, 154.7). Find a 95% confidence interval for the ‘mean daily sales of the company. (6) « Statistics 3 Section 3.2 45@®u A machine produces metal containers. ‘The masses of the containers are normally distributed. A random sample of 10 containers was taken and the ‘mass of each container was recorded to the nearest 0.1 kg. The results were as follows: 49.7 50.3 510 49.5 49.9 50.1 50.2 50.0 49.6 49.7 a Find unbiased estimates of the mean and variance of the masses of the population of metal containers, ‘The machine is set to produce metal containers whose masses have a population standard deviation of 0.5 kg. b For the population mean, find: i 95% confidence interval fi a 99% confidence interval ag © Statistics 3 Sections 3.1, 3.2 ‘The drying times of paint can be assumed to be normally distributed. A paint ‘manufacturer paints 10 test areas with a new paint. The following drying times, to the nearest minute, were recorded: 82 98 140 110 90 125 150 13070110 a Calculate unbiased estimates for the ‘mean and the variance of the population of drying times of this paint @ Given that the population standard deviation is 25, b find a 95% confidence interval for the ‘mean drying time of this paint. (5) Fifteen similar sets of tests are done and the 95% confidence interval is determined for each set. ¢ Find the probability that all 15 of these confidence intervals contain the population mean. Q «Statistics 3 Sections 3.1, 3.2 @®vw fala estat Some biologists were studying a large group of birds. A random sample of 36 were measured and the wing length, xmm, of each bird was recorded. The results are summarised as follows: Yox= 6046 Oe = 1016338 a Calculate unbiased estimates for the mean and the variance of the wing lengths of these birds 8 Given that the wing lengths are assumed to be normally distributed and that the standard deviation of the wing lengths of this particular type of bird is actually 5.1mm, b find a 99% confidence interval for the mean wing length of the birds from this group. 8) 4 Statistics 3 Sections 3.1, 3.2 A computer company repairs large sumbers of PCs and wants to estimate the mean time taken to repair a particular fault, Five repairs are chosen at random. from the company’s records and the times taken, in seconds, ate as follows: 205 310 405 195320 a Calculate unbiased estimates of the mean and the variance of the population of repair times from which this sample has been taken. Itis known from previous results that the standard deviation of the repair time for this fault is 100 seconds and that the repair time is normally distributed. The company manager wants to ensure that there is a probability of at least 0.95 that the estimate of the population mean lies within 20 seconds of its true value. b Find the minimum sample size required. @) 6) Statistics 3 Sections 3.1, 3.2Palast as 21 A company makes individual slices of cheesecake, The weight of each slice is normally distributed with mean 135 g and standard deviation 3 g It is possible to buy a box of 12 individual slices of cheesecake. The box has a weight which is normally distributed with weight 100 g and standard deviation 6 g. a Find the probability that the weight of the 12 slices and the box is greater than L7kg. b- What assumptions about the weights of the slices of cheesecake are you making? & Statistics 3 Sections 3.1, 32 22, Maike found a 95% confidence interval to be (14.6904, 15.7096) Unfortunately, he had lost his original information, but he did remember that the standard deviation was 1.3 Calculate the sample size that Maike used to create this confidence interval Statisties 3 Section 32 23. A c% confidence interval was calculated using 36 observations from a data set which is normally distributed. The value of s was 3.6 ‘The confidence interval calculated was (12,9636, 16.2364). a Find the value of ¥. b Find the value of « + Statistics 3 Section 3.2 ‘Arandom sample of three independent variables 2%, Xeand is taken from a distribution with mean j and variance of a Show that 2X; - 354 estimator fr ‘An unbiased estimator for sis given by f= aX;+ bXpwhere a and b are constants. b Show that Var(i) = 2a ~2a+ 1a? Hence determine the value of a and the value of b for which fi has minimum variance. “Statistics 3 Section 3.1 5 san unbiased a7= va ee r Haye a ba ae ee Et = TESTING ta eae) © Understand and apply the central limit theorem to approximate the sample mean of a random variable ¥ “+ pages 49-51 © Apply the central limit theorem to other distributions + pages 51-53 © Apply the central limit theorem in finding confidence “+ pages 53-54 on the mean > pages 54-59 means + pages 60-64 Understand beeen Be large sample pages 64-66 Arandom variable X"~ N(120, 8°). Find a P(¥>115) — b P(120< ¥<130) © auch that P(X'< a) A fair six-sided dice is rol uppermost fat entral limit theorem a E(Y) b Var(¥) about the distribution of t cP «Statist 3 Sections 6.5, 6.6 population is unki sticians use Robin flips a fair . d it to infer how likely the views of a sample the probability that the coin is flipped at least 12 are to be representative of the population times. + Statistics 2 Sect > Chapter review 4 Q10MOOR a) ses ©) The central limit theorem Ifyou takearendom sample ofn observations from a normally distributed random variable X~ Nu.) then the sample mean ¥’is also normally distributed with ¥ ~ N(u, 7 In fact, this result is a special case of a more powerful result called the central limit theorem. This states that the mean of a large random sample taken from any random variable is always approximately normally distributed, This result is true without paying attention to the distribution of the original random variable. = The central limit theorem says that if X;, Xo, .... Xyis a random sample of size n froma “ 2 population with mean # and variance o%, then X is approximately ~ N{q, 7}. In general, the sample mean is only approximately distributed with N(y., 7). As n gets large, this approximation gets better. The variance of the sample mean also decreases as 7 gets large. You can say that for a large sample, the sample mean will be very close to the population mean, A six-sided dice is changed so that there are three faces marked 1, two faces marked 3 and one face marked 6. The dice is rolled 40 times and the mean of the 40 scores is recorded. a Find an approximate distribution for the mean of the scores, b Use your approximation to estimate the probability that the mean is greater than 3. cue a Let the random variable X re score ona single roll Find the mean and variance of the discrete Then the distribution of Xs: distribution € Statistics 1 Sections 6.3, 6.6 x 1 a] 6 rxven | 3 sl So: y= EX) = Ea = +3xde6xd and a? = Vart! = DvP =) +aexdrarxd- (3 ex 25 or B Now by the central limit theorem: ati Fw ~nl25, i) ol nal limit rTCe ase} eMC RS aU) all =) Vigo CEIEDD 01 do not need to apelye - PZ < 175. ‘continuity correction when using the central -09599 limit theorem, This is because the underlying, 401 distribution is the mean of the sample. Although this isa discrete random variable, it does not have o take integer values. Ittakes fractional values, and the gaps between values get smaller and smalleras-n gets larger. ons 1 The lengths of bolts produced by a machine have an unknown distribution with mean 3.03.em and standard deviation 0.20em. ‘A sample of 100 bolts is taken. a Estimate the probability that the mean length of this sample is less than 3cm. (3 marks) A second sample is taken. The probability that the mean of this sample is less than 3.cm needs to be less than 1%. b Find the minimum sample size required. (Smarks) © 2 A random variable ¥ has the discrete uniform distribution P(V=x)=4 x=1,2,3,4,5 40 observations are taken from X, and their mean ¥ is recorded. Find an estimate for PLY > 3.2) (6 marks) ©® 3 A fair dice is rolled 35 times. a Find the approximate probability that the mean of the 35 scores is more than 4, b Find the approximate probability that the total of the 35 scores is less than 100. 4 The 25 children in a class each roll a fair dice 30 times and record the number of sixes they obtain, Find an estimate of the probability that the mean number of sixes recorded for the class is less than 4.5. © 5 The random variable V has the probability x 0 2 3 3 distribution shown in the table. paso [or | x | & [03 a Find the value of &. (2 marks) A random sample of 100 observations of ‘is taken. Use the central limit theorem to estimate the probability that the mean of these observations is greater than 3, (6 marks) © Comment on the accuracy of your estimate. (mark)MOOR a) ses @® 6 A fair dice is rolled m times. Given that there is less than a 1% chance that the mean of all the scores differs from 3.5 by more than 0.1, find the minimum sample size. 7 The annual part-time salaries of employees at a large company have an unknown distribution with mean AUDS28,500 and standard deviation AUDS6800. ‘A random sample of 5 members of the senior management team is taken. ‘A researcher suggests that N(28 500, S") of the sample mean, a Give a reason why this is unlikely to be a good model (mark ‘A ssecond random sample of 15 employees from the whole company is taken. b Estimate the probability that the mean annual salary of these employees is: i less than AUDS25,000 ii between AUDS25,000 and AUDS30,000. (4marks) € Comment on the accuracy of your estimate. (mark) could be used to model the distribution GP 8 Anclectrical company repairs very large numbers of television sets and wishes to estimate the mean time taken to repair a particular fault It is known from previous research that the standard deviation of the time taken to repair this particular fault is 2.5 minutes, The manager wishes to ensure that the probability that the estimate differs from the true mean by less than 30 seconds is 0.95. Find how large a sample is required. (G marks) Applying the central limit theorem to other distributions You can use the central limit theorem to solve problems involving other distributions such as the binomial, the Poisson and the uniform distributions. A supermarket manager is trying to model the number of customers who visit her store each day. She observes that, on average, 20 new customers enter the store every minute, a Calculate the probability that fewer than 4 customers arrive in a given 15-second interval. b Use the central limit theorem to estimate the probability that in one hour no more than 1150 customers arrive ‘a Let Xrepresent the number of customers who arrive in one minute. Then X ~ Pot20) Let Y represent the number of customers who arrive in a 15-Second interval Then ¥ ~ PotS) PLY < 5} = 0.4408, from tablesCe ataiey eMC RS aU) b Consider a sample of GO observations taken from X. By the central limit theorem, X is approxim: or N[20, If T-< 1150 then ¥ < Problem-solving IF IX; is the sum of the observations from a sample of size n, then the sample mean is given by: =X Standaraising POY < 19:16. PZ < 144...) - PZ <1.44,..) 1-09251 0.0749 1 A random sample of 10 observations is taken from a Poisson distribution with mean 3 a Find the exact probability that the sample mean does not exceed 2.5. b Estimate the probability that the sample mean does not exceed 2.5 using the central limit theorem, and compare your answer to part a. © 2 A sample of size 20/is taken from a binomial distribution with = 10 and p = 0.2 Estimate the probability that the sample mean does not exceed 2.4 (marks) GP) 3 There are 20 children in a class. Each flips a biased coin 15 times. The probability of getting a head is 0.25. a Write down the expected number of heads that each child would get. (2 marks) b Find an estimate of the probability that the mean number of heads is at most 4. (3 marks) 4 A town is hit by three thunderstorms per month, on average. a Find the probability that there are four thunderstorms next month, (2 marks) b Use the central limit theorem to estimate the probability that over the course of a year, the average number of thunderstorms each month is at most 2.5 (4marks) 5 The continuous random variable ¥ is uniformly distributed over the interval fa-3,30+ 5] where a is a constant. 40 observations of X will be taken. Use the central limit theorem to find an approximate distribution for Y.