06 Stat Est
06 Stat Est
Distributions
Mahbub Latif, PhD
January 2024
Plan
Point estimate
Sampling distributions
Sampling distribution of sample mean
Sampling distribution of sample proportion
2
Statistical inference
Descriptive statistics deal with describing observations of a sample, e.g. sample men, sample
variance, etc. are tools of descriptive statistics
Inferential statistics deal with making a statement (conclusion) about a population
characteristic using the infromation obtained from a sample
There are two methods of statistical inference
Estimation and test of hypothesis
Two methods of estimation
point estimation
interval estimation (confidence interval)
3
Point Estimate
4
Parameters
5
Example (Machine breakdown)
Let μ and σ be mean and variance of probability distribution of milk contents of a container
2
6
Statistics
Statistic is a property of a sample, e.g. sample mean, sample variance, etc. are example of
statistics
Statistic is a function of sample observation and it can be used to estimate unknown
parameters
Statistics are random variables whose observed values can be calculated from a set of
observed data observations.
7
Statistics
n 2
∑ (xi − x̄)
2 i=1
s =
n − 1
8
Estimation
A point estimate θ^ may not be exactly equal to the corresponding parameter θ, but a good
point estimator is a good indicator of the actual value of the parameter
A point estimate can only be as good as the data set from which they are calculated, e.g.,
whether sample is randomly selected, representative of the population, etc.
9
The relationship between a point estimate θ^ and a parameter θ
10
Estimation of the population mean
11
Estimation (Sample I)
Sample observations: 72.3, 77.7, 82.1, 70.8, 81.6, 80.2, 80.7, 88.9, 70.2, 90.1
Point estimates: x̄ = μ
^ = 79.46 and s
2
= σ
^
2
= 47.9
12
Estimation (Sample II)
Sample observations: 75, 81.5, 73.3, 92.8, 82.6, 73.4, 83.9, 85.9, 84.6, 77.6
Point estimates: x̄ = μ
^ = 81.06 and s
2
= σ
^
2
= 39.11
13
Estimation of population proportion
operator misuse
Suppose a sample of n machine breakdowns is recorded, of which x are due to operator
0
misuse
14
Estimation of population proportion
x0
^ =
p 0
n
^ =
p
13
= 0.28
type Frequency
0
46 Electrical 9
Mechanical 24
Misuse 13
15
Summary
2
1
2 2
σ
^ = s = ∑(xi − x̄)
n − 1
i=1
16
Summary
17
Sampling distribution
18
Sampling distribution
19
Sampling distribution of a sample mean
For a large n, the sampling distribution of sample mean x̄ follows a normal distribution
¯ 2
^ = X
μ ∼ N (μ, σ /n)
Note this result is valid for any population, e.g. skewed or bell-shaped
20
Sampling distribution of a sample mean
The standard deviation (sd) of a sampling distribution is known as the standard error (se)
The standard error of sample mean
σ
se(x̄) =
√n
√n(X̄ − μ)
= ∼ N (0, 1)
σ
21
Sampling distribution of a sample mean
22
Exponential distribution
23
Normal distribution
24
Exercise 7.3.24
Suppose that components have weights that are normally distributed with μ = 341 and
σ = 2.
What is the probability that the experimenter's estimate of μ will be less than 341.5?
25
Exercise 7.3.24
^ = x̄ = ∑ x/20
μ
341.5 − 341
^ < 341.5) = Φ(
P (μ )
2/√20
= Φ(1.12)
= 0.8686
26
Exercise 7.3.2
Consider a sample of X 1, … , Xn of normally distributed random variables with mean μ and
variance σ = 1.
2
27
t-distribution
28
t-distribution
29
Comparison between t and standard normal distributions
30
P (t(1) > 6.314) = 0.05
31
Exercise 7.3.9
The breaking strengths of 35 pieces of cotton thread are measured.
The sample mean is x̄ = 974.3 and the sample variance is s 2
= 452.1 .
Construct a point estimate of the average breaking strength of this type of cotton thread.
What is the standard error of your point estimate?
32
Sampling distribution of a sample proportion
p(1 − p)
^ ∼ N (p,
p )
n
p(1 − p)
^) = √
se(p
n
33
Sampling distribution of a sample proportion
A coin that is suspected of being biased is tossed many times in order to investigate the
possible bias. Consider the following two scenarios:
Scenario I: The coin is tossed 100 times and 40 heads are obtained
Scenario II: The coin is tossed 1000 times and 400 heads are obtained
What is the difference, if any, between the interpretations of these two sets of experimental
results?
34
Sampling distribution of a sample proportion
.4(1 − .4)
^) = √
se(p = 0.0024
100
Scenario II
.4(1 − .4)
^) = √
se(p = 0.0002
1000
35
Exercise 7.3.7
Consider a sample X , … , X of normally distributed random variables with mean μ.
1 n
36
Exercise 7.3.21
Unknown to an experimenter, when a coin is tossed there is a probability of p = 0.63 of
obtaining a head.
The experimenter tosses the coin 300 times in order to estimate the probability p.
What is the probability that the experimenter's point estimate of p will be within the interval
(0.62, 0.64)?
37
Exercise 7.3.8
In a consumer survey, 234 people out of a representative sample of 450 people say that they
prefer product A to product B.
Let p be the proportion of all consumers who prefer product A to product B.
Construct a point estimate of p.
What is the standard error of your point estimate?
38
Exercise 7.3.20
The capacitances of certain electronic components have a normal distribution with a mean
μ = 174 and a standard deviation σ = 2.8.
39
Exercise 7.3.23
A scientist reports that the proportion of defective items from a process is 12.6%. If the
scientist’s estimate is based on the examination of a random sample of 360 items from the
process, what is the standard error of the scientist's estimate?
Exercise 7.3.34
Consider the data set 7, 9, 14, 15, 22. Obtain the standard error of the sample mean.
40
Exercise 7.7.18
The probability that a medical treatment is effective is 0.68, unknown to a researcher. In an
experiment to investigate the effectiveness of the treatment, the researcher applies the
treatment in 140 cases and measures whether the treatment is effective or not.
What is the probability that the researcher's estimate of the probability that the medical
treatment is effective is within 0.05 of the correct answer?
41