
Notes 12

The document provides an overview of statistical inference, focusing on the concepts of population, sample, parameter estimation, confidence intervals, and hypothesis testing. It details methods for estimating parameters such as mean, variance, and proportion, as well as the use of maximum likelihood and method of moments for estimation. The document also outlines the process of hypothesis testing, including the formulation of null and alternative hypotheses, significance levels, and the calculation of test statistics.


STATISTICAL INFERENCE

POPULATION AND SAMPLE

Population = all elements of interest, characterized by a distribution F with some parameter θ.

Sample = the data X1, . . . , Xn, a selected subset of the population; n = sample size.

Examples of F: Bernoulli(p), Normal(µ, σ), Gamma(n, λ), Poisson(λ), etc.

Statistical Inference

Statistical inference = inference about the population based on a sample:

• Parameter estimation

• Confidence intervals

• Hypothesis testing

• Model fitting


Parameter Estimation

Statistic = any function of the data, W(X1, ..., Xn).

Estimator of θ = any statistic used to estimate the parameter θ.

Estimator θ̂ is unbiased if E(θ̂) = θ.

Standard error of an estimator is its standard deviation, Std(θ̂). It is estimated by Ŝtd(θ̂). It shows the accuracy and reliability of the estimator θ̂.


Estimation of a mean

Sample (X1, . . . , Xn) is collected from a population with E(X) = µ and Var(X) = σ².

Estimate the population mean θ = µ = E(Xi) by the sample mean

    X̄ = (1/n) ∑ Xi

Properties:

    E(X̄) = (1/n) ∑ E(Xi) = (1/n) ∑ θ = θ

    Var(X̄) = (1/n²) ∑ Var(Xi) = nσ²/n² = σ²/n

So X̄ is unbiased, and its standard error is

    SE(X̄) = Std(X̄) = σ/√n
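As a numeric illustration, here is a minimal Python sketch of the sample mean and its standard error; the population (Normal with µ = 10, σ = 2) and the sample size are hypothetical choices for the demo:

```python
import math
import random

random.seed(1)

# Hypothetical population: Normal(mu = 10, sigma = 2); draw a sample of size n
mu, sigma, n = 10.0, 2.0, 400
sample = [random.gauss(mu, sigma) for _ in range(n)]

# The sample mean X-bar estimates the population mean
x_bar = sum(sample) / n

# Standard error of X-bar with known sigma: sigma / sqrt(n)
se = sigma / math.sqrt(n)

print(f"X-bar = {x_bar:.3f}, SE(X-bar) = {se:.3f}")
```

With n = 400 the standard error is σ/√n = 2/20 = 0.1, so X̄ typically lands within a few tenths of µ.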

Estimation of a variance

Estimate the population variance θ = σ² = Var(Xi) by the sample variance

    S² = (1/(n−1)) ∑ (Xi − X̄)²

It is also unbiased: E(S²) = σ².

Then the standard error of X̄ is estimated by

    Ŝtd(X̄) = S/√n = √( ∑ (Xi − X̄)² / (n(n−1)) )
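A short Python sketch of the sample variance with its (n − 1) divisor and the resulting estimated standard error of X̄; the population values (σ = 3, mean 0) are hypothetical:

```python
import math
import random

random.seed(2)

# Hypothetical population with variance sigma^2 = 9
sigma, n = 3.0, 200
sample = [random.gauss(0.0, sigma) for _ in range(n)]

x_bar = sum(sample) / n

# Sample variance S^2 with the (n - 1) divisor, so that E(S^2) = sigma^2
s2 = sum((x - x_bar) ** 2 for x in sample) / (n - 1)

# Estimated standard error of X-bar: S / sqrt(n)
se_hat = math.sqrt(s2 / n)

print(f"S^2 = {s2:.3f}, estimated SE(X-bar) = {se_hat:.4f}")
```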


Estimation of a proportion

Sample (X1, . . . , Xn) is collected from a Bernoulli population with parameter p.

Estimate the population proportion p = E(Xi) by the sample proportion

    p̂ = (1/n) ∑ Xi = (number of Xi = 1) / n

This is a special case of a sample mean X̄:

    E(p̂) = p,    Var(p̂) = σ²/n = p(1 − p)/n

So p̂ is unbiased; its standard error is

    SE(p̂) = √( p(1 − p)/n ),

typically estimated by

    ŜE(p̂) = √( p̂(1 − p̂)/n )
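A quick Python sketch of the sample proportion and its estimated standard error; the true p = 0.3 and n = 1000 are hypothetical:

```python
import math
import random

random.seed(3)

# Hypothetical Bernoulli(p) sample
p, n = 0.3, 1000
sample = [1 if random.random() < p else 0 for _ in range(n)]

# The sample proportion is just the sample mean of 0/1 data
p_hat = sum(sample) / n

# Estimated standard error: sqrt(p-hat * (1 - p-hat) / n)
se_hat = math.sqrt(p_hat * (1 - p_hat) / n)

print(f"p-hat = {p_hat:.3f}, estimated SE = {se_hat:.4f}")
```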

General Methods of Estimation

1. Method of Moments

kth population moment: µk = E(X^k)

kth sample moment: Mk = (1/n) ∑ Xi^k

To estimate d parameters, solve the system of d equations

    M1 = µ1
    ...
    Md = µd

M1, . . . , Md are known from the sample; µ1, . . . , µd are functions of the unknown parameters.

Example: X1, . . . , Xn are Exponential(λ). Estimate λ.

The number of parameters is d = 1, so we need 1 equation:

    M1 = X̄;    µ1 = 1/λ

Solve:

    M1 = µ1  ⇒  X̄ = 1/λ  ⇒  λ̂mom = 1/X̄
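A numeric check of this method-of-moments estimate in Python; the true λ = 2 and the sample size are hypothetical:

```python
import random

random.seed(4)

# Hypothetical Exponential(lam) data; method of moments gives lam-hat = 1 / X-bar
lam, n = 2.0, 5000
sample = [random.expovariate(lam) for _ in range(n)]

x_bar = sum(sample) / n
lam_mom = 1.0 / x_bar

print(f"true lambda = {lam}, lambda-hat (method of moments) = {lam_mom:.3f}")
```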

2. Method of Maximum Likelihood

Maximize the probability (pmf, pdf) of seeing the actually observed data.

Implementation

Observe X1, ..., Xn from a pdf or pmf f(x | θ). Maximize

    f(X1, ..., Xn | θ) = ∏ f(Xi | θ)

in θ.

Simplification: maximize the log-likelihood

    ln f(X1, ..., Xn | θ) = ∑ ln f(Xi | θ)

Typically, compute

    (∂/∂θ) ln f(X1, ..., Xn | θ) = ∑ (∂/∂θ) ln f(Xi | θ),

equate it to 0, and solve for θ.

Example: X1, . . . , Xn are Exponential(λ)

    f(X1, ..., Xn | λ) = ∏ λe^(−λXi)

    ln f(X1, ..., Xn | λ) = ∑ ln(λe^(−λXi)) = n ln λ − λ ∑ Xi

    (∂/∂λ) ln f(X1, ..., Xn | λ) = n/λ − ∑ Xi =: 0

Solve for λ:

    λ̂mle = n / ∑ Xi = 1/X̄
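The closed-form MLE can be checked numerically by comparing the log-likelihood at 1/X̄ against nearby values; the true λ = 1.5 and sample size here are hypothetical:

```python
import math
import random

random.seed(5)

# Hypothetical Exponential(lam) data
lam_true, n = 1.5, 2000
sample = [random.expovariate(lam_true) for _ in range(n)]
x_bar = sum(sample) / n
total = sum(sample)

def log_lik(lam):
    # ln f(X1, ..., Xn | lam) = n ln(lam) - lam * sum(Xi)
    return n * math.log(lam) - lam * total

# Closed-form MLE from setting the derivative to zero
lam_mle = 1.0 / x_bar

# Sanity check: the closed form beats nearby candidate values
for d in (-0.2, -0.1, 0.1, 0.2):
    assert log_lik(lam_mle) > log_lik(lam_mle + d)

print(f"MLE = {lam_mle:.3f} (true lambda = {lam_true})")
```

The check works because the exponential log-likelihood is strictly concave in λ, with its unique maximum at 1/X̄.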

[Figure: density f(x) near a point x; the shaded area P{x − h ≤ X ≤ x + h} ≈ (2h) f(x) is the probability of observing "almost" X = x.]

Confidence Intervals

A 100(1 − α)% confidence interval is an interval that contains the parameter θ with probability 1 − α.

That is,

    P{ a ≤ θ ≤ b } = 1 − α,

where

    a = a(X1, ..., Xn) and b = b(X1, ..., Xn)

are statistics. So a and b are random; θ is not.

[Figure: confidence intervals for the same parameter θ, obtained from different samples of data. Caption: Confidence intervals and coverage of parameter θ.]

Example: X1, ..., Xn from Normal(µ, σ) with unknown µ, known σ.

1. Estimate θ = µ by its estimator X̄ = (1/n) ∑ Xi.

2. Find its distribution: Normal, with

    E(X̄) = µ,    Var(X̄) = (1/n²) ∑ Var(Xi) = nσ²/n² = σ²/n

Therefore,

    Z = (X̄ − µ)/(σ/√n)  is Normal(0,1).

3. Find critical values ±zα/2 such that

    P{ −zα/2 < Z < zα/2 } = 1 − α

for Z ∼ Normal(0,1).

4. Then we have

    P{ −zα/2 < (X̄ − µ)/(σ/√n) < zα/2 } = 1 − α

Solve for µ:

    P{ X̄ − zα/2·σ/√n < µ < X̄ + zα/2·σ/√n } = 1 − α

5. Hence,

    X̄ ± zα/2·σ/√n = [ X̄ − zα/2·σ/√n, X̄ + zα/2·σ/√n ]

is a 100(1 − α)% confidence interval for µ.

By the Central Limit Theorem, X̄ is approximately Normal for large n and any distribution of X1, . . . , Xn.
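The z-interval above can be sketched in a few lines of Python; the population parameters and sample size are hypothetical, and z_{0.025} = 1.96 is the standard critical value for a 95% interval:

```python
import math
import random

random.seed(6)

# Hypothetical population: Normal(mu = 50, sigma = 5), with sigma known
mu, sigma, n = 50.0, 5.0, 100
sample = [random.gauss(mu, sigma) for _ in range(n)]

x_bar = sum(sample) / n
z = 1.96  # z_{alpha/2} for alpha = 0.05
half_width = z * sigma / math.sqrt(n)

lo, hi = x_bar - half_width, x_bar + half_width
print(f"95% CI for mu: [{lo:.2f}, {hi:.2f}]")
```

The interval's width, 2 · 1.96 · σ/√n, depends only on σ, n, and α, not on the data.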

When σ is unknown

Data X1, ..., Xn from Normal(µ, σ) with unknown µ, unknown σ.

1. Estimate σ by

    S = √( (1/(n−1)) ∑ (Xi − X̄)² )

2. Use the t-distribution with (n − 1) degrees of freedom instead of the Normal. For large n, use the Normal approximation.

Result:

    X̄ ± tα/2,n−1 · S/√n
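A small worked t-interval in Python; the data are hypothetical measurements, and the critical value t_{0.025, 9} ≈ 2.262 is taken from a standard t-table (the standard library has no t-quantile function):

```python
import math

# Hypothetical small sample (n = 10), sigma unknown
sample = [9.8, 10.2, 10.4, 9.9, 10.1, 10.0, 9.7, 10.3, 10.0, 9.6]
n = len(sample)

x_bar = sum(sample) / n
# Sample standard deviation S with the (n - 1) divisor
s = math.sqrt(sum((x - x_bar) ** 2 for x in sample) / (n - 1))

t_crit = 2.262  # t_{0.025, 9} from a t-table
half_width = t_crit * s / math.sqrt(n)

lo, hi = x_bar - half_width, x_bar + half_width
print(f"95% t-interval: [{lo:.3f}, {hi:.3f}]")
```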

TESTING HYPOTHESES

Null hypothesis H0 and alternative HA = mutually exclusive statements about the unknown parameter θ.

Collect data. Conduct a test. State whether there is sufficient evidence to reject H0 in favour of HA.

                   Reject H0       Accept H0
    H0 is true     Type I error    correct
    H0 is false    correct         Type II error

Control the significance level

    α = P{ Type I error }

Two-sided test

Data: X1, ..., Xn from Normal(µ, σ) with unknown µ, known σ.

Test H0 : µ = µ0 vs HA : µ ≠ µ0.

1. Find ±zα/2. Acceptance region: [−zα/2, zα/2].

2. Compute the test statistic

    Z = (X̄ − µ0)/(σ/√n).

3. If Z belongs to the acceptance region, do not reject H0. Otherwise, reject H0.

If H0 is true, Z has the Normal(0,1) distribution, and

    P{ Type I error } = P{ |Z| > zα/2 } = α
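The two-sided Z-test can be sketched as follows; the null value µ0 = 50, known σ = 5, and simulated data (drawn with a true mean of 53, so H0 is false) are all hypothetical choices for the demo:

```python
import math
import random

random.seed(8)

# Test H0: mu = 50 vs HA: mu != 50 at alpha = 0.05, with known sigma.
# The data are simulated with a true mean of 53, so H0 is actually false.
mu0, sigma, n = 50.0, 5.0, 100
sample = [random.gauss(53.0, sigma) for _ in range(n)]

x_bar = sum(sample) / n
z = (x_bar - mu0) / (sigma / math.sqrt(n))

z_crit = 1.96  # z_{alpha/2} for alpha = 0.05
decision = "reject H0" if abs(z) > z_crit else "do not reject H0"
print(f"Z = {z:.2f}: {decision}")
```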


One-sided, right-tail tests

Test H0 : µ = µ0 vs HA : µ > µ0.

1. Find zα. The acceptance region is (−∞, zα].

2. Compute the test statistic

    Z = (X̄ − µ0)/(σ/√n).

3. If Z belongs to the acceptance region, do not reject H0. Otherwise, reject H0.


One-sided, left-tail tests

Test H0 : µ = µ0 vs HA : µ < µ0.

1. Find zα. The acceptance region is [−zα, +∞).

2. Compute the test statistic

    Z = (X̄ − µ0)/(σ/√n).

3. If Z belongs to the acceptance region, do not reject H0. Otherwise, reject H0.


Case of unknown variance

1. Estimate σ by

    S = √( (1/(n−1)) ∑ (Xi − X̄)² )

2. Use the t-distribution with (n − 1) degrees of freedom. For large n, use the Normal approximation.
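A worked one-sample t-test in Python; the data, the null value µ0 = 10, and α = 0.05 are hypothetical, and t_{0.025, 7} ≈ 2.365 comes from a standard t-table:

```python
import math

# Hypothetical one-sample t-test of H0: mu = 10 with unknown sigma
sample = [10.8, 11.2, 10.9, 11.5, 10.7, 11.0, 11.3, 10.6]
n = len(sample)
mu0 = 10.0

x_bar = sum(sample) / n
# Sample standard deviation S with the (n - 1) divisor
s = math.sqrt(sum((x - x_bar) ** 2 for x in sample) / (n - 1))
t = (x_bar - mu0) / (s / math.sqrt(n))

t_crit = 2.365  # t_{0.025, 7} for df = n - 1 = 7
decision = "reject H0" if abs(t) > t_crit else "do not reject H0"
print(f"t = {t:.2f} on {n - 1} df: {decision}")
```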

Hypotheses testing, Z-tests

For H0: θ = θ0, the test statistic is Z = (θ̂ − θ0)/√Var(θ̂), with E(θ̂) and Var(θ̂) computed under H0.

One-sample Z-tests for means and proportions, based on a sample of size n:

    H0         θ, θ̂     E(θ̂)   Var(θ̂)         Test statistic
    µ = µ0     µ, X̄     µ0     σ²/n           Z = (X̄ − µ0)/(σ/√n)
    p = p0     p, p̂     p0     p0(1 − p0)/n   Z = (p̂ − p0)/√( p̂(1 − p̂)/n )

Two-sample Z-tests comparing means and proportions of two populations, based on independent samples of sizes n and m:

    H0              θ, θ̂                E(θ̂)   Var(θ̂)                        Test statistic
    µX − µY = D     µX − µY, X̄ − Ȳ      D      σX²/n + σY²/m                 Z = (X̄ − Ȳ − D)/√( σX²/n + σY²/m )
    p1 − p2 = D     p1 − p2, p̂1 − p̂2    D      p1(1 − p1)/n + p2(1 − p2)/m   Z = (p̂1 − p̂2 − D)/√( p̂1(1 − p̂1)/n + p̂2(1 − p̂2)/m )
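The two-sample Z-test for proportions from the table can be computed directly; the counts below (64 successes of 200 vs 45 of 180) are hypothetical:

```python
import math

# Two-sample Z-test for H0: p1 - p2 = 0, using the table's formula.
# Hypothetical counts: 64 successes out of 200 vs 45 out of 180.
n, m = 200, 180
p1_hat = 64 / n
p2_hat = 45 / m

D = 0.0  # hypothesized difference under H0
se = math.sqrt(p1_hat * (1 - p1_hat) / n + p2_hat * (1 - p2_hat) / m)
z = (p1_hat - p2_hat - D) / se

print(f"Z = {z:.2f}")
```

At α = 0.05 two-sided, |Z| is compared with z_{0.025} = 1.96.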

Hypothesis testing, t-tests

    H0            Conditions                    Test statistic                        Degrees of freedom
    µ = µ0        Sample size n;                t = (X̄ − µ0)/(S/√n)                  n − 1
                  unknown σ
    µX − µY = D   Sample sizes n, m;            t = (X̄ − Ȳ − D)/(Sp √(1/n + 1/m))    n + m − 2
                  unknown but equal σX = σY
    µX − µY = D   Sample sizes n, m;            t = (X̄ − Ȳ − D)/√( SX²/n + SY²/m )   special formula
                  unknown, unequal σX ≠ σY                                            (Satterthwaite approximation)

Here Sp is the pooled sample standard deviation.