0% found this document useful (0 votes)

36 views3 pages

Theoretical Problem Week6

This document discusses different methods for calculating confidence intervals for proportions and differences in proportions from small sample sizes. It provides formulas and examples comparing the Wald interval, plus four interval, and Agresti-Caffo interval, noting situations where each method performs better or worse in terms of achieving the intended coverage probability.

Uploaded by

thibebongtran

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views3 pages

Theoretical Problem Week6

Uploaded by

thibebongtran

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

STATISTICAL INFERENCE (MS-C1620). 8.1.–17.4.2024. Aalto University.

Univer-
sity Lecturer Pekka Pere.

Theoretical exercise 5 (week 6)

Theory
The classical Wald confidence interval for a probability π is defined by the
bounds r
π̂(1 − π̂)
π̂ ± z1−α/2 . (1)
n
It is of width zero or is not meaningful if π̂ = 0. In such a case the rule of three

3 3

0, ≈ 0, .
n n+1

yields a valid one-sided 95% confidence interval. Both formulae are approxi-
mative. The latter approximation yields better coverage (closer to 95%). Both
formulae are applicable if n > 30.
Formula (1) fails also if π is close to 0 or 1 and n is small. E.g. if π = 0.05
and n = 25 then the 95% Wald confidence interval, defined by the bounds
in (1), covers π with probability about 0.75. Newcomb (1998, 868) argues
that confidence interval (1) should not be used in scientific research. Andersson
(2023), Fagerland et al. (2017, 65), Meeker et al. (2017, 105, 108), and Schilling
and Doi (2014) think likewise. The reason is the poor coverage of confidence
interval (1) if the sample is small and π is close to 0 or 1.
Plus four confidence interval (Agresti and Coull 1998) is a much better 95%
confidence interval if the sample is small. Four imaginary observations are added
to a sample (of n observations and y successes) as follows:

outcome
yes no Σ
y+2 n−y+2 n+4

Σ stands for the sum of the frequencies of outcomes yes and no. Plus four
confidence interval is calculated from this modified sample with formula (1).
Plus four confidence interval covers the true π with a probability which tends
to be much closer to 0.95 than the Wald confidence interval does. If π is very
close to 0 or 1 then the plus four confidence interval covers π with too large a
probability.
The Wald confidence interval for a difference of probabilites π1 −π2 is defined
by the bounds
s
π̂1 (1 − π̂1 ) π̂2 (1 − π̂2 )
π̂1 − π̂2 ± z1−α/2 + . (2)
n1 n2

Here it performs better than when estimating a single probability π but yet
tends to have a too small coverage probability.
Let us assume that the estimated probabilities π̂1 = n11 /n1 and π̂2 = n21 /n2
have been calculated from two independent samples with the following outcomes:
outcome
yes no Σ
sample 1 n11 n12 n1
sample 2 n21 n22 n2

An improvement is to add an imaginary observation to each outcome in the two

samples

outcome
yes no Σ
sample 1 n11 + 1 n12 + 1 n1 + 2
sample 2 n21 + 1 n22 + 1 n2 + 2

and to calculate a confidence interval by formula (2) from the modified samples.
Such a confidence interval is called an Agresti–Caffo confidence interval. Its
coverage probability is close to the intended even for fairly small sample sizes
(e.g. n1 = n2 = 20). If the sample sizes are very small (e.g. n1 = n2 = 10)
then the coverage probability of the Agresti–Caffo confidence interval is much
too large if πi s are close to 0 or 1 but otherwise can be satisfactory. The same
modification can be used for different confidence levels (not only 95%).
Derivations of the improved confidence intervals are skipped. The deriva-
tions employ exact small sample distribution of π̂ and Taylor approximation or
approximations of so called score confidence intervals.

Exercise
a) Intelligent extraterrestrial life has been searched for in many projects.
In the Phoenix project 1995–2004 radio wave frequencies from nearly 800 stars
were targeted and observed. No signs of anomalies or systemacy, i.e. signs of
intelligent extraterrestrial life, were detected from radio waves from any of the
stars.1
Calculate a 95% confidence interval for the proportion of stars of the kind
investigated in the Phoenix project with intelligent extraterrestrial life. In the
calculation assume that n = 800. What do you think?: Does the probability of
intelligent extraterrestrial life appear small or large? Would it be a good idea
to show the confidence interval to potential financiers of similar endeavours?
b) Morandi Bridge in Genova, Italy, collapsed 14.8.2018 killing 43 and in-
juring 16 people. The bridge had a concrete structure prone for corrosion and
damage. Need for maintenance had been reported.2
Finnish Transport Infrastructure Agency (Väylävirasto) announced 12.12.
2016 that it will investigate solidity of concrete in bridges built between 2011–
2016 and that the bridges to be investigated will be chosen by random sampling.
Yle News reported 2.1.2017 that the Agency had investigated 18 bridges and
had found deficiencies in 6 bridges.
Calculate a 95% confidence interval for the proportion of bridges built be-
tween 2011–2016 with deficiencies in solidity of concrete. Note: A confidence
1 https://en.wikipedia.org/wiki/Search_for_extraterrestrial_intelligence and

https://ui.adsabs.harvard.edu/abs/2004AAS...204.7504B (read 7.2.2024).

2 https://en.wikipedia.org/wiki/Ponte_Morandi (read 7.2.2024).
interval valid under large samples only is not asked for but a confidence interval
tailored for small samples.
c) Broberg and Hakovirta (2009) surveyd divorced Finnish fathers who do
not live with their children.3 The frequencies of meetings between fathers and
their children are tabulated below.

frequency of meetings between father

and children nij (%), i = 1, 2
age of the youngest child every week less often or not at all Σ
< 10 years 8 (42.1) 11 (57.9) 19 (100)
≥ 10 years 7 (33.3) 14 (66.7) 21 (100)
Fathers with children younger than 10 years of age appear to meet their children
more often than fathers with older children. Calculate a 95% Wald confidence
interval and a 95% Agresti–Caffo confidence interval for the difference of pro-
portions of fathers from the two groups who meet their children every week.
Compare the confidence intervals. Do the confidence intervals cover 0? How do
you interpret the results?

3 M. Broberg and M. Hakovirta (2009): Lapsistaan erillään asuvana isänä eron jälkeen.

In K. Forssén, A. Haataja and M. Hakovirta (eds.): Yksinhuoltajuus Suomessa. Väestön-

tutkimuslaitos. Tutkimuksia D 50/2009. The information in the tables has been provided by
Mia Hakovirta (personal communication).

(Ebook PDF) The Basic Practice of Statistics 8Th Edition: Go To Download The Full and Correct Content Document
No ratings yet
(Ebook PDF) The Basic Practice of Statistics 8Th Edition: Go To Download The Full and Correct Content Document
43 pages
Process Analysis by Statistical Methods D. Himmelblau
100% (4)
Process Analysis by Statistical Methods D. Himmelblau
474 pages
Week 8 Statistical Intervals
No ratings yet
Week 8 Statistical Intervals
32 pages
Theoretical Solution Week6
No ratings yet
Theoretical Solution Week6
3 pages
06 Chapter 15, 16, 17, 21 Inferences For Proportions Completed
No ratings yet
06 Chapter 15, 16, 17, 21 Inferences For Proportions Completed
18 pages
Lecture Slides 11 UN1201
No ratings yet
Lecture Slides 11 UN1201
23 pages
Binomial Confidence Intervals
No ratings yet
Binomial Confidence Intervals
17 pages
4 3+–+Interval+Estimates+for+Proportions
No ratings yet
4 3+–+Interval+Estimates+for+Proportions
4 pages
Population Mean (Known Variance)
No ratings yet
Population Mean (Known Variance)
5 pages
Agresti 2000
No ratings yet
Agresti 2000
10 pages
List of Formulae and Statistical Tables
No ratings yet
List of Formulae and Statistical Tables
4 pages
Estimating With Confidence Practice
No ratings yet
Estimating With Confidence Practice
1 page
Estimation Handout
No ratings yet
Estimation Handout
7 pages
Confidence Intervals - Google Slides
No ratings yet
Confidence Intervals - Google Slides
21 pages
UE23MA242A - Unit-2 - Class-17!18!19 - Confidence Intervals Introduction Large Samples CI - Population Mean
No ratings yet
UE23MA242A - Unit-2 - Class-17!18!19 - Confidence Intervals Introduction Large Samples CI - Population Mean
57 pages
SEE5211 Chapter5 P2017
No ratings yet
SEE5211 Chapter5 P2017
48 pages
Complete Business Statistics: Confidence Intervals
No ratings yet
Complete Business Statistics: Confidence Intervals
50 pages
PDF Lesson 2 Understanding Confidence Interval Estimates For The Population Mean
No ratings yet
PDF Lesson 2 Understanding Confidence Interval Estimates For The Population Mean
33 pages
UE23MA242A Unit-2 Class-20 Confidence Intervals For Proportions
No ratings yet
UE23MA242A Unit-2 Class-20 Confidence Intervals For Proportions
17 pages
Notes7 1o
No ratings yet
Notes7 1o
29 pages
Probability Distributions: by Dr. Ameer Kadhim Hussein. M.B.Ch.B. FICMS (Community Medicine
No ratings yet
Probability Distributions: by Dr. Ameer Kadhim Hussein. M.B.Ch.B. FICMS (Community Medicine
37 pages
Confidence Interval Estimate
No ratings yet
Confidence Interval Estimate
30 pages
Stats 8 Practice Test
No ratings yet
Stats 8 Practice Test
6 pages
Word Format LAS Statistics and Prob 3
No ratings yet
Word Format LAS Statistics and Prob 3
6 pages
Estimatation
No ratings yet
Estimatation
21 pages
MIT18 05S14 Class22-Slde-A
No ratings yet
MIT18 05S14 Class22-Slde-A
19 pages
CI For A Proportion
No ratings yet
CI For A Proportion
24 pages
Statistics FinalReview
No ratings yet
Statistics FinalReview
8 pages
Confidence Intervals For A Single Sample: H.W. Kayondo C 1
No ratings yet
Confidence Intervals For A Single Sample: H.W. Kayondo C 1
16 pages
Ch3 Prob II Anu Fall24 1
No ratings yet
Ch3 Prob II Anu Fall24 1
20 pages
Understanding Confidence Interval Estimates
No ratings yet
Understanding Confidence Interval Estimates
29 pages
Math-138 Unit 3 Packet Fall 2024 (Canvas)
No ratings yet
Math-138 Unit 3 Packet Fall 2024 (Canvas)
36 pages
Topic 11 Confidence Intervals For A Single Sample
No ratings yet
Topic 11 Confidence Intervals For A Single Sample
21 pages
Chapter Five Statistical Inferences Estimating For Single Populations Estimating Population Mean With Large Sample Size
No ratings yet
Chapter Five Statistical Inferences Estimating For Single Populations Estimating Population Mean With Large Sample Size
13 pages
Binomial统计相关
No ratings yet
Binomial统计相关
33 pages
Point and Interval Estimates
No ratings yet
Point and Interval Estimates
17 pages
Finals. Quiz No. 1
No ratings yet
Finals. Quiz No. 1
8 pages
3.18 Confidence Intervals
No ratings yet
3.18 Confidence Intervals
3 pages
Lecture5 More Bayes
No ratings yet
Lecture5 More Bayes
16 pages
Bayesian Credible Interval
100% (1)
Bayesian Credible Interval
8 pages
Handout 4 - Statistical Interval
No ratings yet
Handout 4 - Statistical Interval
13 pages
Tutorial Confidence Interval
No ratings yet
Tutorial Confidence Interval
21 pages
EDA Module 7
No ratings yet
EDA Module 7
3 pages
Biostatistics Chapter 17
No ratings yet
Biostatistics Chapter 17
13 pages
9 Statistical Interval PDF
No ratings yet
9 Statistical Interval PDF
16 pages
ch16 Ci
No ratings yet
ch16 Ci
21 pages
66cc482dcab874225a22d789 Chapter8 StatisticalIntervalsforaSingleSample
No ratings yet
66cc482dcab874225a22d789 Chapter8 StatisticalIntervalsforaSingleSample
17 pages
Unit 2 Combined
No ratings yet
Unit 2 Combined
192 pages
05 - Estimating A Proportion
No ratings yet
05 - Estimating A Proportion
5 pages
Unit 11 - Inferential Statistics
No ratings yet
Unit 11 - Inferential Statistics
10 pages
Statistical Intervals 2
No ratings yet
Statistical Intervals 2
58 pages
Chap 006
No ratings yet
Chap 006
38 pages
Act 12
No ratings yet
Act 12
1 page
Topic 5
No ratings yet
Topic 5
11 pages
Week 8: Lecture 1: Parametric Confidence Intervals
No ratings yet
Week 8: Lecture 1: Parametric Confidence Intervals
7 pages
Confidence Intervals I: ENGG 2780A ESTR 2020
No ratings yet
Confidence Intervals I: ENGG 2780A ESTR 2020
17 pages
MArketing Research Notes Chapter18
No ratings yet
MArketing Research Notes Chapter18
4 pages
STAT210 FL17 LCN 6 Edited
No ratings yet
STAT210 FL17 LCN 6 Edited
25 pages
Lab Activity #10: More Review of Confidence Intervals (KEY)
No ratings yet
Lab Activity #10: More Review of Confidence Intervals (KEY)
4 pages
Estimation of Parameters
No ratings yet
Estimation of Parameters
14 pages
Mathematical Foundations of Information Theory
From Everand
Mathematical Foundations of Information Theory
A. Ya. Khinchin
3.5/5 (9)
Foundations of Elementary Analysis
From Everand
Foundations of Elementary Analysis
Roshan Trivedi
No ratings yet
Group Project-F
No ratings yet
Group Project-F
1 page
Group Project-C
No ratings yet
Group Project-C
1 page
Group Project-B
No ratings yet
Group Project-B
1 page
Group Project-A
No ratings yet
Group Project-A
1 page
Answers Key-Solutions of Semi-Final Exam in Statistics and Probability 2019-2020 For of Grade 11
No ratings yet
Answers Key-Solutions of Semi-Final Exam in Statistics and Probability 2019-2020 For of Grade 11
2 pages
Unit 3 (QM)
No ratings yet
Unit 3 (QM)
20 pages
SM Tutorial Sheet-2
0% (1)
SM Tutorial Sheet-2
2 pages
Univariate Analysis of Variance: Between-Subjects Factors
No ratings yet
Univariate Analysis of Variance: Between-Subjects Factors
3 pages
Continuous Random Variable
No ratings yet
Continuous Random Variable
8 pages
(Ebook PDF) Statistics Unplugged 4th Edition by Sally Caldwell - Download The Ebook With All Fully Detailed Chapters
100% (2)
(Ebook PDF) Statistics Unplugged 4th Edition by Sally Caldwell - Download The Ebook With All Fully Detailed Chapters
42 pages
Quiz 8 Chap 9
No ratings yet
Quiz 8 Chap 9
5 pages
ARIMA Box-Jenkins 1st
No ratings yet
ARIMA Box-Jenkins 1st
15 pages
Channel Capacity PDF
No ratings yet
Channel Capacity PDF
4 pages
Statistics and Probability - Solved Assignments - Semester Fall 2007
50% (2)
Statistics and Probability - Solved Assignments - Semester Fall 2007
27 pages
Comps Sample Questions Applied Statistics Methods
No ratings yet
Comps Sample Questions Applied Statistics Methods
135 pages
Structural Equation Model-SEM
No ratings yet
Structural Equation Model-SEM
113 pages
Machine Learning Classification Bootcamp Cheatsheet
No ratings yet
Machine Learning Classification Bootcamp Cheatsheet
7 pages
Advanced Statistics Manual PDF
100% (3)
Advanced Statistics Manual PDF
258 pages
Problem SET 1
No ratings yet
Problem SET 1
3 pages
Sample Questions: Subject Name: Semester: VI
No ratings yet
Sample Questions: Subject Name: Semester: VI
17 pages
Probability
No ratings yet
Probability
12 pages
Graphical Displays For Meta-Analysis
No ratings yet
Graphical Displays For Meta-Analysis
15 pages
Correlation
No ratings yet
Correlation
11 pages
Semester Cohort: SMK Kai Chung (Yfb 6301) Peti Surat 100, 96507 Bintangor, Sarawak
No ratings yet
Semester Cohort: SMK Kai Chung (Yfb 6301) Peti Surat 100, 96507 Bintangor, Sarawak
6 pages
Process Capability NIST
No ratings yet
Process Capability NIST
7 pages
T PFN: A T T S S T C P S: AB Ransformer HAT Olves Mall Abular Lassification Roblems in A Econd
No ratings yet
T PFN: A T T S S T C P S: AB Ransformer HAT Olves Mall Abular Lassification Roblems in A Econd
33 pages
Assignment
No ratings yet
Assignment
6 pages
ASSIGNMENT
No ratings yet
ASSIGNMENT
2 pages
Maths 2 Syllabus
No ratings yet
Maths 2 Syllabus
2 pages
Clabe Problem Sheet 6 Solution
No ratings yet
Clabe Problem Sheet 6 Solution
5 pages
Answer
100% (3)
Answer
5 pages
Evaluation Metrics For Regression: Dr. Jasmeet Singh Assistant Professor, Csed Tiet, Patiala
No ratings yet
Evaluation Metrics For Regression: Dr. Jasmeet Singh Assistant Professor, Csed Tiet, Patiala
13 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Theoretical Problem Week6

Uploaded by

Theoretical Problem Week6

Uploaded by

STATISTICAL INFERENCE (MS-C1620). 8.1.–17.4.2024. Aalto University.

Theoretical exercise 5 (week 6)

An improvement is to add an imaginary observation to each outcome in the two

https://ui.adsabs.harvard.edu/abs/2004AAS...204.7504B (read 7.2.2024).

frequency of meetings between father

In K. Forssén, A. Haataja and M. Hakovirta (eds.): Yksinhuoltajuus Suomessa. Väestön-

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.