0% found this document useful (0 votes)

16 views65 pages

5-6.sampling Error and Confidence Interval 1

Uploaded by

yanghm669

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views65 pages

5-6.sampling Error and Confidence Interval 1

Uploaded by

yanghm669

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 65

Sampling Error and Confidence Interval

抽样误差与置信区间

Haomin Yang
School of Public Health
Fujian Medical University

1
Content
• Sampling error and Sampling distribution
• Central limit theorem
• Standard error
• t distribution
• Point estimation
• Confidence Interval estimation

2 2
Population and sample
• Population: The whole individuals that one
intends to study.

--- Homogeneity but with Variation.

• Sample: A representative part of the population. It

is a subset of the population.

3
The relationship between the population and sample

Population
(The complete set) inference

sampling Sample
(The subset of
the population)

Samples are taken from populations to provide estimates of population

parameters. Then we use sample data to make an inference about a population.

4
Why use sample?

• Cost
• Time
• Possibility to find all individuals

• Further questions for using samples

– How to select sample from the population?
– How many are enough?

5 5
Aims of sampling

• Reduces cost of research

• Generalize about a larger population

• In some cases (e.g. industrial production) analysis

may be destructive, so sampling is needed
Sampling research
• In statistics, sampling is the selection of a subset (a statistical
sample) of individuals from within a statistical population to
estimate characteristics of the whole population.

• Need to evaluate the precision of our estimation→ aim of this

lecture

7
Importance of sampling

• Traditionally, the marginal costs of data collection and

processing were high

• In the Era of Big Data: easier and faster to collect, store

and process lots of data

• sampling?

8
Importance of sampling

Even if the data that you have is very “big”, it might represent only a part
of the population and may not be representative of the whole!
9
10
Importance of sampling

• Throwing computational resources at a problem may not

always solve the problem

• Focus on reducing the sampling bias, not only increasing the

sample size

• Sometimes, less is more!

11
• Generally, it is costly and
labor-intensive to study the
entire population, and in
some cases even
impossible because the
Advantages of sampling
• lower cost number of the whole
• faster data collection individuals may be infinite.
• High quality of data

12
Population

All people or items with the characteristic one

wishes to understand
• Eg. All people in FJMU
• Dimensions: time? Space?

• Broad or narrow：carefully define

- Demographically mixed and geographically
dispersed→ difficult to gain access
13
Sampling frame

The sampling frame is the actual list of individuals that

the sample will be drawn from. Ideally, it should include
the entire target population.

Eg. Work condition of doctors in hospital A

14
Sample size

The number of individuals in your sample depends on the

size of the population, and on how precisely you want the
results to represent the population as a whole.

• sample size calculator (www.openepi.com)

• the larger the sample size, the more accurately and
confidently

15
16
Sampling techniques

• Random sampling
 Simple random sampling
 Systemic sampling
 Stratified random sampling
 Cluster sampling
• Non-random sampling
 Convenience Sampling
 Judgement Sampling
 Snowball sampling
17
Simple random sampling

• Representativeness: All individuals in a

population have an equal chance of being
selected

• Set a random number for each individual

• Limitation: need complete list of all

individualls. If not possible, use other
sampling approach
18
19
Eg. Work condition of doctors

Assign a number to every doctor in the

hospital database from 1 to 1000, and
use a random number generator to
select 100 numbers.

20
Systemic sampling

• Members of the population are put in some

order. A starting point is selected at random,
and every nth superscript member is selected
to be in the sample.
• Evenly sampled

• Hidden periodic trait within the population

and sampling coincidently consistent with
that periodic trait--- not representative
• Hospitalization in the first week of Oct 21
22
Eg. Work condition of doctors

All doctors are listed in alphabetical order.

From the first 10 numbers, you randomly
select a starting point: number 6. From
number 6 onwards, every 10th person on
the list is selected (6, 16, 26, 36, and so on),
and you end up with a sample of 100.

23
Stratified random sampling
• The population is first split into groups.
The members from each group are
chosen randomly.
• Groups should not be overlapped
• Equal importance and variance of data in
each group
• Draw more precise conclusions by
ensuring that every subgroup is properly
represented in the sample.
24
25
Eg. Work condition of doctors

The hospital has 600 female doctors and 400

male doctors. You want to ensure that the
sample reflects the gender balance of the
hospital, so you sort the population into two
strata based on gender. Then you use random
sampling on each group, selecting 60 women
and 40 men.

26
Cluster sampling
• The entire population is divided into clusters
or sections and then the clusters are
randomly selected. All the elements of the
cluster are used for sampling. Clusters are
identified using details such as age, sex,
location (geographic cluster)
• Give all the clusters equal chances of being
selected
• Instead of sampling individuals from each
subgroup, you randomly select entire
subgroups. 27
Cluster sampling

This method is good for dealing with large

and dispersed populations, but there is
more risk of error in the sample, as there
could be substantial differences between
clusters. It’s difficult to guarantee that the
sampled clusters are really representative
of the whole population.

28
29
Eg. Work condition of doctors

The hospital group has clinics in 10

communities across the city (all with roughly
the same number of doctors in similar roles).
You don’t have the time to go to every clinic
to collect your data, so you use random
sampling to select 3 clinics – these are your
clusters.

30
Non-random sampling

31
32
Exercise

Suppose you are going to be conducting a study on

FJMU students, asking for their opinion on influenza
vaccination. First, formulate your research question.
Then, describe how you would carry out the sampling of
students using the following methods:
(a) simple random sampling
(b) stratified sampling
(c) cluster sampling

33
Sampling error

34
Sampling research
• Statistical inference refers to reach conclusions about
population based on a sample.

• The sampling error exists in any sampling research.

Sampling error
• The difference between statistics from different
samples, as well as the difference between sample
statistics and population parameter, is called
sampling error.
• It can not be avoided but can be estimated.

36
• Sample surveys take into account the
study of a tiny segment of a population,
so, there is always a particular amount of
inaccuracy in the information obtained

Sampling Error =
(Response Error) +
(Frame Error) +
(Chance Error)

37
How to Reduce Sampling Error?
• Increasing Sample Size
the size of the sample increases, the chance of
occurrence of the sampling error will be less. No error if
the sample size and the population size coincide

• Stratification
Stratified sampling: all the groups are defined in the
sample, the sampling error is reduced.

38
• Sampling error is the reason why we have to use
statistics.

• sampling error is a consequence of

– the population distribution of the variables

– the sampling method used to investigate the population.

39
Sampling distribution

40
Sampling distribution

• a probability distribution of a statistic obtained from a

larger number of samples(with sample size N) drawn
from a specific population, usually the mean

41
Sampling Distribution
• A sampling distribution is a distribution of a statistic over
all possible samples.

• To get a sampling distribution,

– 1. Take a sample of size N (a given number like 5, 10, or 1000) from
a population
– 2. Compute the statistic (e.g., the mean) and record it.
– 3. Repeat 1 and 2 a lot (infinitely for large pops).
– 4. Plot the resulting sampling distribution, a distribution of a
statistic over repeated samples.
Simulation test
• Population: X ~N (165.70,3.212 )
• Repeatedly draw 100 independent, random
samples from the same population with
sample size equal to 20.
1、 165.82, 3.06
Population: 2、 164.98, 3.04
Normal 3、 165.75, 3.07
distribution ┆ 100 samples
99、165.82,3.14
 =165.70
100、165.92, 3.18
 =3.21 n =20

43
Simulation test
Frequency

X ~N (165.70,3.212 )

Sample mean
44
Simulation test indicate that:

• The sample means are different from

population mean.

• The sample means differ from each other.

• The mean of the sample means is equal

to the population mean.

45
Simulation test indicate that:

• The range of the sample means is narrower

than that of the original population
distribution.

• The sample mean is symmetric about the

population mean, taller around center, shorter
on two sides. It is normal distribution.

46
Central limit theorem 1
If a population is a normal distribution, with mean
equal to μ and standard deviation equal to σ, the
sampling distribution of the sample mean x is also
normal distribution with mean equal to μ and
standard deviation equal to the population standard
deviation divided by the square root of the sample
2
size. X ~ N ( , 2 ) X ~ N ( , )
n

47
Central limit theorem 1
• The population is
normal distribution.

• The variation of the

sample means
decreases as the
sample size n
increases.

48
Central limit theorem 2
For simple random samples of n observations taken
from a population with mean equal to μ and standard
deviation equal to σ, regardless of the population’s
distribution, provided the sample size n is sufficiently
large, the distribution of the sample mean x will be
approximately normal with mean equal to μ and standard
deviation equal to the population standard deviation
divided by the square root of the sample size.
2
X ~ N ( , )
n 49
Central limit theorem 2
• The population is
uniform distribution.

• The variation of the

sample means
decreases as the
sample size n
increases.
50
Central limit theorem 2
• The population is
exponential distribution.

• The variation of the

sample means
decreases as the
sample size n
increases.
51
Central limit theorem 2
• The population is U-
shaped distribution.

• The variation of the

sample means
decreases as the
sample size n
increases.
52
53
Sampling Distribution
• The sampling distribution shows the relation
between the probability of a statistic and the
statistic’s value for all possible samples of
size N drawn from a population.
f(M) Hypothetical Distribution of Sample Means
Sampling Distribution Mean and SD
• The Mean of the sampling distribution is defined
the same way as any other distribution
(expected value).
• The SD of the sampling distribution is the
Standard Error. Important and useful.
• Variance of sampling distribution is the expected
value of the squared difference – a mean
square.
Standard error
• The variation of the sample mean, or
the standard deviation of the sample
mean, is called the standard error of

the mean SE, denoted by X  .
n

• The standard deviation of the initial

variable: σ.
56
Standard error
• The standard error is used to measure the
sampling error. It is affected by both standard
deviation and sample size.

• The standard deviation is a fixed level we

cannot change. In order to minimize the
standard error , the only thing we can do is to

increase sample size. X 
n
57
Standard error

• In practice, the population standard deviation σ is

usually unknown and replaced by the sample
standard deviation s approximately.

 S
X  SX 
n n

58
• SE gives us a way to quantify how much variability we
expect to see in a sampling distribution

• A point estimate is useless without some kind of

associated measure of uncertainty. A standard error is
one such measure

59
60
Exercise
• Random samples of size 225 are drawn from a population
with mean 100 and standard deviation 20. Find the mean and
standard deviation of the sample mean.

• Random samples of size 64 are drawn from a population

with mean 32 and standard deviation 5 . Find the mean and
standard deviation of the sample mean

61
Exercise

A population has mean 75 and standard deviation 12.

1.Random samples of size 121 are taken. Find the mean and
standard deviation of the sample mean.

2.How would the answers to part (a) change if the size of

the samples were 400 instead of 121?

62
Exercise

• If the standard error of the mean is 10 for N=12 ,

what is the standard error of the mean for N=22 ?

• If the standard error of the mean is 50 for N=25 ,

what is it for N=64 ?

63
True/false
• The standard error of the mean is smaller when N=20 than when N=10

• You choose 20 students from the population and calculate the mean of their test
scores. You repeat this process 100 times and plot the distribution of the means. In
this case, the sample size is 100

• The median has a sampling distribution

• In your school, 40% of students watch TV at night. You randomly ask 5 students
every day if they watch TV at night. Every day, you would find that 2 of the 5 do
watch TV at night

64
What is the point of all this
• Why looking at properties of repeated samples from a
population?
• We also don’t know anything about the population
parameter of interest.

• how point estimates behave under repeated sampling

(i.e. sampling distributions),
• how ‘sampling error’ and ‘standard error’ relate to
sampling distributions.

Phillips Disaster 1989
50% (2)
Phillips Disaster 1989
24 pages
Personal Mandala Rubric
No ratings yet
Personal Mandala Rubric
2 pages
Unit-III Sample & Sampling Distribution
No ratings yet
Unit-III Sample & Sampling Distribution
53 pages
4 - Sampling and Sample Size - SFB
No ratings yet
4 - Sampling and Sample Size - SFB
52 pages
Gate Scholorship Work - October: Sampling Fundamentals
No ratings yet
Gate Scholorship Work - October: Sampling Fundamentals
13 pages
Lecture 13
No ratings yet
Lecture 13
44 pages
MRM Mod 3
No ratings yet
MRM Mod 3
121 pages
Sample Design and Sampling Procedures
No ratings yet
Sample Design and Sampling Procedures
43 pages
Statistics For Managers Using Microsoft Excel: 5 Edition
No ratings yet
Statistics For Managers Using Microsoft Excel: 5 Edition
43 pages
Lecture 8
No ratings yet
Lecture 8
39 pages
Chapter 4
No ratings yet
Chapter 4
40 pages
Introduction To Management Chapter One Rift Valley University
No ratings yet
Introduction To Management Chapter One Rift Valley University
31 pages
Sampling and Sampling Distributions
No ratings yet
Sampling and Sampling Distributions
7 pages
Sampling MM 2022
No ratings yet
Sampling MM 2022
63 pages
Bus 6
No ratings yet
Bus 6
45 pages
Chapter 3
100% (1)
Chapter 3
79 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
30 pages
Sampling
No ratings yet
Sampling
86 pages
Chapter 2-Part 1 Applied Statistics
No ratings yet
Chapter 2-Part 1 Applied Statistics
30 pages
Inferential Statistics
No ratings yet
Inferential Statistics
169 pages
Sampling & Sampling Distribution: by Asif Hanif
No ratings yet
Sampling & Sampling Distribution: by Asif Hanif
25 pages
Sampling: Iiird Year Resident
No ratings yet
Sampling: Iiird Year Resident
26 pages
Sampling Distribution
No ratings yet
Sampling Distribution
29 pages
Stat For Comp (7-9)
No ratings yet
Stat For Comp (7-9)
22 pages
Sampling Theory
No ratings yet
Sampling Theory
19 pages
Sample and Population
No ratings yet
Sample and Population
35 pages
Sampling in Daily Life
No ratings yet
Sampling in Daily Life
45 pages
Sampling Design
No ratings yet
Sampling Design
104 pages
Session 8
No ratings yet
Session 8
34 pages
RM 7
No ratings yet
RM 7
47 pages
Est&Hypgp 7
No ratings yet
Est&Hypgp 7
292 pages
Lecture 5 Statistics
0% (1)
Lecture 5 Statistics
52 pages
Sampling Method and Estimation: Statistics For Economics 1
No ratings yet
Sampling Method and Estimation: Statistics For Economics 1
62 pages
Sampling
No ratings yet
Sampling
42 pages
Ba1 7
No ratings yet
Ba1 7
37 pages
Chapter Seven
No ratings yet
Chapter Seven
35 pages
Intro W10 Rev
No ratings yet
Intro W10 Rev
23 pages
4th Unit - Statistics
No ratings yet
4th Unit - Statistics
13 pages
Samplin Distn
No ratings yet
Samplin Distn
37 pages
Report On Sampling Techniques
0% (1)
Report On Sampling Techniques
44 pages
5sampling Methods
No ratings yet
5sampling Methods
78 pages
Sampling and Sampling Distribution
No ratings yet
Sampling and Sampling Distribution
67 pages
What Is Sampling?
100% (2)
What Is Sampling?
45 pages
Lesson 07 - Sampling and Sampling Distributions (Without Video)
No ratings yet
Lesson 07 - Sampling and Sampling Distributions (Without Video)
53 pages
Stat 11
No ratings yet
Stat 11
12 pages
Sampling and Sample Size Calculation: Lazereto de Mahón, Menorca, Spain September 2006
No ratings yet
Sampling and Sample Size Calculation: Lazereto de Mahón, Menorca, Spain September 2006
49 pages
Session 9
No ratings yet
Session 9
29 pages
Sampling and Distribution
No ratings yet
Sampling and Distribution
40 pages
Introduction To Sampling: Situo Liu Spry, Inc. 10/25/2013
No ratings yet
Introduction To Sampling: Situo Liu Spry, Inc. 10/25/2013
22 pages
Chapter 7 BRM
No ratings yet
Chapter 7 BRM
51 pages
Brief Lecture Notes
No ratings yet
Brief Lecture Notes
13 pages
Statistics 2: DR Taher
No ratings yet
Statistics 2: DR Taher
42 pages
Lesson 6 - Sampling Distributions
No ratings yet
Lesson 6 - Sampling Distributions
7 pages
Module III Sampling
No ratings yet
Module III Sampling
62 pages
Sampling and Sampling Distribution
100% (1)
Sampling and Sampling Distribution
64 pages
Lectorial Slides 6a
No ratings yet
Lectorial Slides 6a
30 pages
Sampling (Method)
No ratings yet
Sampling (Method)
31 pages
Eth Od S
No ratings yet
Eth Od S
17 pages
Lecture 3 Sampling
No ratings yet
Lecture 3 Sampling
83 pages
Sampling Techniques
No ratings yet
Sampling Techniques
61 pages
Sampling in Statistics
From Everand
Sampling in Statistics
Stephanie Glen
No ratings yet
Elementary Statistics
From Everand
Elementary Statistics
jay prakash Maheshwari
5/5 (1)
Corex Delivery
No ratings yet
Corex Delivery
37 pages
Social Work in A Digital Age - Ethical and Risk Management Challenges
No ratings yet
Social Work in A Digital Age - Ethical and Risk Management Challenges
12 pages
Chicago Boogie - Alto Sax
No ratings yet
Chicago Boogie - Alto Sax
2 pages
Manual Reductores FACHINI
No ratings yet
Manual Reductores FACHINI
32 pages
Dissertation Sara Parchami
100% (2)
Dissertation Sara Parchami
7 pages
Supply Chain Improvement in Construction Industry
No ratings yet
Supply Chain Improvement in Construction Industry
8 pages
Orlan Suit Introduction
No ratings yet
Orlan Suit Introduction
20 pages
Code-Switching As A Teaching and Learning Strategy
No ratings yet
Code-Switching As A Teaching and Learning Strategy
21 pages
Life Plan by Randy Pope
No ratings yet
Life Plan by Randy Pope
25 pages
Sir Sanny DLP
No ratings yet
Sir Sanny DLP
8 pages
QMB 6357 Welcome Letter
No ratings yet
QMB 6357 Welcome Letter
4 pages
Building Services Compressed Compressed
No ratings yet
Building Services Compressed Compressed
79 pages
LSB Exercise 1 Boot Sequence
No ratings yet
LSB Exercise 1 Boot Sequence
11 pages
Ocean Maths Homework
100% (1)
Ocean Maths Homework
8 pages
Technical Delay Report
100% (1)
Technical Delay Report
1 page
2021 - OanhNC - Analysis of PVD With Surcharge Preloading of Hiep Phuoc Clay
No ratings yet
2021 - OanhNC - Analysis of PVD With Surcharge Preloading of Hiep Phuoc Clay
16 pages
Chapter 13 - Aggregate Supply and The Short-Run Tradeoff Between Inflation and Unemployment
No ratings yet
Chapter 13 - Aggregate Supply and The Short-Run Tradeoff Between Inflation and Unemployment
26 pages
The Lifestyle Flow
No ratings yet
The Lifestyle Flow
14 pages
003 - Syngas Generation For GTL PDF
No ratings yet
003 - Syngas Generation For GTL PDF
91 pages
Safety Data Sheet: 1. Identification of The Substance/Mixture and The Supplier
No ratings yet
Safety Data Sheet: 1. Identification of The Substance/Mixture and The Supplier
8 pages
Central University of Haryana: Temporary Camp Office: Govt. B.Ed. College Building, Narnaul (Distt. Mahendergarh) Haryana
No ratings yet
Central University of Haryana: Temporary Camp Office: Govt. B.Ed. College Building, Narnaul (Distt. Mahendergarh) Haryana
7 pages
APPENDIX IV Geotechnical Factual Report
No ratings yet
APPENDIX IV Geotechnical Factual Report
69 pages
Cambridge IGCSE: Travel & Tourism 0471/21
No ratings yet
Cambridge IGCSE: Travel & Tourism 0471/21
12 pages
Swarna Ganga Form
No ratings yet
Swarna Ganga Form
1 page
Understanding How PeopleCode Events Work
No ratings yet
Understanding How PeopleCode Events Work
14 pages
World Religion Week 2 PDF
No ratings yet
World Religion Week 2 PDF
9 pages
EPB-6. Cs-Ti
No ratings yet
EPB-6. Cs-Ti
29 pages
Raspberry Pi Factsheet
No ratings yet
Raspberry Pi Factsheet
9 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

5-6.sampling Error and Confidence Interval 1

Uploaded by

5-6.sampling Error and Confidence Interval 1

Uploaded by

Sampling Error and Confidence Interval

--- Homogeneity but with Variation.

• Sample: A representative part of the population. It

Samples are taken from populations to provide estimates of population

• Further questions for using samples

• Reduces cost of research

• Generalize about a larger population

• In some cases (e.g. industrial production) analysis

• Need to evaluate the precision of our estimation→ aim of this

• Traditionally, the marginal costs of data collection and

• In the Era of Big Data: easier and faster to collect, store

• Throwing computational resources at a problem may not

• Focus on reducing the sampling bias, not only increasing the

• Sometimes, less is more!

All people or items with the characteristic one

• Broad or narrow：carefully define

The sampling frame is the actual list of individuals that

Eg. Work condition of doctors in hospital A

The number of individuals in your sample depends on the

• sample size calculator (www.openepi.com)

• Representativeness: All individuals in a

• Set a random number for each individual

• Limitation: need complete list of all

Assign a number to every doctor in the

• Members of the population are put in some

• Hidden periodic trait within the population

All doctors are listed in alphabetical order.

The hospital has 600 female doctors and 400

This method is good for dealing with large

The hospital group has clinics in 10

Suppose you are going to be conducting a study on

• The sampling error exists in any sampling research.

• sampling error is a consequence of

– the sampling method used to investigate the population.

• a probability distribution of a statistic obtained from a

• To get a sampling distribution,

• The sample means are different from

• The sample means differ from each other.

• The mean of the sample means is equal

• The range of the sample means is narrower

• The sample mean is symmetric about the

• The variation of the

• The variation of the

• The variation of the

• The variation of the

• The standard deviation of the initial

• The standard deviation is a fixed level we

• In practice, the population standard deviation σ is

• A point estimate is useless without some kind of

• Random samples of size 64 are drawn from a population

A population has mean 75 and standard deviation 12.

2.How would the answers to part (a) change if the size of

• If the standard error of the mean is 10 for N=12 ,

• If the standard error of the mean is 50 for N=25 ,

• The median has a sampling distribution

• how point estimates behave under repeated sampling

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.