Introduction To Statistical Concepts - New1
Introduction To Statistical Concepts - New1
Concepts
Dr. Pooja Goel
Department of Commerce
Types of Statistics
Descriptive Statistics
A sample is defined as a
subset of a population.
The key is that the
sample consists of some,
but not all, of the
members of the
population.
Parameter
• Statistic
• A statistic is defined as a
characteristic of a sample.
Impossible
Inconvenient
Expensive
Homogenous
Stages in the Selection of a Sample
Define the
Target
Population
Select a
sampling
frame
Choosing
sample
method
Procedure for
selecting
sampling
Determine
the sampling
size
Select actual
sampling
units
Types of sampling method
If a company employs 3,500 people and if a random sample of 175 of these employees has
been taken by systematic sampling, what is the value of k? The researcher would start the
sample selection between what two values? Where could the researcher obtain a frame for
this study?
Let’s try this….
Q:A pharmaceutical company wants to trace the effects of a new
drug on patients with specific health problems (muscular
dystrophy, sickle cell anemia, rheumatoid arthritis, etc.). It then
contacts such individuals and, with a group of voluntarily
consenting patients, tests the drug. Which sampling technique this
company has used?
Judgement Sampling
Let’s try this….
Q:A human resources director is interested in knowing why staff resign.
Which sampling will be useful in this case for conducting exit
interviews of all members completing their final papers in the human
resources department on the same day, before resigning. The sample
chosen for interview will be based on a simple random sampling of the
various clusters of personnel resigning on different days. The interviews
will help to understand the reasons for turnover of a heterogeneous
group of individuals (i.e., from various departments), and the study can
be conducted at a low cost.
Cluster Sampling
Types of sampling method
How much time is available to collect the data from the sample?
Comparative Analysis of Sampling Techniques
Comparative Analysis of Sampling Techniques
Comparative Analysis of Sampling Techniques
Errors associated with Sampling
Errors associated with sampling
Non-response error: can occur when subjects who refuse to take part in a
study, or who drop out before the study can be completed, are systematically
different from those who participate.
Extremity Bias. Some individuals tend to use extremes when responding to questions.
For example, they may choose only “1” or “10” on a ten-point scale.
Interviewer Bias. Response bias may arise from the interplay between interviewer
and respondent. If the interviewer’s presence influences respondents to give untrue or
modified answers.
Is it more representative?
Standard error varies inversely with the square root of the sample size.
Hence, if we want to reduce the standard error given a particular
standard deviation in the sample, we need to increase the sample size.
Confidence
Confidence denotes how certain we are that our estimates will really hold true
for the population.
Other things being equal, the narrower the range, the lower the confidence. In
other words, there is a trade-off between precision and confidence for any
given sample size.
The level of confidence can range from 0 to 100%. 95% confidence is the
conventionally accepted level for most business research, most commonly
expressed by denoting the significance level as p ≤ 0.05.
In other words, we say that at least 95 times out of 100 our estimate will
reflect the true population characteristic.
Precision VS Confidence Level
Sample data, Precision, and Confidence in Estimation
For example, we may want to estimate the mean dollar value of
purchases made by customers when they shop at department stores.
From a sample of 64 customers sampled through a systematic
sampling design procedure, we may find that the sample mean =105 ,
and the sample standard deviation S = 10, the sample mean, is a point
estimate of μ, the population mean. We could construct a confidence
interval around X to estimate the range within which μ will fall. The
standard error and the percentage or level of confidence we require
will determine the width of the interval, which can be represented by
the following formula, where K is the t statistic for the level of
confidence desired.
Sample data, Precision, and Confidence in Estimation
Sample data, Precision, and Confidence in Estimation
Characteristics
1. Continuous distribution
2. Symmetrical distribution about its
mean
3. Asymptotic to the horizontal axis
4. It is unimodal
5. Family of curves
6. Area under curve is 1
Standard Normal Distribution
Normal Distribution
Normal Distribution
Calculation of Sample Size in case of mean
Calculation of Sample Size in case of Proportion
• Below $100
a) P(͞X< 160)?
b) P(͞X >142)?
In a normal distribution with mean 375 and standard deviation 48, how
large a sample must be taken so that the probability will be at least 0.95
that the sample mean falls between 370 and 380.