Criminal Justice
Criminal Justice
2 Summary
Confidence intervals for the difference between the means of two populations or the mean
responses to two treatments μ1 and μ2 are based on the difference 1 - 2 between the sample
means.
Before estimating μ1 μ2, we need to check for independence and that the sampling distribution
of 1-2 is approximately Normal. The required conditions are:
Random: The data come from two independentrandom samples or from two groups in a
randomized experiment.
O 10%: When sampling without replacement, n1 < 0.10N1 and n2 < 0.10N2. Norma Large
Sample: For each sample, the corresponding population distribution (or the true distribution of
response to the treatment) is Normal or the sample size is large (n ≥ 30). For each sample, if the
population (treatment) distribution has unknown shape and n < 30, confirm that a graph of the
sample data shows no strong skewness or outliers.
•
•
·
When conditions are met, a C% confidence interval for μ1 - μ2 is
(1-2)+t*
n1 n2
-
where t* is the critical value with C% of its area between -t* and t* for the tdistribution with
degrees of freedom from either Option 1 (technology) or Option 2 (the smaller of μ2.
1 and n21). This is called a two-sample tintervalforμ1
Paired data result from recording two values of the same quantitative variable for each
individual or for each pair of similar individuals.
To analyze paired data, start by computing the difference for each pair. Then make a graph of
the differences. Use the mean difference diff and the standard deviation of the differences & diff
as summary statistics.
Before estimating diff, we need to check for independence between the differences and that the
sampling distribution of a diff is approximately Normal. The required conditions
are:
Random: Paired data come from a random sample from the population of interest or from a
randomized experiment.
10%: When sampling without replacement,n diff<0.10Ndiff
Norma Large Sample: The population distribution of differences (or the true distribution of
differences in response to the treatments) is Normal or the number of
differences in the sample is large(ndiff ≥ 30). If the population (true) distribution of differences
has unknown shape and the number of differences in the sample is less than 30, a graph of the
sample differences shows no strong skewness or outliers.
When the conditions are met, a C% confidence interval for the true mean difference diff is
Sdiff
I
difft*
ndiff
where t* is the critical value for at distribution with df = ndiff - 1 and C% of its area between -t*
and t*. This is called a one-sample t intervalfor a mean difference or a paired t intervalfor a
mean difference.
The proper inference method depends on how the data were produced. For paired data, use a
one-sample tinterval for diff. For quantitative data that come from independent random samples
from two populations of interest or from two groups in a randomized experiment, use a
two-sample tinterval for μ12.
Be sure to follow the four-step process whenever you construct a confidence interval for a
difference between two means or for a mean difference.