Unit V Small Sample Tests
Unit V Small Sample Tests
UNIT-V
SMALL SAMPLES
If the size of the sample < 30 the sample is said to be small sample or exact sample.
There are three kinds of tests for small samples.
1. Students t test using t - distribution
2. 𝜒 2 𝑡𝑒𝑠𝑡 𝑢𝑠𝑖𝑛𝑔 χ2 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑖𝑜𝑛
3. F – test using F – distribution
DEGREES OF FREEDOM:
Degrees of freedom one number of values we can choose freely.
For example: If 𝑥1 + 𝑥2 + 𝑥3 = 50 to find 𝑥3 we have to assign any value to the two of the
variables (say 𝑥1 , 𝑥2). Similarly for four we require three values.
Degrees of freedom for 2 sample values = 2 – 1 = 1
Degrees of freedom for n sample = n – 1
SMALL SAMPLE TESTS :
(i) t-test:
Name of the test Null Level of Test statistic
Hypothesis𝑯𝟎 significance(𝜶)
1.Test for single mean 𝜇=𝜇0 5% or 1% or 10% 𝑥̅ −𝜇
t= 𝑠
√𝑛−1
2.Test for difference 𝜇 1= 𝜇 2 5% or 1% or 10% 𝑥̅ − 𝑦̅
t= 1 1
of means 𝑠 √𝑛 +𝑛
1 2
3. paired t-test 𝜇 =0 5% or 1% or 10% 𝑑̅
=𝑠
√𝑛
(ii) F-test:
Test for difference of variance
𝐻0 : 𝜎1 2 = 𝜎2 2
𝛼 : 5% or 10%.
𝑆 2 𝐺𝑟𝑒𝑎𝑡𝑒𝑟 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
Test statistic F = 𝑆12 = 𝑠𝑚𝑎𝑙𝑒𝑟 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
2
𝟐
(iii) Chi-square (𝝌 )test:
Test for goodness of fit and test for independence of attributes.
(𝑂−𝐸)2
Test statistic 𝜒 2 = ∑ 𝐸
T – DISTRIBUTION:
T – Distribution is used when population standard deviation is unknown.
Let 𝑥̅ be the mean of a random sample of size n, taken from a normal population having the
(𝑥𝑖 −𝑥̅ )2
mean μ and variance 𝜎 2 and sample variance 𝑆 2 = ∑𝑛𝑖=1 then test statistic is
𝑛−1
𝑥̅ −𝜇 1
𝑡= 𝑤ℎ𝑒𝑟𝑒 𝑥̅ = ∑𝑛𝑖=1 𝑥𝑖 , 𝑛 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒 (𝑛 − 1) = 𝑑𝑒𝑔𝑟𝑒𝑒𝑠 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚
𝑆⁄√𝑛 𝑛
PROPERTIES OF T – DISTRIBUTION:
1. The shape of t – distribution is bell – shaped which is similar to that of a normal distribution.
Page 1
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
2. The mean of standard normal distribution and as well as t – distribution is zero. Variance of t
– distribution depends on degrees of freedom V.
3. The variance of t – distribution approaches 1 as n → ∞. In fact t – distribution with v –
degrees of freedom approaches standard normal distribution as 𝑉 = (𝑛 − 1) → ∞
APPLICATIONS OF THE T – DISTRIBUTION:
1. To test the significance of sample mean, when population variance is not given.
2. To test the significance of the mean of sample, i.e., to test if the sample mean differs
significantly from the population mean.
3. To test the significance difference between two sample means or to compare two samples.
4. To test the significance of an observed sample correlation coefficient, sample regression
coefficient.
2. CHI – SQUARED (𝝌𝟐 ) DISTRIBUTION:
Chi – squared distribution is a continuous probability distribution of continuous random
variable X with probability density function is given by
𝑣 𝑥
1
𝑉 𝑉 𝑥 2−1 . 𝑒 −2 , 𝑓𝑜𝑟 𝑥 > 0
𝑓(𝑥) = { 2 2 Γ( 2 )
0, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
𝑤ℎ𝑒𝑟𝑒 𝑉 𝑖𝑠 𝑡ℎ𝑒 + 𝑣𝑒 𝑖𝑛𝑡𝑒𝑔𝑒𝑟 𝑎𝑙𝑠𝑜 𝑘𝑛𝑜𝑤𝑛 𝑎𝑠 𝑑𝑒𝑔𝑟𝑒𝑒𝑠 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚.
𝜒 2 distribution was used to measure of goodness of fit and to test independent attributes.
PROPERTIES OF 𝝌𝟐 DISTRIBUTION:
1. 𝜒 2 distribution curve is not symmetrical, lies entirely in the first quadrant. It is not a normal
curve since 𝜒 2 varies from 0 𝑡𝑜 ∞.
Page 2
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
2. Test of goodness of fit are used when we want to determine whether an actual sample
distribution matches unknown theoretical distribution. This test is known as "𝜒 2 − 𝑡𝑒𝑠𝑡" of
goodness of fit.
Test statistic for 𝜒 2 – test is
(𝑂−𝐸)2
𝜒2 = ∑ ( ) 𝑤ℎ𝑒𝑟𝑒 𝑂 = 𝑂𝑏𝑠𝑒𝑟𝑣𝑒𝑑 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦; 𝐸 = 𝐸𝑥𝑝𝑒𝑐𝑡𝑒𝑑 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦
𝐸
PROPERTIES OF F – DISTRIBUTION:
i) F – distribution is free from population parameter and depends upon degrees of freedom only.
ii) F – distribution curve lies entirely in first quadrant.
iii) The F – curve depends not only on the two parameters 𝑉1 𝑎𝑛𝑑 𝑉2 but also on the order in
which they are stated.
1
iv) 𝐹1−𝛼 (𝑉1, 𝑉2 ) =
𝐹𝛼 (𝑉1 ,𝑉2 )
Page 3
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
𝑥̅ −𝜇
1. For student t – test statistic when S.D. is t is defined as 𝑡 = 𝜎⁄
√𝑛−1
Page 4
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
5. Conclusion : |𝑡| = 1.22 < 𝑡0.025 = 2.06 𝑓𝑜𝑟 24 𝑑𝑒𝑔𝑟𝑒𝑒𝑠 𝑜𝑓 𝑓𝑟𝑒𝑒𝑑𝑜𝑚,
∴ 𝐴𝑐𝑐𝑒𝑝𝑡 𝑛𝑢𝑙𝑙 ℎ𝑦𝑝𝑜𝑡ℎ𝑒𝑠𝑖𝑠 𝐻0, i.e., the claim that the average life of the bulbs produced by
the company is 1600 hrs is acceptable.
5. The average breaking strength of steel rods is specified to be 18.5 thousand pounds to test the
sample of 14 rods were tested. The mean and S.D. obtained is 17.85 and 1.955. The result of
experiment significance?
Sol: Given sample size 𝑛 = 14, 𝑠𝑎𝑚𝑝𝑙𝑒 𝑚𝑒𝑎𝑛 𝑥̅ = 17.85,
𝑆. 𝐷. (𝑆) = 1.955
𝑀𝑒𝑎𝑛 𝑜𝑓 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝜇 = 18.5
Degrees of freedom = 𝑛 − 1 = 13
1. Null hypothesis 𝐻0 : 𝑇ℎ𝑒 𝑟𝑒𝑠𝑢𝑙𝑡 𝑜𝑓 𝑡ℎ𝑒 𝑒𝑥𝑝𝑒𝑟𝑖𝑚𝑒𝑛𝑡 𝑖𝑠 𝑛𝑜𝑡 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑡 𝜇 = 18.5
2. Alternative hypothesis 𝐻1 : 𝜇1 ≠ 18.5 (Two tail)
3. Level of significance 𝛼: 0.05
𝑥̅ −𝜇 17.85−18.5 −0.65
4. Test statistic 𝑡: 𝑡 = ⁄ = = = −1.199
𝑆 √𝑛−1 1.955/√13 0.542
|𝑡| = 1.199 ⇒ Calculated t = 1.199
5. Conclusion: Tabulated value at 5% level of significance for 13 degrees of freedom for two
tailed test = 2.16 , since calculated value < tabulated t we accept H0 at 5% level of
significance.
6. A random sample of six steel beams has a mean compressive strength of 58,392 (pounds per
square inch), with S.D. of 648 p.s.i. Use this information and the level of significance α = 0.05
to test whether the true average compressive strength of the steel from which this sample
came is 58,000 p.s.i.
Sol: Given sample size 𝑛 = 6, 𝑠𝑎𝑚𝑝𝑙𝑒 𝑚𝑒𝑎𝑛 𝑥̅ = 58392 𝑝𝑠𝑖,
𝑆. 𝐷. (𝑆) = 648𝑝𝑠𝑖
𝑀𝑒𝑎𝑛 𝑜𝑓 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝜇 = 58000
Degrees of freedom = 𝑛 − 1 = 5
1. Null hypothesis 𝐻0 : 𝜇 = 58000
2. Alternative hypothesis 𝐻1 : 𝜇1 ≠ 58000 (Two tail)
3. Level of significance 𝛼: 0.05
𝑥̅ −𝜇 58392−58000
4. Test statistic 𝑡: 𝑡 = ⁄ = = 1.35
𝑆 √𝑛−1 648/√5
|𝑡| = 1.35
Calculated t = 1.35
5. Conclusion : Tabulated value = 3.365
Since calculated value < tabulated t we accept H 0.
PROBLEMS RELATED TO STUDENTS T – TEST
(WHEN S.D. OF SAMPLE IS NOT GIVEN DIRECTLY) :
1. A random sample of 10 boys has the following I.Qs 70, 120, 110, 101, 88, 83, 95, 98, 107,
100. (a) Do these data support the assumption of population mean IQ of 100.
(b) Find reasonable range in which most of the mean IQ values of samples of 10 boys lie.
Sol: (a) Here S.D. and mean of the sample is not given directly.
Given data 70, 120, 110, 101, 88, 83, 95, 98, 107, 100.
∑𝑥 972
Mean 𝑥̅ = 𝑛 = 10 = 97.2
𝒙 𝒙−𝒙 ̅ ̅)𝟐
(𝒙 − 𝒙
70 -27.2 739.84
120 22.8 519.84
Page 5
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
110 12.8 163.84
101 3.8 14.44
88 -9.2 84.64
83 -14.2 201.64
95 -2.2 4.84
98 0.8 0.64
107 9.8 96.04
100 2.8 7.84
1833.60
∑(𝑥𝑖 −𝑥̅ )2 1833.60
𝑆2 = = = 203.73
𝑛−1 9
𝑆 = √203.73 = 14.27
1. Null hypothesis 𝐻0 : 𝜇 = 100
2. Alternative hypothesis 𝐻1 : 𝜇1 ≠ 100 (Two tail)
3. Level of significance 𝛼: 0.05
𝑥̅ −𝜇 97.2−100
4. Test statistic 𝑡: 𝑡 = 𝑆⁄ = 14.27/√10 = −0.62
√𝑛
|𝑡| = 0.62 ⇒ Calculated t = 0.62
5. Conclusion : Tabulated value of t for 9 degrees of freedom at 5% level of significance is
2.26 (two tailed test) . Since calculated value < tabulated t we accept H 0, IQ μ = 100.
(b) Confidence limits
𝑆
𝑥̅ ± 𝑡𝛼 . = 97.2 ± 2.26 × 4.512 = (97.2 ± 10.198) = (87, 107.4)
√𝑛
2. The height of 10 males of given locality are found to be 70, 67, 62, 68, 61, 68, 70, 64, 64, 66
inches. Is it reasonable to believe that the average height is greater than 64 inches, Test at 5%
significance level assuming that for 9 degrees of freedom.
∑𝑥 660
Sol: Mean 𝑥̅ = 𝑛 = 10 = 66
𝒙 𝒙−𝒙 ̅ (𝒙 − 𝒙̅)𝟐
70 4 16
67 1 1
62 -4 16
68 2 4
61 -5 25
68 2 4
70 4 16
64 -2 4
64 2 4
66 0 0
660 90
∑(𝑥𝑖 −𝑥̅ )2 90
𝑆2 = = = 10
𝑛−1 9
𝑆 = √10 = 3.1
1. Null hypothesis 𝐻0 : 𝐴𝑣𝑒𝑟𝑎𝑔𝑒 ℎ𝑒𝑖𝑔ℎ𝑡 𝑖𝑠 𝑛𝑜𝑡 𝑔𝑟𝑒𝑎𝑡𝑒𝑟 𝑡ℎ𝑎𝑛 64 𝑖𝑛𝑐ℎ𝑒𝑠 𝜇 = 64
2. Alternative hypothesis 𝐻1 : 𝜇 > 64 (One tail)
3. Level of significance 𝛼: 0.05
𝑥̅ −𝜇 66−64 2
4. Test statistic 𝑡: 𝑡 = 𝑆⁄ = 3.16/√9 = 1.05 = 1.9
√𝑛
Page 6
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
|𝑡| = 1.9
Calculated t = 1.9
5. Conclusion : Tabulated value of t for 9 degrees of freedom at 5% level of significance for
single test is 1.833. Since calculated value > tabulated t we reject H0.
3. A random sample from a company's very extensive files shares that the order for certain kind
of machinery were filled respectively in 10, 12, 19, 14, 15, 18, 11 and 13 days use the level of
significance α = 0.01 to test claim that on the average such orders are filled in 10.5 days.
∑𝑥 112
Sol: n = 8, Mean 𝑥̅ = = = 14
𝑛 8
2 ∑(𝑥𝑖 −𝑥̅ )2 1
𝑆 = = 7 ((10 − 14)2 + (12 − 14)2 + (19 − 14)2 + ⋯ . +(13 − 14)2 )
𝑛−1
1 1
= 7 (16 + 4 + 25 + ⋯ . +1 + 16) = 7 (72) = 10.286
𝑆 = √10.286 = 3.207
1. Null hypothesis 𝐻0 : 𝜇 = 10.5 𝑑𝑎𝑦𝑠
2. Alternative hypothesis 𝐻1 : 𝜇 ≠ 10.5
3. Level of significance 𝛼: 0.01
𝑥̅ −𝜇 14−10.5
4. Test statistic 𝑡: 𝑡 = 𝑆⁄ = 3.207/√7 = 3.087
√𝑛
|𝑡| = 3.087
Calculated t = 3.087
5. Conclusion : Tabulated value of t for 7 degrees of freedom at 1% level of significance is
3.499. Since calculated value < tabulated t we accept H0.
STUDENTS T – TEST FOR DIFFERENCE OF MEANS:
To test the significant difference between the sample means 𝑥̅ 𝑎𝑛𝑑 𝑦̅ of two independent
samples of sizes 𝑛1 𝑎𝑛𝑑 𝑛2 with the same variance, we use statistic
𝑥̅ −𝑦̅ 𝑥̅ −𝑦̅
𝑡= 1 1
(𝑜𝑟) 1 1
√𝑆 2(𝑛 +𝑛 ) 𝑆√(𝑛 +𝑛 )
1 2 1 2
1 𝑛1 1 𝑛2
𝑥̅ = 𝑛 ∑𝑖=1 𝑥𝑖 , 𝑦̅ = 𝑛 ∑𝑖=1 𝑦𝑖
1 2
1 1
𝑆 = 𝑛 +𝑛 −2 [∑𝑖(𝑥𝑖 − 𝑥̅ )2 + ∑𝑖 (𝑦𝑖
2
− 𝑦̅)2 ] (𝑜𝑟) 𝑆 2 = 𝑛 [(𝑛1 − 1)𝑆12 + (𝑛2 − 1)𝑆22 ]
1 2 1 +𝑛2 −2
T – distribution follows (𝑛1 + 𝑛2 − 2) degrees of freedom.
PROBLEMS:
1. Samples of two types of electric light bulbs were tested for length of life and following data
were obtained.
Type I Type II
Sample size 𝑛1 = 8 𝑛2 = 7
Sample mean 𝑥̅ = 1234 ℎ𝑟𝑠 𝑦̅ = 1036 ℎ𝑜𝑢𝑟𝑠
Sample S.D. 𝑆1 = 36 ℎ𝑟𝑠 𝑆2 = 40 ℎ𝑟𝑠
Is the difference in the means sufficient to warrant that type I is superior to type II regarding
length of life.
Sol: 1. 𝐻0 : 𝜇1 = 𝜇2
2. 𝐻1 : 𝜇1 > 𝜇2 (Two tail)
3. 𝛼: 5%
4. Test statistic 𝑡:
1 1
𝑆 2 = 𝑛 +𝑛 −2 [(𝑛1 − 1)𝑆12 + (𝑛2 − 1)𝑆22 ] = 8+7−2 [7(36)2 + 6(40)2] = 1436.31
1 2
Page 7
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
𝑥̅ −𝑦̅ 1234−1036
𝑡= 1 1
= 1 1
= 10.09
√𝑆 2(𝑛 +𝑛 ) √1436( + )
1 2 8 9
5. Conclusion: 𝐷. 𝐹 = 8 + 7 − 2 = 13; |𝑡| > 1.77 (0.05 𝑙𝑒𝑣𝑒𝑙 − 1.77) 𝑓𝑟𝑜𝑚 𝑡𝑎𝑏𝑙𝑒
2. Two independent samples have following values
Sample I 11 11 13 11 15 9 12 14
Sample II 9 11 10 13 9 8 10 -
Test the difference between the means.
96 70
Sol: 𝑛1 = 8, 𝑛2 = 7, 𝑥̅ = = 12, 𝑦̅ = = 10
8 7
𝒙 𝒙−𝒙 ̅ (𝒙 − 𝒙̅)𝟐 𝒚 𝒚−𝒚 ̅ (𝒚 − 𝒚̅ )𝟐
11 -1 1 9 -1 1
11 -1 1 11 1 1
13 1 1 10 0 0
11 -1 1 13 3 9
15 3 9 9 -1 1
9 -3 9 8 -2 4
12 0 0 10 0 0
14 2 4
26 16
2 1 2 2 1 42
𝑆 = 𝑛 +𝑛 −2 [∑𝑖(𝑥𝑖 − 𝑥̅ ) + ∑𝑖 (𝑦𝑖 − 𝑦̅) ] = 8+7−2 [26 + 16] = 13 = 3.23
1 2
1. 𝐻0 : 𝜇1 = 𝜇2
2. 𝐻1 : 𝜇1 ≠ 𝜇2
3. 𝛼 = 0.05
𝑥̅ −𝑦̅
4. Test statistic : 𝑡 = 1 1
= 2.15
𝑆√( + )
𝑛1 𝑛2
5. Conclusion: 𝐴𝑡 13 𝑑. 𝑓. 0.05 𝑙𝑒𝑣𝑒𝑙 𝑜𝑓 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑐𝑒 2.15 < 2.16, therefore accept H0.
3. Two horses A and B were tested according to the time (in seconds) to run a particular track
with the following results
Horse A 28 30 32 33 33 29 34
Horse B 29 30 30 24 27 29
Test whether the two horses have to same running capacity.
Sol: 𝑛1 = 7, 𝑛2 = 6,
We first compute the sample means and S.D.
1
𝑥̅ = 𝑚𝑒𝑎𝑛 𝑜𝑓 𝑓𝑖𝑟𝑠𝑡 𝑠𝑎𝑚𝑝𝑙𝑒 = 7 (219) = 31.286,
169
𝑦̅ = 𝑚𝑒𝑎𝑛 𝑜𝑓 𝑠𝑒𝑐𝑜𝑛𝑑 𝑠𝑎𝑚𝑝𝑙𝑒 = = 28.16
6
𝒙 𝒙−𝒙 ̅ (𝒙 − 𝒙 ̅)𝟐 𝒚 𝒚−𝒚 ̅ (𝒚 − 𝒚̅ )𝟐
28 -3.286 10.8 29 0.84 0.7056
30 -1.286 1.6538 30 1.84 3.3856
32 0.714 0.51 30 1.84 3.3856
33 1.714 2.94 24 -4.16 17.3056
33 1.714 2.94 27 -1.16 1.3456
29 -2.286 5.226 29 0.84 0.7056
34 2.714 7.366
219 31.4358 169 26.8336
2 1 2 2 1
𝑆 = 𝑛 +𝑛 −2 [∑𝑖(𝑥𝑖 − 𝑥̅ ) + ∑𝑖 (𝑦𝑖 − 𝑦̅) ] = 11 [31.4358 + 26.8336]
1 2
Page 8
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
1
= 11 (58.2694) = 5.23 ⇒ 𝑆 = 2.3
1. 𝐻0 : 𝜇1 = 𝜇2
2. 𝐻1 : 𝜇1 ≠ 𝜇2
3. 𝛼 = 0.05
𝑥̅ −𝑦̅ 31.286−28.16
4. Test statistic : 𝑡 = 1 1
= 1 1
= 2.443
𝑆√( + ) (2.3)√(7+6)
𝑛1 𝑛2
Page 10
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
𝑑̅ −𝜇 1
𝑡 = 𝑆⁄ 𝑤ℎ𝑒𝑟𝑒 𝑑̅ = 𝑛 ∑ 𝑑𝑖
√𝑛
1 1 (∑ 𝑑) 2
2
𝑆 = ∑𝑛𝑖=1(𝑑𝑖 − 𝑑̅ )2 𝑜𝑟 𝑆 2 = [∑ 𝑑 2 − ]
𝑛−1 𝑛−1 𝑛
Above statistic follows students t – distribution with n – 1 degrees of freedom.
NOTE:
Take mean of differences 𝜇 = 0 𝑎𝑠 𝐻0 𝑎𝑛𝑑 𝜇 > 0 𝑎𝑠 𝐻1
PROBLEMS:
1. Ten workers were given training program with a view to study their assembly time for a
certain mechanism. The result of the time and motion studies before and after training
programme are given below.
Workers 1 2 3 4 5 6 7 8 9 10
X1 15 18 20 17 16 14 21 19 13 22
Y1 14 16 21 10 15 18 17 16 14 20
𝑋1 = Time taken for assembling before training
𝑌1 = Time taken for assembling after training
Test whether there is significant difference is assembly times before and after training.
Sol: Let μ be the mean of population of differences.
1. 𝐻0 : 𝜇 = 0 (𝑡𝑟𝑎𝑖𝑛𝑖𝑛𝑔 𝑖𝑠 𝑛𝑜𝑡 𝑢𝑠𝑒𝑓𝑢𝑙)
2. 𝐻1 : 𝜇 > 0
3. 𝛼: 0.05
4. Test statistic :
Computation Differences (before and after training)
1, 2, -1, 7, 1, -4, 4, 3, -1, 2
14
𝑑̅ = 10 = 1.4
1 1
𝑆2 = ∑𝑛𝑖=1(𝑑𝑖 − 𝑑̅ )2 = (82.4) = 9.1555
𝑛−1 9
𝑆 = 3.026
𝑑̅ −𝜇 1.4−0
𝑡 = 𝑆⁄ = 3.026⁄
√𝑛 √10
⇒ |𝑡| = 1.46
5. Conclusion : Table value with 9 d.f. at 0.05 is 1.833
𝑡𝑐𝑎𝑙 < 𝑡𝑡𝑎𝑏 ⇒ 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0 i.e., training is not useful.
2. Scores obtained in a shooting competition by 10 soldiers before and after intensive training
are given below:
Before 67 24 57 63 54 56 68 33 43
After 70 38 58 58 67 68 75 42 38
Test whether the intensive training is useful at 0.05 level of significance.
Sol: 1. 𝐻0 : 𝜇 = 0 (𝑡𝑟𝑎𝑖𝑛𝑖𝑛𝑔 𝑖𝑠 𝑛𝑜𝑡 𝑢𝑠𝑒𝑓𝑢𝑙)
2. 𝐻1 : 𝜇 > 0 (𝑡𝑟𝑎𝑖𝑛𝑖𝑛𝑔 𝑢𝑠𝑒𝑓𝑢𝑙)
3. 𝛼: 0.05
4. Test statistic :
Computation Differences -3, -14, -1, -3, 7, -13, -12, -7, -9.5
1 50
𝑑̅ = ∑10 𝑑 = − = −5
10 𝑖=1 𝑖 10
1 1
𝑆2 = ∑𝑛𝑖=1(𝑑𝑖 − 𝑑̅ )2 = (482) = 53.555
𝑛−1 9
𝑆 = 7.32
𝑑̅ −𝜇 −5−0
Test statistic 𝑡 = 𝑆⁄ = 7.32⁄ = −2.16
√𝑛 √10
Page 11
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
⇒ |𝑡| = 2.16
5. Conclusion: Table value with 9 d.f. at 0.05 is 1.83
𝑡𝑐𝑎𝑙 > 𝑡𝑡𝑎𝑏 ⇒ 𝑟𝑒𝑗𝑒𝑐𝑡 𝐻0 i.e., intensive training is useful.
3. Blood pressure of 5 women before and after intake of a certain drug are given below.
Before 110 120 125 132 125
After 120 118 125 136 121
Test whether there is significant change in blood pressure at 1% level of significance.
Sol: Differences -10, 2, 0, -4, 4
𝑑̅ = 2
𝑆 2 = 30 ⇒ 𝑆 = √30
1. 𝐻0 : 𝜇 = 0
2. 𝐻1 : 𝜇 > 0 (𝑜𝑛𝑒 𝑡𝑎𝑖𝑙)
3. 𝛼 ∶ 0.05
𝑑̅ −𝜇
4. Test statistic : 𝑡 = 𝑆⁄ = 0.82
√𝑛
5. Conclusion: t – table value for 4 d.f. is 4.6
𝑡𝑐𝑎𝑙 ≤ 𝑡𝑡𝑎𝑏𝑙𝑒 , accept 𝐻0 .
Page 12
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
∑(𝑦𝑖 −𝑦̅)2 102.6
𝑆22 = = = 11.4
𝑛2 −1 9
1.𝐻0 : 𝑆12 = 𝑆22
2.𝐻1 : 𝑆12 ≠ 𝑆22
3. 𝛼 ∶ 5%
𝑆2 12.057
Now 𝐹 = 𝑆12 = = 1.057
2 11.4
i.e., calculated F = 1.057
Tabulated value of F at 5% level for (7, 9) degrees of freedom is 3.29
𝐹𝑐𝑎𝑙 < 𝐹𝑡𝑎𝑏 𝑖. 𝑒. , 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0.
2. Two random samples gave the following results
Sample Size Sample mean Sum of squares of deviation from the mean
1 10 15 90
2 12 14 108
Test whether the samples came from the same normal population.
Sol: 𝑛1 = 10, 𝑛2 = 12, 𝑥̅ = 15, 𝑦̅ = 14
∑(𝑥𝑖 − 𝑥̅ )2 = 90; ∑(𝑦𝑖 − 𝑦̅)2 = 108
To test whether the samples drawn from same normal population.
i.e., 𝐻0 : 𝜇1 = 𝜇2 𝑎𝑛𝑑 𝜎12 = 𝜎22
Here we have to test
i) Test equality of variances by F - test
ii) Test equality of means by t – test
i) F – test (equality of variances)
1. 𝐻0 : 𝜎12 = 𝜎22
2. 𝐻1 : 𝜎12 ≠ 𝜎22
3. 𝛼 ∶ 5%
4. Test statistic :
Given 𝑛1 = 10, 𝑛2 = 12, ̅̅̅ 𝑋1 = 15, 𝑌̅1 = 14, ∑(𝑥𝑖 − 𝑥̅ )2 = 90, ∑(𝑦𝑖 − 𝑦̅)2 = 108
∑(𝑥𝑖 −𝑥̅ )2
𝑆12 = = 10
𝑛1 −1
∑(𝑦𝑖 −𝑦̅)2
𝑆22 = = 9.82
𝑛2 −1
𝑆12 10
𝐹= 𝑆22
= 9.82 = 1.018
i.e., calculated F = 1.018
5. Conclusion: Tabulated value of F at 5% level for (9, 11) degrees of freedom is 2.90
𝐹𝑐𝑎𝑙 < 𝐹𝑡𝑎𝑏 𝑖. 𝑒. , 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0. i.e., sample came from same normal population with same
variance.
ii) t – test : (to test equality of means)
Given 𝑛1 = 10, 𝑛2 = 12, ̅̅̅ 𝑋2 = 15, 𝑌̅2 = 14, ∑(𝑥𝑖 − 𝑥̅ )2 = 90, ∑(𝑦𝑖 − 𝑦̅)2 = 108
1 1
𝑆 2 = 𝑛 +𝑛 −2 [∑ (𝑥𝑖 − 𝑥̅ )2 + ∑ (𝑦𝑖 − 𝑦̅)2 ] = 20 [90 + 108] = 9.8 ⇒ 𝑆 = 3.15
1 2
1. 𝐻0 : 𝜇1 = 𝜇2
2. 𝐻1 : 𝜇1 ≠ 𝜇2
3. 𝛼: 0.05
𝑥̅ −𝑦̅ 15−14
4. Test statistic : 𝑡 = 1 1
= 1 1
= 0.74
𝑆√(𝑛 +𝑛 ) 315√10+12
1 2
5. Conclusion : Tabulated for 20 d.f. at 5% level (two tailed) is 2.086
Page 13
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
𝑡𝑐𝑎𝑙 < 𝑡𝑡𝑎𝑏 ⇒ 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0 𝑖. 𝑒. , 𝜇1 = 𝜇2
Given samples have been drawn from the same normal population.
3. Pumpkins were grown under two experimental conditions. Two random samples of 11 and 9
pumpkins shown the sample S.D. of their weights as 0.8 and 0.5 respectively. Assuming that
the weight distributions are normal. Test the hypothesis that true variance are equal.
Sol: 𝑛1 = 11, 𝑛2 = 9, 𝑆1 = 0.8, 𝑆2 = 0.5
Sample S.D's are given
Population variances 𝜎12 𝑎𝑛𝑑 𝜎22 use the relation
𝑛 𝑆2 11×(0.8)2
𝜎12 = 𝑛 1−11 = = 0.704;
1 10
𝑛2 𝑆22 9×(0.5)2
𝜎22 = 𝑛 = = 0.281
2 −1 8
1.𝐻0 : 𝜎12
= 𝜎22
2.𝐻1 : 𝜎12
≠ 𝜎22
3. 𝛼 = 0.05
𝜎2 0.704
4. Test statistic : 𝐹 = 𝜎12 = 0.281 = 2.5 ⇒ 𝐹 = 2.5
2
5. Conclusion : Tabulated value of F for (10, 8) d.f. at 5% level of significance is 3.35
𝐹𝑐𝑎𝑙 < 𝐹𝑡𝑎𝑏 𝑖. 𝑒. , 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0.
4. Time taken by workers in performing a job by method I and method II is given below:
Method I 20 16 26 27 23 22 -
Method II 27 33 42 35 32 34 38
Do the data show that the variances of time distribution from population from which these
samples are drawn do not differ significantly.
Sol: Calculation of sample variance
𝑛1 = 6, 𝑛2 = 7,
∑𝑥 134 241
𝑥̅ = 𝑛 = 6 = 22.3, 𝑦̅ = 7 = 34.4
1
𝒙 𝒙−𝒙 ̅ ̅)𝟐
(𝒙 − 𝒙 𝒚 𝒚−𝒚 ̅ (𝒚 − 𝒚̅ )𝟐
20 -2.3 5.29 27 -7.4 54.76
16 -6.3 39.69 33 -1.4 1.96
26 3.7 13.69 42 7.6 57.76
27 4.7 22.09 35 0.6 0.36
23 0.7 0.49 32 -2.4 5.76
22 -0.3 0.09 34 -0.4 0.16
38 3.6 12.96
81.34 133.72
∑(𝑥𝑖 −𝑥̅ )2 81.34
𝑆12 = = = 16.26
𝑛1 −1 5
∑(𝑦𝑖 −𝑦̅)2 133.72
𝑆22 = = = 22.29
𝑛2 −1 6
1.𝐻0 : 𝜎12 = 𝜎22
2.𝐻1 : 𝜎12 ≠ 𝜎22
3. 𝛼 = 0.05
𝑆2 22.29
4. Test statistic : 𝐹 = 𝑆22 = 16.26 = 1.3699
1
⇒ 𝐹 = 1.3699
5. Conclusion: Tabulated value of F for (5, 6) d.f. at 5% level of significance is 4.95.
𝐹𝑐𝑎𝑙 < 𝐹𝑡𝑎𝑏 𝑖. 𝑒. , 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0.
Page 14
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
5. The measurements of the output of two units have given the following results assuming that
both samples have been obtained from the normal population at 5% significant level test
whether the two populations have same variances.
Unit A 14.1 10.1 14.7 13.7 14.0
Unit B 14.0 14.5 13.7 12.7 14.1
Sol: 𝑛1 = 5, 𝑛2 = 5,
∑𝑥 66.6 69
𝑥̅ = 𝑛 = 5 = 13.32, 𝑦̅ = = 13.8
1 5
𝒙 𝒙−𝒙 ̅ ̅)𝟐
(𝒙 − 𝒙 𝒚 𝒚−𝒚 ̅ ̅ )𝟐
(𝒚 − 𝒚
14.1 0.78 0.6084 14.0 0.2 0.04
10.1 -3.22 10.3684 14.5 0.7 0.49
14.7 1.38 1.9044 13.7 -0.1 0.01
13.7 0.38 0.1444 12.7 -1.1 1.21
14.0 0.68 0.4624 14.1 0.3 0.09
13.488 1.84
∑(𝑥𝑖 −𝑥̅ )2 13.488
𝑆12 = = = 3.372
𝑛1 −1 4
∑(𝑦𝑖 −𝑦̅)2 1.84
𝑆22 = = = 0.46
𝑛2 −1 4
1.𝐻0 : 𝜎12 = 𝜎22
2.𝐻1 : 𝜎12 ≠ 𝜎22
3. 𝛼 = 0.05
𝑆2 3.372
4. Test statistic : 𝐹 = 𝑆12 = = 7.33
2 0.46
⇒ 𝐹 = 7.33
5. Conclusion: Tabulated value of F for (4, 4) d.f. at 5% level of significance is 6.39.
𝐹𝑐𝑎𝑙 > 𝐹𝑡𝑎𝑏 𝑖. 𝑒. , 𝑟𝑒𝑗𝑒𝑐𝑡 𝑛𝑢𝑙𝑙 ℎ𝑦𝑝𝑜𝑡ℎ𝑒𝑠𝑖𝑠 𝐻0 .
6. The nicotine contents in milligrams in the samples of tobacco were found to be as follows:
Sample A 24 27 26 21 25 -
Sample B 27 30 28 31 22 36
Can it be said that two samples came from same normal population.
Sol: 𝑛1 = 5, 𝑛2 = 6,
∑𝑥
𝑥̅ = 𝑛 = 24.6, 𝑦̅ = 29
1
̅)𝟐 = 21.2; ∑(𝒚𝒊 − 𝒚
∑(𝒙𝒊 − 𝒙 ̅)𝟐 = 108
∑(𝑥𝑖 −𝑥̅ )2 21.2
𝑆12 = = = 5.3
𝑛1 −1 4
∑(𝑦𝑖 −𝑦̅)2
𝑆22 = = 21.6
𝑛2 −1
F – test :
1. 𝐻0 : 𝜎12 = 𝜎22
2. 𝐻1 : 𝜎12 ≠ 𝜎22
3. 𝛼 = 5%
𝑆22 21.6
4. Test statistic : 𝐹 = =
𝑆12 5.3
𝐹𝑐𝑎𝑙 = 4.075
5. Conclusion: 𝐹𝑡𝑎𝑏 𝑎𝑡 (5. 4) d.f. is 6.2.
𝐹𝑐𝑎𝑙 ≤ 𝐹𝑡𝑎𝑏𝑙𝑒 , 𝐴𝑐𝑐𝑒𝑝𝑡 𝐻0.
T – test for difference of mean :
Page 15
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
1. 𝐻0 : 𝜇1 = 𝜇2
2. 𝐻1 : 𝜇1 ≠ 𝜇2
3. 𝛼 = 5%
𝑥̅ −𝑦̅ 24.6−29
4. Test statistic : 𝑡 = 1 1
= 1 1
= −1.92
𝑆√ + 3.78√5+6
𝑛1 𝑛2
Page 17
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
2. 𝐻1 ∶ The dice is biased
3. 𝛼 = 5%
264
4. Test statistic : The expected frequency of each of the numbers 1, 2, 3, 4, 5, 6 is = 44
6
Observed Expected frequency (O – E)2 (𝑶 − 𝑬)𝟐
frequency (O) (E) 𝑬
40 44 16 0.3636
32 44 144 3.2727
28 44 256 5.8181
58 44 196 4.4545
54 44 100 2.2727
52 44 64 1.4545
264 264 17.6362
(𝑂𝑖 −𝐸𝑖 )2
Calculated 𝜒 2 = ∑𝑛𝑖=1 = 17.6362
𝐸𝑖
2 2
5. Conclusion : Degrees of freedom n – 1 = 5 at 5% is 11.07; 𝜒𝑐𝑎𝑙 > 𝜒𝑡𝑎𝑏 , 𝑖. 𝑒. , 𝑟𝑒𝑗𝑒𝑐𝑡 𝐻0 .
5. A pair of dice are thrown 360 times and the frequency of each sum is indicated below:
Sum 2 3 4 5 6 7 8 9 10 11 12
Frequency 8 24 35 37 44 65 51 42 26 14 14
Would you say that the dice are fair on the basis of the Chi Square test at 0.05 level of
significance.
Sol: 1. 𝐻0 ∶ 𝑑𝑖𝑐𝑒 𝑎𝑟𝑒 𝑓𝑎𝑖𝑟
2. 𝐻1 : 𝑑𝑖𝑐𝑒 𝑖𝑠 𝑛𝑜𝑡 𝑓𝑎𝑖𝑟
3. 𝛼 = 0.05
4. Test statistic :
(x) 2 3 4 5 6 7 8 9 10 11 12
P(x) 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36
Page 18
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
Sol: Given data:
(x) 0 1 2 3 4
(f) 214 92 20 3 1 ∑ 𝒇 =330
∑ 𝑓𝑥 0×214+1×92+2×20+3×3+4×1 145
Mean 𝜇 = ∑𝑓
= = 330 = 0.439
330
Mean 𝜇 = 𝜆 = 0.439
𝑒 −𝜆 𝜆𝑥
𝑃(𝑥) = 𝑥!
𝑒 −𝜆 𝜆0
𝑃(0) = = 𝑒 −𝜆 = 𝑒 −0.439 = 0.6446
0!
Recurrence formula for Poisson distribution
𝜆
𝑃(𝑥 + 1) = 𝑥+1 𝑃(𝑥)
𝜆
For 𝑥 = 0, 𝑃(1) = 1 𝑃(0) = 0.6446 × 0.439 = 0.28297
0.439 0.439
For 𝑥 = 1, 𝑃(2) = 𝑃(1) = × 0.2829 = 0.06211
2 2
0.439 0.439
For 𝑥 = 2, 𝑃(3) = 𝑃(2) = × 0.06211 = 0.0091
3 3
0.439 0.439
For 𝑥 = 3, 𝑃(4) = 4 𝑃(3) = × 0.0091 = 0.0009
4
N = 330
1. 𝐻0 ∶ Fitting is good
2. 𝐻1 ∶ Fitting is not good
3. 𝛼 = 5%
4. Test statistic :
x Observed Expected frequency (O – E) (𝑶 − 𝑬)𝟐
frequency (O) (E) N× P(x) 𝑬
0 214 213 1 0.0046
1 92 93 -1 0.0107
2 20 20 0 0
3 3 3 0 0
4 1 1 0 0
0.015
(𝑂𝑖 −𝐸𝑖 )2
Calculated 𝜒 2 = ∑𝑛𝑖=1 = 0.00606
𝐸𝑖
5. Conclusion : 𝜒 2 table value at 3 d.f. at 5% level is 7.815
2 2
𝜒𝑐𝑎𝑙 < 𝜒𝑡𝑎𝑏 , 𝑖. 𝑒. , 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0.
CHI – SQUARED TEST FOR INDEPENDENCE OF ATTRIBUTES:
* Attributes means quality or characteristic examples of attributes are blindness, honesty, beauty
etc.
* An attribute may be marked by its presence or absence in number of a given population
* Let us consider two attributes A and B. A is divided into two classes and B is divided into two
classes.
* The various cell frequencies can be expressed in the following table.
A a B
B c D
a B a+b
c D c+d
a+c b+d N
Page 19
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
Expected frequencies are given by
Page 22
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
30 36.1 37.21 1.031
65 50.3 216.09 4.3
50 30 400.0 13.333
60 46 196 4.261
30 64 1156.0 18.062
60.2425
(𝑂𝑖 −𝐸𝑖 )2
5. Test statistic 𝜒 2 = ∑𝑛𝑖=1 = 60.2425
𝐸𝑖
Degrees of freedom = (3 – 1)(3 – 1) = 4 d.f. at 5% level of significance is 9.488.
2 2
𝜒𝑐𝑎𝑙 > 𝜒𝑡𝑎𝑏 , 𝑖. 𝑒. , 𝑟𝑒𝑗𝑒𝑐𝑡 𝐻0 .
EXAMPLE:
A firm manufacturing rivets wants to limit variations in their length as much as possible. The
lengths in (cms) of 10 rivets manufactured by new process are
2.15 1.99 2.05 2.12 2.17
2.01 1.98 2.03 2.25 1.93
Examine whether the new process can be considered superior to the old if the old population
has S.D. of 0.145 cm.
∑𝑥 20.68
Sol: We have 𝑛 = 10, 𝑥̅ = 𝑁 𝑖 = 10 = 2.068
∑(𝑥 −𝑥̅ )2 0.09096
𝑆 2 = 𝑛−1 𝑖
= 9 = 0.01010
𝜎 = 0.145
1. 𝐻0 ∶ 𝜎 2 = 𝜎02 (𝑜𝑟)𝑁𝑒𝑤 𝑝𝑟𝑜𝑐𝑒𝑠𝑠 𝑠𝑢𝑝𝑒𝑟𝑖𝑜𝑟 𝑡𝑜 𝑜𝑙𝑑
2. 𝐻1 : 𝜎 2 ≠ 𝜎02
3. 𝛼: 0.05
𝑛𝑆 2 10(0.01010)
𝜒2 = = = 4.8
𝜎2 (0.145)2
𝜒 2 table value of 9 d.f. is 16.916.
2 2
𝜒𝑐𝑎𝑙 < 𝜒𝑡𝑎𝑏 , 𝑖. 𝑒. , 𝑎𝑐𝑐𝑒𝑝𝑡 𝐻0.
Key Points:
SMALL SAMPLE TESTS :
(i) t-test:
Name of the test Null Level of Test statistic
Hypothesis𝑯𝟎 significance(𝜶)
1.Test for single mean 𝜇=𝜇0 5% or 1% or 10% 𝑥̅ −𝜇
t= 𝑠
√𝑛−1
2.Test for difference 𝜇 1= 𝜇 2 5% or 1% or 10% 𝑥̅ − 𝑦̅
t= 1 1
of means 𝑠 √𝑛 +𝑛
1 2
Page 23
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
3. paired t-test 𝜇 =0 5% or 1% or 10% 𝑑̅
=𝑠
√𝑛
(ii) F-test:
Test for difference of variance
𝐻0 : 𝜎1 2 = 𝜎2 2
𝛼 : 5% or 10%.
𝑆1 2 𝐺𝑟𝑒𝑎𝑡𝑒𝑟 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
Test statistic F = =
𝑆2 2 𝑠𝑚𝑎𝑙𝑒𝑟 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
Page 24
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
2. To examine the hypothesis that the husbands are more intelligent than the wives , an investigator
took a sample of 10 couples and administered them a test which measures the I.Q. .the results are as
follows .
Husband 117 105 97 105 123 109 86 78 103 107
Wives 106 98 87 104 116 95 90 69 108 85
Test the hypothesis with a reasonable test at the level of siginificance of 0.05 .
3 . two independent samples of 8 & 7 items respectively had the following values.
Sample I 11 11 13 11 15 9 12 14
Sample II 9 11 10 13 9 8 10
Is the difference between the means of samples siginificant ?
4 . Two compare two kinds of bumper guards , 6 of each kind were mounted on a car and then the car was
run into a concrete wall . The following are the costs of repairs
Guard 1 107 148 123 165 102 119
Guard 2 134 115 112 151 133 129
Use the 0.01 level of significance to test whether the difference between two sample means is significant.
5 . the table gives the biological values of protein from 6 cow’s milk and 6 buffalo’s milk. Examine the
difference are significant.
cow’s milk 1.8 2.0 1.9 1.6 1.8 1.5
buffalo’s milk 2 1.8 1.8 2.0 2.1 1.9
********
Page 26
PROBABILITY & STATISTICS SMALL SAMPLE TESTS
Page 27