Handout#3 - Statistical Inference, Z and T Test
Handout#3 - Statistical Inference, Z and T Test
STATISTICAL INFERENCE
The theory of statistical inference consists of methods by which one makes inferences or
generalizations about a population. Generally, there are two major areas of statistical inference. These are
the estimation of parameters and the tests of hypothesis.
WHAT IS A HYPOTHESIS?
A hypothesis is a statement or tentative theory which aims to explain facts about the real world.
Most hypothesis have their origin in a question to some practical problem. In search for an answer, an
“educated guess” and pertinent evidences are brought out which later on are turned into propositions or
“hypotheses”.
These hypotheses are then subjected to testing. If they are found to be statistically true, they are
ace[ted; if they are found to be false, they are rejected.
The hypothesis that we hope to reject or accept is called the null hypothesis and is denoted by H0.
Being the starting point of the testing process, it serves as our working hypothesis. A null hypothesis must
always express the idea of nonsignificance of difference.
The rejection of H0 implies our acceptance of an alternative hypothesis, which we denote as Ha. The
alternative hypothesis generally represents the hypothetical statement that the researcher wants to prove.
To summarize:
- Rejection of H0 implies acceptance of Ha
- Acceptance of H0 implies rejection of Ha
LEVEL OF SIGNIFICANCE
The probability of making a type I or alpha error in a test is called the significance level of a test.
The significance level of a test is the maximum value of the probability of rejecting the null hypothesis
when in fact it is true. For tests involving statistical decision, it is customary to use a level of significance of
5% or 1%. A 5% significance level means that we can accept about 5 chances in 100 that we would reject the
null hypothesis when it should be accepted. A 5% significance level implies that we are 95% sure or
confident of making the right decision.
1. Formulate the null hypothesis that there is no significant difference between the items being
compared. State the alternative hypothesis which is used in case the null hypothesis is rejected.
3. Determine the test to be used. Use the z-test if population standard deviation is given and the t-
test if the standard deviation is given from the samples.
4. Determine the tabular value for the test. For a z-test, use the table of critical values of z based on
the area of the normal curve. For a t-test, one must first compute for the degrees of freedom,
then look for the tabular value from the table of t-distribution. For a single sample, df = n – 1.
For two samples, df = n1 + n2 – 2
6. Compare the computed value with its corresponding tabular value, then state your conclusion
based on the following guidelines:
a. Reject the null hypothesis if the absolute computed value is equal to or greater than the
absolute tabular value.
b. Accept the null hypothesis if the absolute computed value is less than the absolute
tabular value.
SAMPLE PROBLEMS:
1. Data from a school census showed that the mean weight of college students was 45 kilos, with a
standard deviation of 3 kilos. A sample of 100 college students were found to have a mean weight
of 47 kilos. Are the 100 college students really heavier than the rest using 0.05 level of significance?
2. A researcher wishes to find out whether or not there is significant difference between the monthly
allowances of morning and afternoon students in his school. By random sampling, he took a sample
of 239 students in the morning session. These students were found to have a mean monthly
allowance of P142.00 The researcher also took a sample of 209 students in the afternoon session.
They were found to have a mean monthly allowance of P148.00 The total population of that school
has a standard deviation of P40.00 Is there significant difference between the two samples at 0.01
level of significance?
3. A sample survey of a television program in Metro Manila shows that 80 of 200 men dislike the same
program. We want to decide whether the difference between the two sample proportions, 80/200 =
0.40 and 75/250 = 0.30, is significant or not at 0.05 level of significance?
4. A researcher knows that the average height of Filipino women is 1.525 meters. A random sample of
26 women was taken and was found to have a mean height of 1.56 meters, with standard deviation
of 0.10 meters. Is there reason to believe that the 26 women are significantly taller than the others at
0.05 level of significance?
5. A teacher wishes to test whether or not the Case Method of teaching is more effective than the
Traditional Method. She picks two classes of approximately equal intelligence (verified through an
administered IQ test). She gathers a sample of 18 students to whom she uses the Case Method and
another sample of 14 students to whom she uses the Traditional Method. After the experiment, an
objective test revealed that the first sample got a mean score of 28.6 with a standard deviation of 5.9
while the second group got a mean score of 21.7 with a standard deviation of 4.6 Based on the
result, can we say that the Case Method is more effective than the Traditional Method?
EXERCISE/ACTIVITY:
1. Beta Company is manufacturing steel wire with an average tensile and strength of 50 kilos. The
laboratory tests 16 pieces and finds that the mean is 47 kilos and the standard deviation is 15 kilos.
Are the results in accordance with the hypothesis that the population mean is 50 kilos?
2. Two types of rice varieties are being considered for yield and a comparison is needed. Thirty
hectares were planted with the rice varieties exposed to fairly uniform growing conditions. The
results are obtained as follows:
Variety A Variety B
Average Yield 80 sacks/hectare 35 sacks/hectare
Sample variance 5.9 12.1
At 0.05 significance level, can we conclude that variety A is the better type?
3. Two methods of teaching statistics are being tried by a professor. A class of 40 students is taught by
Method A and a class of 36 is taught by Method B. The two classes are given the same final
examination. The scores are: mean of class for Method A 78 while the mean of the class for Method
B is 74. Using a 0.01 significance level, can we conclude that the average final examination
grade/scores produced by the two methods are different if the population standard deviation is 5?