Stats Mod6 18
Stats Mod6 18
Probability
Quarter 4 – Module 6:
Computing Test Statistic on
Population Mean
There are two specific test statistics used for hypothesis testing
concerning means: z-test and t-test.
Solution: Since σ is known and n ≥ 30, we will use z-test. Thus, we have:
1.5
𝑧=
0.8
𝐳 = 𝟏. 𝟖𝟕𝟓
Therefore, the computed z-value is 1.875.
Example 2: In the first semester of the school year, a random sample of 200
students got a mean score of 81.72 with a population standard deviation of
15 in Statistics and Probability test. The population mean is 79.83. Use 0.05
level of significance.
Solution: To answer the problem, let us first identify the given. We have:
𝑥̅ = 81.72 𝜇 = 79.83 𝜎 = 15 𝑛 = 200
Since σ is known and n ≥ 30, we will use z-test.
𝑥̅ − 𝜇 Use the formula for z-test.
𝑧= 𝜎
√𝑛
81.72 − 79. 83
𝑧=
15 Substitute the given value to the
√200 formula.
1. 89
𝑧=
15 Simplify.
14. 14
1. 89
𝑧=
1.06
Therefore, the computed z-value is
𝐳 = 𝟏. 𝟕𝟖𝟑 1.783.
−5
𝑧=
30 Simplify.
12.25
−5
𝑧=
2.45 Therefore, the computed z – value
𝐳 = − 𝟐. 𝟎𝟒𝟏 is -2.041.
Solution: Since σ is unknown and n < 30, we will use t-test. Thus, we have:
Statistics and
Probability
Quarter 4 – Module 7:
Drawing Conclusion About
Population Mean Based on
Test Statistic Value and
Table 1: z – Critical Value
Level of Significance
Type of Test
𝜶 = 1% 𝜶 = 2.5% 𝜶 = 5% 𝜶 = 10%
df = (n – 1)
1 6.311 12.706 31.821 63.657
2 2.920 4.303 6.065 9.925
3 2.353 3.182 4.541 5.841
4 2.132 2.776 3.747 4.604
5 2.025 2.571 3.365 4.032
6 1.943 2.447 3.143 3.707
7 1.895 2.365 2.998 3.499
8 1.860 2.306 2.896 3.355
9 1.833 2.262 2.821 3.250
10 1.812 2.228 2.764 3.169
11 1.796 2.201 2.718 3.106
12 1.782 2.179 2.681 3.055
13 1.771 2.160 2.650 3.012
14 1.761 2.145 2.624 2.977
15 1.753 2.131 2.602 2.947
16 1.746 2.120 2.583 2.921
17 1.740 2.110 2.567 2.898
18 1.734 2.101 2.552 2.878
19 1.729 2.093 2.539 2.861
20 1.725 2.086 2.528 2.845
21 1.721 2.080 2.512 2.831
22 1.717 2.074 2.508 2.819
23 1.714 2.069 2.500 2.807
24 1.711 2.064 2.492 2.797
25 1.708 2.060 2.485 2.787
26 1.706 2.056 2.479 2.779
27 1.703 2.052 2.473 2.771
28 1.701 2.048 2.467 2.763
29 1.699 2.045 2.462 2.756
30 1.697 2.042 2.457 2.750
31 1.695 2.040 2.453 2.744
32 1.694 2.037 2.449 2.738
33 1.692 2.035 2.445 2.733
34 1.691 2.032 2.441 2.728
35 1.690 2.030 2.438 2.724
36 1.688 2.028 2.434 2.719
37 1.687 2.026 2.431 2.715
38 1.686 2.024 2.429 2.712
39 1.685 2.023 2.426 2.708
40 1.684 2.021 2.423 2.704
42 1.682 2.018 2.418 2.698
44 1.680 2.015 2.414 2.692
46 1.679 2.013 2.410 2.687
48 1.677 2.011 2.407 2.682
50 1.676 2.009 2.403 2.678
60 1.671 2.000 2.390 2.660
Infinity 1.645 1.960 2.326 2.576
In a left-tailed test, if the computed value is less than the critical value,
we reject the null hypothesis and support the alternative hypothesis. But if
the computed value is greater than the critical value, we do not reject or we
fail to reject the null hypothesis and the alternative hypothesis is not
supported.
Rejecting the null hypothesis doesn’t mean that it is incorrect or the
alternative hypothesis is correct. The collected data suggest a sufficient
evidence to disprove the null hypothesis, hence we reject it.
Similarly, a failure to reject the null hypothesis does not mean that it is
true -only that the test did not prove it to be false. There is an insufficient
evidence to disprove the null hypothesis; hence we do not reject it.
Example 1: Compute for its value given the following information. Use 𝛼 =
0. 05. Interpret the result.
𝐻𝑜: 𝜇 = 70 𝑥̅ = 71.5 𝜇 = 70
𝐻𝑎: 𝜇 > 70 𝜎=8 𝑛 = 100
Solution: It is a one-tailed test, since it does mention about the direction of the
distribution (the alternative hypothesis uses the symbol >). Since σ is known and n ≥
30, we will use z-test. The level of significance is 0.05. From Table 1, the z-critical
value is 1.645. Thus, we have:
Non-Rejection Rejection Region
𝑥̅ − 𝜇 1.5
𝑧= 𝜎 𝑧= Region
8
ξ𝑛 10
71. 5 − 70 1. 5
𝑧= 𝑧=
8 0. 8
ξ 100 𝐳 = 𝟏. 𝟖𝟕𝟓
Decision: 1.645
The computed z-value is 1.875 which is greater than the critical value of 1.645.
Therefore, we reject the null hypothesis and support the alternative hypothesis.
Example 2: Compute for its value given the following information. Use 𝛼 =
0.01. Interpret the result.
𝐻𝑜: 𝜇 = 127 𝑥̅ = 124.5 𝜇 = 127
𝐻𝑎:𝜇 < 127 𝑠=5 𝑛 = 12
Solution: It is a left-tailed test, since it does mention about the direction of the
distribution (the alternative hypothesis uses the symbol <). Since σ is unknown
and n < 30, we will use t-test. The degree of freedom (df = n - 1) is 11 and 𝛼 =
0.01. Therefore, the t-critical value from Table 2 is -2.718. Thus, we have:
Rejection Acceptance or
𝑥̅ − 𝜇 −2. 5
𝑡= 𝑡= Region Non-Rejection
𝑠 5 Region
ξ𝑛 3.46
124. 5 − 127 −2.5
𝑡= 𝑡=
5 1.44
ξ 12 𝐭 = −𝟏. 𝟕𝟑𝟔
-6 -5 -4 -3 -2 -1 0 1 2 3 4 5
-2.718
Decision:
The computed t-value is greater than the t-critical value at 𝛼 = 0.01 (i. e.−1.736 >
−2.718. Since we have a left-tailed test, our conclusion is that we fail to reject the
null hypothesis.
It is a two-tailed test, since it does not mention about the direction of the
distribution. Since σ is unknown and n < 30, we will use t-test. The degree of
freedom (df = n - 1) is 25 and 𝛼 = 2.5%. Therefore, the t-critical value from Table
2 is 2.485. Thus, we have:
Non-Rejection
𝑥̅ − 𝜇 900 Region Rejection Region
𝑡= 𝑠 𝑡=
1 250
ξ𝑛 5.10
10 900 − 10 000 900
𝑡= 𝑡=
1 250 245. 10
ξ 26 𝐭 = 𝟑. 𝟔𝟕𝟏
-5 -4 -3 -2 -1 0 1 2 3 4 5
-2.485 2.485
Decision:
The absolute value of the computed t-value is greater than the absolute of the
critical t-value at 𝛼 = 0.025 (i.e. |3.671|> |2.485|). Therefore, we reject the null
hypothesis.
Conclusion:
We can conclude that there is enough evidence to reject the claim of the
government that P10,000 is the monthly expenses of a Filipino family with four
members.
Statistics and
Probability
Quarter 4 – Module 8: Solving
Problems Involving Test of
Hypothesis on Population
In testing hypothesis on the population means, follow the steps below:
𝑥̅ −𝜇
𝑧= 𝜎
𝑛 √
𝐳 = 𝟏. 𝟑𝟔𝟏
Step 3: Find the critical value and draw the critical region. Use the z-critical
value table.
Non-Rejection
Region
Rejection Region
1.361 1.645
Example 2: Blood glucose levels for obese teenagers have a mean of 120. A
researcher thinks that a diet high in raw cornstarch will have a positive or
negative effect on blood glucose levels. A sample of 25 patients who have tried
the raw cornstarch diet has a mean glucose level of 135 with a standard
deviation of 38. Test the hypothesis at 𝛼 = 0.10 that the raw cornstarch had an
effect.
Solution:
Given: 𝑥̅ = 135 𝜇 = 120 𝑠 = 38 𝑛 = 25 𝛼 = 0.10 𝑑𝑓 = 24
Step 1: State the null and alternative hypotheses.
𝐻𝑜: 𝜇 = 120 𝐻𝑎: 𝜇 ≠ 120
Step 2: Determine the test statistic, then compute its value.
Since it is the population mean being tested, the population standard deviation
is unknown, and 𝑛 < 30, the appropriate test statistic is the t-test.
t=
𝒕 = 𝟏. 𝟗𝟕𝟒
Since the t-computed value is 1.974 which is greater than the critical value
of 1.711, we reject the null hypothesis and support the alternative hypothesis.
We can conclude that there is enough evidence to support the claim that the raw
cornstarch had an effect on blood glucose levels.
Example 3: The average IQ of Senior High School students is 99 with a standard
deviation of 15. A researcher believes that the average IQ of Senior High School
students is lower. A random sample of 40 students was tested and got an average
of 95. Is there enough evidence to suggest that the average IQ is lower? Test the
hypothesis at 0.05 level of significance. Solution:
Given: 𝑥̅ = 95 𝜇 = 99 𝜎 = 15 𝑛 = 40 𝛼 = 0.05
Step 1: State the null and alternative hypotheses.
𝐻𝑜: 𝜇 = 99 𝐻𝑎: 𝜇 < 99
Step 2: Determine the test statistic, then compute its value.
Since the population mean is being tested, the population standard deviation 𝜎
is known, and 𝑛 > 30, the appropriate test statistic is the z-test.
𝑥̅ −𝜇
𝑧= 𝜎
𝑛 √
𝐳 = −𝟏. 𝟔𝟖𝟖
-1.645
Once you already know that you are dealing with a population proportion,
you can conduct the hypothesis test. You can start with the first step of a
hypothesis test which is to determine the hypotheses. In order to formulate null
and alternative hypotheses concerning population proportions, you can write
them in sentence form or you can use different symbols. Here, you will use the
symbol p for the population proportion.
Remember that the hypotheses are claims about the population
proportion, p. The null hypothesis states that the proportion is equal to a
specific value or the hypothesized proportion, po. On the other hand, the
alternative hypothesis is the competing claim that the population proportion is
less than, greater than, or not equal to po.
As a reminder, the null hypothesis is always a statement of equality. The
alternative hypothesis is always a statement of inequality, using the symbols <,
>, or ≠. Moreover, the hypotheses are stated in such a way that they are mutually
exclusive. That is, if one is true, the other must be false; and vice versa.
If you are going to write the null hypothesis in sentence form, you will
usually use “is” or “is equal to”. In symbols, you are going to use:
HO : p = po
In the given symbols as shown above, letters a and b are used in a one-
tailed test or one-sided tests (directional) while letter c is used for a twotailed
test (non-directional).
One-Tailed Two-Tailed
The next table below shows the null and alternative hypotheses stated
together with the types of hypothesis tests.
Two-Tailed Test Right-Tailed Test Left-Tailed Test
Null 𝐻𝑜:𝑝 = 𝑝𝑜 or 𝐻𝑜:𝑝 = 𝑝𝑜 or
Hypothesis 𝐻 𝑜 : 𝑝 = 𝑝𝑜
𝐻𝑜 : 𝑝 ≤ 𝑝𝑜 𝐻𝑜 : 𝑝 ≥ 𝑝𝑜
Alternative
𝐻𝑎: 𝑝 ≠ 𝑝𝑜 𝐻𝑎: 𝑝 > 𝑝𝑜 𝐻𝑎:𝑝 < 𝑝𝑜
Hypothesis
Illustrative Examples:
Example 1. It has been claimed that 40% of students in a particular senior
high school dislike Mathematics. When a survey was conducted by a researcher,
it showed that 145 of 800 students dislike Mathematics. Test if the claim was
different at α = 0.05 level.
Our cue word here is “different” which means “not the same” or “not
equal”. Therefore the alternative hypothesis is,
The proportion of students who dislike Mathematics is not equal to
40%.
In symbols, you can write,
Ha: p ≠ 0.40
40% is the hypothesized proportion; hence you have the null hypothesis
stated as
The proportion of qualified applicants in a particular job interview
is 40%.
And it can be written in symbols as
Ho: p = 0.40
Statistics and
Probability
Quarter 4 – Module 10:
Identifying Appropriate Test
Statistic Involving Population
Proportion
p= ____ %
𝒑̂ = 0.84
To change percent to
decimal, see examples
below:
1. 12% = 0.12
2. 5% = 0.05
3. 12.5% = 0.125
On the other hand, there are cases where we still need to calculate 𝒑̂.
Examples of these kinds are:
In this case, we need to solve for the value of the sample proportion
𝒑̂ (read as “p hat”).
The example below will help you understand better how we can easily
estimate the value of the sample proportion.
Remember that in a situation
describing a population
proportion/sample proportion, the
words “mean” or “average” are
notused.
Illustrative Example:
For a class project, a Grade 12 STEM student wants to estimate the
percentage of students in his school who are registered voters. From 45%
Grade 12 students, he surveys 500 students and finds that 200 are
registered voters. Determine the value of p and compute for the sample
proportion.
Solution:
The population proportion is the rate or percent used from the entire
Grade 12 students. Therefore:
Sample Proportion,
𝒑̂ = 0.4
1. The conditions for binomial experiment are met. That is, there is a fixed
number of independent trials with constant probabilities and each trial
has two outcomes that we usually classify as “success” (p) and
“failure” (q). The sum of p and q is 1. Hence, we can write p + q = 1
or q = 1 – p.
2. The conditions np ≥ 5 and nq ≥ 5 are both satisfied so that the binomial
distribution of sample proportion can be approximated by a normal
distribution with 𝜇 = 𝑛𝑝 and (However, the specific number
varies from source to source, some authors use 10 instead of 5
depending on how good an approximation one wants.)
Therefore, the formula for the value of z-test statistic for population
proportion would be:
Zcom
or Zcom
where:
zcom is the z-test statistic for proportion.
𝑝 is the sample proportion (
p is the hypothesized value of the population proportion.
n is the sample size or the number of observations in the
sample. q is equal to 1 – p.
Remember this formula because you are going to use this in Module
12 where the actual computation for the test statistic involving population
proportion will be held.
Statistics and
Probability
Quarter 4 – Module 11:
Identifying Appropriate
Rejection Region Involving
Population Proportion
There are two ways to test the hypothesis: with a p-value approach and
with a critical value approach. Here, we will consider the rejection region with
the critical value approach. The critical value enables us to reject or not the
null hypothesis. Also, it is calculated through alpha ( α ) levels and symbolized
by Z or Ztab.
This is the first statement in Activity 2: “The hypothesis that less than
20% of the population are right-handed” wherein Ha: p < 0.20 and it indicates a
left-tailed rejection region. Illustrating it in the normal curve, we will come up
with the picture below:
Rejection
Region Non-Rejection
(α) Region This is the
critical value.
Ztab
The illustration above is for you to visualize how the statement would
look like when put into the normal curve. Notice that the line represented by
ztab separates the curve into two regions. The shaded part is the rejection region
while the non-shaded part is the non-rejection region or the acceptance
region/area. Therefore, it is important that we determine the value of ztab or the
critical value. Now, let us proceed!
Let us now describe the following important terms that we will be needing
in our discussion.
We can use the table of critical values for the commonly used levels of
significance presented in the previous modules.
Level of Significance
Test Type
𝛼 = 0.01 𝛼 = 0.025 𝛼 = 0.05 𝛼 = 0.10
left-tailed test −2.33 −1.96 −1.645 −1.28
right-tailed test 2.33 1.96 1.645 1.28
two-tailed test ±2.575 ±2.33 ±1.96 ±1.645
Rejection Region
- the range of the values of the test value which indicates that there is a
significant difference and that the null hypothesis (Ho) should be rejected
Non-Rejection Region
- the range of the values of the test value which indicates that the difference
was statistically insignificant and that we failed to reject the null
hypothesis (Ho)
Illustrative Example1:
A sample of 100 students is randomly selected from Pinagpala High
School and 18 of them said they are left-handed. Test the hypothesis that less
than 20% of the students are left-handed by using 𝛼 = 0.05 as the level of
significance.
What to do:
a. Identify the level of significance.
b. Formulate the alternative hypothesis, Ha.
c. Determine the critical value, ztab.
d. Illustrate the rejection region in the normal curve.
Solution:
a. The level of significance is 𝛼 = 0.05.
b. The alternative hypothesis is Ha: p < 0.20.
It is one directional or left-tailed as determined by the term “less than”.
c. To determine the critical value using the table, we consider the intersection of
the row for the left-tailed test and the column for = 0.05. Hence, the table tells
us that the critical value is – 1.645.
d. Illustrating it under the normal curve makes:
Rejection
Region
𝛼 = 0.05 .
Non-rejection
Region
-3 -2 -1.645 0 1 2 3
From here, you will decide whether the null hypothesis will be rejected
or not, although that part will be discussed in the next module.
Illustrative Example 2:
The claim is made that 40% of tax filers use computer software to file
their taxes. In a sample of 50 tax filers, 14 used computer software to file their
taxes. If Ha: p < 0.40 at α = 0.025 where p is the population proportion who
use computer software to file their taxes. Determine the critical value, Ztab and
illustrate the rejection region in the normal curve.
Solution:
At α = 0.025 level of significance, with p < 0.40, by referring to the table
of the Level of Significance, it shows that the critical value or Ztab = –
1.96
Rejection
Region α = 0.025
Non-rejection
Region
Ztab = - 1.96
Illustrative Example 3:
Solution:
a. The level of significance is 𝛼 = 0.01.
b. The alternative hypothesis is p ≠ 0.40 due to the expression “is not 0.40 ”.
This explains why it is non-directional or two-tailed.
c. To determine the critical value using the table, we consider the intersection
of the row for the two -tailed test and the column f or 𝛼 = 0.01. Hence, the
table tells us that the critical value is ±2.575.
d. Illustrating the rejection region in the normal curve gives:
Rejection
Region Acceptance
𝛼
= 0.01 = 0.005
Region 2 2
𝛼
2
Z = -2.575 Z = 2.575
tab tab
Statistics and
Probability
Quarter 4 – Module 12:
Computing Test Statistic Value
Involving Population Proportion
It is observable that the previously cited situation did not use nor mention
words like “mean” or “average” but “percentage” instead. Also, it utilized count
data. Problems such as this involves population proportion. Inferences
involving proportions are made in the context of probability of “success”, p, in a
binomial distribution.
Zcom or Zcom
where:
Illustrative Example1:
Let us now determine the z-value in the situation presented
previously. To be able to solve it, we need to identify first the values of the
following:
Zcom = ?
78
Zcom
Illustrative Example 2:
Determine the value of Zcom given the following information:
p = 0.42
Sample Size: n = 150
Sample Proportion: 𝑝 = 0.45
Solution:
Zcom = ?
𝑝 = 0.45
p = 0.42
n = 150 q
= 1 – p = 1 – 0.42 = 0.58
Zcom
Zcom = 0.7444
Illustrative Example 3:
The claim is made that 40% of tax filers use computer software to file
their taxes. In a sample of 50, 14 used computer software to file their taxes.
To test Ho: p = 0.4 versus Ha: p > 0.4 at α= 0:05 where p is the population
proportion who use computer software to file their taxes. And to test using
the binomial distribution and test using the normal approximation to the
binomial distribution. Determine first the value of zcom.
Solution:
First, determine the value of the following:
Zcom = ?
p = 40% = 0.40 n = 50
q = 1 – p = 1 – 0.40 =
0.60
Zcom
Therefore, the computed z-value is Zcom = –1.739
Statistics and
Probability
Quarter 4 – Module 13:
Drawing Conclusions About Population
Proportion Based on Test
In drawing conclusions, there are two different approaches that you may
apply: the critical z-approach (computed z-value) and the P-value approach.
In applying the first approach which is determining the critical value (which you
were already taught in the previous modules), you need to consider the following:
Illustrative Example:
Example 1
a. Ho : p = 0.85
Ha : p < 0.85
b. Level of Significance: α = 0.01
c. Computed Test Statistic:
𝑋
𝑝=𝑛
𝒑̂ = 0.81
𝑝 −𝑝
𝑝 (1 −𝑝 )
z= √ 𝑛
z = -2.24
The alternative hypothesis is directional. Hence, one-tailed test shall
be used.
Using the Areas Under the Normal Curve Table, the critical value
is -2.326 at α = 0.01 level. There is a negative sign in the value due to
the direction of the alternative hypothesis.
d. DECISION: Since the computed test statistic (zcom) z = -2.24 does not fall
in the rejection region, fail to reject the null hypothesis (Ho).
P-VALUE APPROACH
What is P-value?
In critical value approach, a test statistic is compared with a critical value.
However, in p-value approach (short for probability value), probabilities or areas
are compared. P-value measures the consistency of the sample statistics with
the null hypothesis. High P-values mean that sample results are consistent with
a true null hypothesis while low P-values are not consistent. If the P value is
small enough, we can conclude that the sample is so incompatible with the null
hypothesis. Therefore, we can reject the null hypothesis for the entire population.
Solution:
z =
z =
z = 5.49
The p-value is represented in the graph below:
P=P(Z≥5.49)=0.0000⋯≈0
CONCLUSION: Because the p-value is smaller than the significance level
α=0.05, we can reject the null hypothesis. Again, we would
say that there is sufficient/enough evidence to conclude
that boys are more common than girls in the entire
population at α=0.05 level.
As should always be the case, the two approaches (critical value approach
and p-value approach) lead to the same conclusion.
Example 1
Given:
a. n= 50
b. = 0.01 significance level
c. H0 : The proportion of students that want to go to the zoo is 85%.
(H0: p = 0.85)
Ha: The proportion of students that want to go to the zoo is not 85%.
(Ha: p ≠ 0.85 )
d. p = 0.7554
Example 2
Given:
a. n= 150
b. = 0.1 significance level
c. Ho : The proportion of households that have three or more cell phones
is
30%. (Ho : p = 0.3)
Ho : The proportion of households that have three or more cell phones
is different from 30%. (Ha : p ≠ 0.3)
d. 𝑝 = 0.287
e. Zcom = 0.347
-1.64 Zcom=.347 1.64
0
DECISION/CONCLUSION: Fail to reject the null hypothesis (Ho). There is
insufficient evidence supporting that the proportion of households with three or
more cell phones is different from 30%.
NOTE:
Conclusions are answers in sentence form which include: 1) whether there
is enough evidence or not (based on the decision); 2) the level of significance; and
3) whether the original claim is supported or rejected.
Conclusions are based on the original claim which may be the null or
alternative hypothesis. The decisions are always based on the null hypothesis.
Original Claim
H0 Ha
Decision "REJECT" "SUPPORT"
Fail to reject H
0 There There isinsufficientevidence at
"INSUFFICIENT" is insufficientevidence the alpha level of significance
at the alpha level of to supportthe claim that(insert
significance original claim here)
.
to reject the claim that
(insert original claim
here).
NOTE:
If the null hypothesis isn’t rejected, this doesn’t necessarily mean that it’s
true. It simply means that there is not enough evidence to justify rejecting it.
Statistics and
Probability
Quarter 4 – Module 14:
Solving Problems Involving Test
of Hypothesis on Population
Just like in puzzles, you need to think of different ways on how you will be
able to solve it. Same with solving problems involving test of hypotheses on
population proportions, you need to follow important steps in order to arrive at
the correct answer.
Here are the five (5) steps in solving problems for a test of hypothesis on
the population proportion.
Remember:
Test statistic is a random variable calculated from a sample. You
can use test statistics to determine whether to reject the null hypothesis
or not. The test statistic compares your data with what is expected under
the null hypothesis. The test statistic is used to calculate the p-value.
A test statistic measures the degree of agreement between a sample
of data and the null hypothesis. Its observed value changes randomly from
one random sample to a different sample. A test statistic contains
information about the data relevant on deciding whether to reject the null
hypothesis or not.
STEP 4. CRITICAL VALUE/P-VALUE: Determine the critical value or p-
value.
𝑥̅ 𝑝 −𝑝 𝑝 −𝑝
𝑝= 𝑛
z= 𝑝𝑞
or z= 𝑝 (1 −𝑝 )
√𝑛 √
𝑛
NOTE:
Illustrative Examples
Example 1: Every year, the assigned teachers determine the Body Mass Index
(BMI) of students. In a certain public junior high school, a study
finds that 10% of Grade 7 students observed are underweight. A
sample of 780 Grade 7 students were randomly chosen and it was
found out that 125 of them are underweight. Is this claim different
for their grade level age? Use 0.05 level of significance.
SOLUTION:
𝑋
𝑝=𝑛 zc = 5.6
1
𝒑̂ = 0.16
𝑝 −𝑝
z= 𝑝 (1 −𝑝 )
√
𝑛
STEP 4: Determine the critical value.
NOTE: Since the alternative hypothesis is non-directional, the two- tailed test
shall be used. Divide α by 2, then subtract the quotient from 0.5.
Rejection Region
𝛼 𝛼
2
= 0.25 2
= 0.25
Rejection Region
𝑍𝛼
NOTE: Using the Areas Under the Normal Curve Table, critical
2
𝑣𝑎𝑙𝑢𝑒𝑠 at 0.05 level of significance are ± 1.96.
DECISION: Since the computed test statistic zcom = 2.0 is greater than the critical
value or it falls in the rejection region, reject the null hypothesis.
CONCLUSION: Therefore, we conclude that at 0.05 level of significance, there is
enough evidence that the percentage of Grade 7 students who are
underweight is different from 10%.
Statistics and Probability
Quarter 4 – Module 15:
Illustrating the Nature of Bivariate Data
Data that involve one variable is called univariate data. Univariate data are often
described using the measures of central tendency (mean or average, mode, and median),
variations, or other descriptive statistics. Here are examples of univariate data:
Data that involve two variables are called bivariate data. The statistical procedure used
to determine and describe the relationship between two variables is called correlation
analysis.
Scatter plot shows how points collected from a set of bivariate data are scattered on a
Cartesian plane. It gives a good visual picture of how two variables are related or
associated with one another in terms of form, trend, and variation of correlation. The form
of points in the scatter plot determines the shape of the correlation of the variables. The
trend determines the direction of the points, either the variables have positive, negative,
or no correlation. The variation or strength of correlation is based on the closeness of the
points on a trend line and it determines whether the variables have no, weak, moderate,
strong, or perfect correlation.
In constructing a scatter plot, you should know how to plot points in a
Cartesian plane. The independent variable will assume the values of x or abscissa while
the dependent variable will assume the values of y or ordinate.
Example 1:
The given numbers are the age of a person in years and his/her corresponding weight.
Age of a 11 12 13 14 15 16 17 18 19 20
person (x)
Weight (y) 40 42 38 35 45 51 48 48 50 47
Since the weight of an individual depends on his/her age, the independent variable
is the age of the person which is plotted horizontally. The dependent variable is the weight
of the person, which is plotted vertically as shown in the scatter plot below.
Example 2:
A Math teacher conducted a study regarding the performance of grade 11 students
in General Mathematics. Their average grades were taken at different time or period. The
data are given below.
From the data given, the independent variable is the order of the subject and the
dependent variable is the average grade. From this, order of the subject will be plotted on
the x-axis and grades will be plotted on the y-axis as illustrated below.
Example 3:
A researcher asked for the weight of 10 students together with the weight of their mother
(biological) and created a scatter plot as presented below.
Weight of mother 65 69 74 78 59 81 76 80 81 75
Weight of student 52 55 62 63 47 66 63 69 68 65
On the given, the independent variable is the weight of the mother while the dependent
variable is the weight of the student. The scatter plot is presented below.
Statistics and Probability
Quarter 4 – Module 17:
Describing the Shape (Form), Trend
(Direction), and Variation (Strength) Based
on a Scatter Plot
The correlation of the variables can be described in terms of form (shape), trend
(direction), and variation (strength) of scatter plot. The form of correlation can be
determined by the shape of points on a scatter plot categorized as linear or curvilinear.
The form of correlation is linear if the points on scatter plot follow a trend of straight line.
The form of scatter plot is non-linear if the points follow a trend of curve line. Sample
scatter plots showing curvilinear form of correlation are given below.
The correlation of variables can also be described in terms of its trend or direction.
The trend of correlation can be positive, negative, or zero/negligible depending on the
direction of the points. The trend of correlation is summarized in the table that follows.
No The points A
Correlation/ are neither negligibl
Negligible rising from e correlation
Correlation left to right exists
nor right to when
left. high values of
one variable
correspond to
either high or
low values of
another
variable.
The closeness of the points around the trend line determines the variation or
strength of the correlation between the variables involved. The closer the points to the
trend line, the stronger the correlation of the variables is. The strength of correlation
between two variables can be perfect, strong, weak, or no/negligible correlation. To
summarize the strength of correlation, refer to the table below.
Correlation Scatter Plot Description
Strong This correlation
Positive exists when almost
Correlation all of the points are
on the line or the
points are closely
scattered on the
trend line that rises
from left to right.
The correlation coefficient (r) is a number between -1 and 1 that describes both
the strength and the direction of correlation. In symbol, we write -1 ≤ r ≤ 1.
Illustrative Example:
Teachers of Pag-asa National High School instilled among their students the value
of time management and excellence in everything they do. The table below shows the time
in hours spent in studying (X) by six Grade 11 students and their scores in a test (Y).
Solve for the Pearson’s sample correlation coefficient r.
X 1 2 3 4 5 6
Y 5 10 10 15 25 30
The next section will guide you on how to compute the Pearson product moment
correlation r.
STEPS SOLUTION
1. Construct a table as shown on
the right side. X Y XY X2 Y2
1 5
2 10
3 10
4 15
5 25
6 30
4 15 60 16 225
c. Square all the entries in the Y
column. Put them under Y2 5 25 125 25 625
column.
6 30 180 36 900
3.
a. Get the sum of all entries in X Y XY
the X column. This is ∑ 𝑿. X2 Y2
1 5 5 1 25
b. Get the sum of all entries in
the Y column. This is ∑ 𝒀. 2 10 20 4 100
4. Substitute the values obtained Here n = 6 because there are six (6)
from Step 3 in the formula: pairs of values.
6(420) − (21)(95)
=
√[6(91) − (21)2][6(1,975) − (95)2]
r ≈ 0.96395 or 0.96