(LBOLYTC) Notes
(LBOLYTC) Notes
● TYPES OF GRAPHS
1. Pie chart/circle graph - any data
2. Bar Graph
➢ Bar chart
➢ Histogram
3. Line Graph
➢ Frequency Polygon
- > Ogives, < Ogives
BAR CHART
➢ with gaps between bars = discrete
➢ y-axis: Class Interval (CI)
➢ x-axis: Frequency (f)
OGIVES
< Ogive
➢ y-axis: Upper Class Boundary (UCB)
➢ x-axis: < CF
> Ogive
➢ y-axis: Lower Class Boundary (LCB)
➢ x-axis: > CF
HISTOGRAM
➢ no gaps between bars = continuous
➢ y-axis: Class Boundary (CB)
➢ x-axis: Frequency (f)
FREQUENCY POLYGON
➢ continuous data
➢ y-axis: Class Mark (CM)
➢ x-axis: Frequency (f)
MODULE 3: DESCRIPTIVE MEASURES
➢ 𝑠 = , for samples
➢ more than two modes, data are 𝑛−1
2
multimodal ➢ σ =
2 Σ(𝑥−𝑥)
, for population
𝑁
2
➢ 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒 = 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
Mean Median Mode
3. Standard Deviation (s or σ)
Exists ✓ ✓ - ➢ positive square root of the
variance
Unique ✓ ✓ - ➢ Standard deviation is the square
root of variance
Sensitive ✓ x x
➢ 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 = 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
4. Coefficient of Variation (CV)
➢ ratio of the standard deviation to
its mean expressed in percent
➢ compare variability of two
populations that are expressed
in different units of
measurement
➢ expressed as a percentage
rather than in terms of the units
of the particular data
𝑠
➢ 𝐶𝑉 = 𝑥̅
× 100
MODULE 6: TECHNIQUES AND TOOLS IN PREDICTIVE ANALYTICS
● Kurtosis
➢ K = 3, mesokurtic
➢ K > 3, leptokurtic
○ can be seen if there are
more data
➢ K < 3, platykurtic
MODULE 4: SAMPLING TECHNIQUES
○ Convenience Sampling
➢ researcher uses
subjects that are readily
MODULE 5: HYPOTHESIS TESTING
What is a hypothesis?
➢ an assumption about the population
parameter
➢ an educated guess about the population
parameter
Statistical Hypotheses
➢ guess or prediction made by the
researcher regarding the possible ● One-tailed right directional test
outcome of the study ○ used if Ha uses > symbol
Hypothesis Testing
➢ process of making an inference or
generalization on population parameters
based on the results of the study on
samples
➢ deciding between what is reality and
coincidence CRITERION:
● One-tailed test (right directional)
Steps in Hypothesis Testing ○ “Reject Ho if Zc ≥ Zt
1. Formulate Ho and Ha ● One-tailed test (left directional)
2. Set the level of significance α, usually it ○ “Reject Ho if Zc ≤ Zt”
is given in the problem ● Two-tailed test (both sides)
3. Formulate the decision rule (when to ○ “Reject Ho if Zc ≥ Zt” and
reject Ho); Find the critical value/P-value ○ “Reject Ho if Zc ≤ Zt”
4. Test Statistics; do the computation
5. Make your decision Testing the hypothesized value of the mean
6. Write a conclusion ● 𝑍𝑐 =
(𝑥−µ) 𝑛
, large sample size (n ≥ 30)
σ
(𝑥−µ) 𝑛
Types of Hypotheses Tests ● 𝑡𝑐 = 𝑠
, small sample size (n < 30)
● One-tailed left directional test
○ used if Ha uses < symbol Testing the difference between two means
𝑥1−𝑥2
● 𝑍𝑐 = 2 2
σ1 σ2
𝑛1
+ 𝑛2
𝑥1−𝑥2
● 𝑡𝑐 =
2 2
(𝑛1−1)𝑠 +(𝑛2−1)𝑠
1 2 1 1
𝑛1+𝑛2−2
(𝑛 +𝑛 )
1 2
Coefficient of determination
➢ r2
➢ how much x can influence the y
➢ how much y depends on x
Why do we use r?
➔ to analyze if a relationship exists
between two variables
Sample Problem:
V. Decision Rule
Below are the midterm (x) and final (y) grades: ➢ if the computed value r value is
x 75 70 65 90 85 85 80 70 65 90 greater than the tabular value,
disconfirm (reject) Ho.
y 80 75 65 95 90 85 90 75 70 90
VI. Conclusion/Implication
Solving by Stepwise Method: ➢ r > 0.632 (tabular value at 0.05
I. Problem level of significance with 8
➢ Is there a significant relationship degrees of freedom)
between the midterm and the ➢ Null hypothesis (Ho) is
final examination of 10 students disconfirmed
in Mathematics? ➢ There is a significant
relationship between the
II. Hypotheses midterm and the final
➢ Ho: there is no significant examination of 10 students in
relationship between the Mathematics
midterm and the final
examinations of 10 students in Simple Linear Regression Analysis
Mathematics ➢ predicts the value of y given the value of
➢ Ha: there is a significant x
relationship between the
midterm and the final When to use simple linear regression?
examination of 10 students in ➔ when there is a relationship between x
Mathematics and y variables
➔ the data should be normally distributed
III. Level of significance using the level of measurement which is
➢ n = 10 expressed in an interval or ratio data
➢ a = 0.05
➢ df = n - 2; 10 - 2 = 8 Why do we use simple linear regression?
➢ r.05 = 0.632 ➔ we are interested in predicting the value
of y, the dependent variable
IV. Statistics ➔ used for forecasting and prediction
➢ use r formula
Formulas:
● 𝑦 = 𝑏𝑥 + 𝑎
𝑛Σ𝑥𝑦−Σ𝑥Σ𝑦
● 𝑏= 2 2
𝑛Σ𝑥 −(Σ𝑥)
● 𝑎 = 𝑦 − 𝑏𝑥
Where:
● y = dependent variable
● x = independent variable
● a = y-intercept
● b = slope of the line
➢ 𝑥̅ = 77.50
➢ 𝑦̅ = 81.50
➢ r = 0.949 or 0.95
MODULE 7: ANALYSIS OF VARIANCE (ANOVA)