Week 6 - Result and Analysis 2 (UP)
Week 6 - Result and Analysis 2 (UP)
A hypothesis
• is a conjecture about a population parameter. This conjecture may or may not be
true.
• An educated guess based on theory and background information
Examples
a. Whether seat belts will reduce the severity of injuries caused by accident
b. Whether the public prefer certain colour in the fabric lining
c. Whether adding a chemical will improve water quality
d. The average life expectancy in the next decade for man will be more than 100
years
Ho : µ = 82 (remain uncahnged)
H1 : µ ≠ 82 (will be different)
This is a two-tailed test since the possible effect could be to raise or lower the
pulse
1
Situation B: A chemist invents an additive to increase the life of an automobile
battery. The mean lifetime of ordinary battery is 36 months. The hypothesis will
be:
Ho : µ ≤ 36
Ha : µ > 36
The chemist is interested only in increasing the lifespan of the battery. His
alternative hypothesis is that the mean is larger than 36. Therefore the test is
called right-tailed, interested in the increase only.
Ho : µ ≥ RM 100
H1 : µ < RM 100
This is a left-tailed test since the contractor is only interested in reducing the bill
Significant difference at p<0.05 means, 95% of the time the sample mean is
larger than the hypothesised value.
2
TESTING THE DIFFERENCE AMONG MEANS AND VARIANCE
Situations:
1. z-test
• For detecting difference between two means for large sample (two samples)
• Assumptions required
i. The sample must be independent, that is no relationship between the
subject in the sample
ii. The sample must be normally distributed
2. F-test
3. t-test
• To test the difference between two means for small independent sample (n<30)
• Assumptions
i. Sample must be independent
ii. The populations are normally distributed
3
Correlation is a statistical method used to determine whether a relationship
between variable exists. Regression describe the nature of the relationship
between variables
Best fit means that the sum of the squares of vertical distances from each point to
the line are at the minimum
Regression equation
Assumptions in Regression
i. For any value of the independent variable x, the value of the dependent must
be normally distributed about the regression line
ii. The standard deviation of each of the dependent variable must be the same
for each value of the independent variable
explained variation
r2 =
total variation
Example r2 = 0.845 mean 84.5% of the total variation is explained by the regression
line using the independent variable.
• Defined as the standard deviation of the observed y value about the predicted
y’ values
• y’ is predicted for a specific x value, thus it is a point prediction
• A prediction interval about y value can be constructed using a statistic call
standard error of estimate (Sest)
• Formula
4
∑ ( y − y' )2
S est =
n−2
β.= 0
β.≠0
• Use t-test.
• β.≠0 if calculated t is greater than t (from table) at certain significant level (α)
Multiple Regression
• For any specific value of independent variable, the value of the y variable are
normally distributed (normality assumption)
• The variances or standard deviation for the y variable are the same for each
value of the independent variable (equal variance assumption)
• There is a linear relationship between the dependent variable and the
independent variable (linearity assumption)
5
• The independent variables are not correlated
• The values for the y variables are independent
NON-PARAMETRIC TEST
Disadvantages
i. Less sensitive
ii. Less information
iii. Less efficient
• The value is assigned + when greater than median and – when lower than median
• The number of + and – signs are compared
• If the Null Hypothesis is true the number of + sign about equal to – signs !
ratio close to 1. Reject Null hypothesis when the value is greater than critical
value (from table).
USING MODELS
6
i. Model formulation
ii. Model optimazation
iii. Model calibration/verification
iv. Model Application
Model Formulation
Model optimization
• Regression analysis – analytical way
• Subjective optimization – based on experience of the modelers
Model Calibration
• Changing the coefficient
• Reduce error between observed and predicted values
Model Application
• After the model has been calibrated and validated
USING STANDARDS