Monika Project
Monika Project
RESEARCH
UNIVERSITY OF DELHI
Course : MSC( applied operation research)
SEM-2
ROLLNO.= 61652
SUBMITTED TO : MR.KAUSHAL KUMAR
SUBMITTED BY: MONIKA SIDHU
AIM OF STUDY
TO study the dependence of various factors on the
air quality
Here the total observations are 119 and total independent variables are
12
Data link =https://archive.ics.uci.edu/ml/datasets/Air+Quality
ASSUMPTION
The regression model is linear in terms of
parameter(s).
Xi and Ui are uncorrelated.
E(Ui) = 0 and V(Ui) = σ2 for all i=1, 2, 3, . . ., n.
There does not exist any relationship between
any independent variables i.e. no
Multicollinearity.
Error variance is constant for all values of i=1, 2, 3.
. ., n i.e. no Heteroscedasticity.
Errors are uncorrelated i.e. no Autocorrelation.
Functional form of the model is correct i.e. no
Specification Bias.
Errors are normally distributed as N(0, σ2).
OUR INITIAL REGRESSION MODEL IS AS
FOLLOW:
CO(GT) =B0+B1* PT08.S1(CO)+B2*
NMHC(GT) +B3* C6H6(GT)+B4*
PT08.S2(NMHC) +B5* NOX(GT) +B6*
PT08.S3(NOX) +B7* NO2(GT)+B8*
PT08.S4(NO2) +
B9*PT08.S5(O3)+
B10* T+B11* RH+B12* AH
Coefficientsa
Model Unstandardized Coefficients Standardized t Sig.
Coefficients
Variables Entered/Removeda
a. Dependent Variable: CO
b. All requested variables entered.
Histogram of Residuals
b. Dependent Variable: CO
ANOVAa
Model Sum of Df Mean F Sig.
Squares Square
Regression 64026.040 12 5335.503 3.326 .000b
1 Residual 170034.083 106 1604.095
Multiple R :- This is the correlation coefficient. It tells us how strong the linear
relationship is. Basically here it suggest the correlation coefficient between the
observed and predicted values. In our result the multiple R value is .523 which is
quite close to 1 hence, there is a strong linear relationship between the
dependent variable and independent variable.
Adjusted R square:- The adjusted R square adjusts for the number of terms in
a model. The adjusted R-squared is a modified version of R-squared that has been
adjusted for the number of predictors in the model. The adjusted R-squared
increases only if the new term improves the model more than would be expected
by chance, it decreases when a predictor improves the model by less then
expected by chance.Value is .191
Fvalue:The F-value for the model can be seen from the (ANOVA)
Table1. The Critical value at 5% level of significance is Calculated
F = 1.8792 and Tabulated F(0.05,12,119) =3.326
This table indicates that the regression model predicts the dependent
variable significantly well. Here, p < 0.05, and indicates that, overall, the
regression model statistically significantly predicts the outcome variable
(i.e., it is a good fit for the data).
Residuals Statisticsa
ANOVAa
Model Sum of Squares df Mean F Sig.
Square
• If d < dL,α, there is statistical evidence that the error terms are positively
autocorrelated.
• if d > dU,α, there is no statistical evidence that the error terms are positively
autocorrelated.
• If dL,α < d < dU,α, the test is inconclusive.
Positive serial correlation is serial correlation in which a positive error for one
observation increases the chances of a positive error for another observation.
• If (4 − d) < dL,α, there is statistical evidence that the error terms are negatively
autocorrelated.
• If (4 − d) > dU,α, there is no statistical evidence that the error terms are negatively
autocorrelated.
• If dL,α < (4 − d) < dU,α, the test is inconclusive.L,α and dU,α):
INTERPRETATION
We can see that the value of d under Durbin Watson D-test is
d=2.236
We know that value of k=no of independent variables+1
Therefore, the value of k = 12+1=13.
Here, n= no of observations in our data
Therefore, n=119
here, d>du , there is no statistical evidence that the error terms are
positively autocorrelated
Thank You