Regression - Analysis - Sumana Mondal
Regression - Analysis - Sumana Mondal
Cars
(in 1,000 units) (in lakh rupees) (Km/ltr) (Km/hr)
Assumptions : The relationship between depndent variable and of those of the predictor va
Confidence Level = 95% ( Significance Le
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.99788065450456
R Square 0.995765800634449
Adjusted R Square 0.995356039405525
Standard Error 2.30779153196702
Observations 35
ANOVA
df SS MS
Regression 3 38827.6180187373 12942.5393395791
Residual 31 165.102954405579 5.32590175501867
Total 34 38992.7209731429
Observations
The R-square and the adjusted R Square valures are very high which shows that the fitness of the reg
For the p-value less than 0.05 the variables will be significant and impactful else there wont be any im
an impact on the Sales.
Hypothesis Testing
The Overall Test The null Hypothesis H0 would be: None of the independent v
on the dependent varibales.
The Alternate Hypothese Ha is: At least one independent vari
influence
The p-values of the independent on thelike
variables dependent
price andvariable.
milage are less than 0.05 which
P-value
Intercept 8.7390238740411E-11
Price
(in lakh rupees) 0.0038334629770107
Mileage
(Km/ltr) 5.7835637568169E-38
Top speed (Km/hr) 0.547763026123756
On comparing each of the p-values for the independent variables individually with the critical v
Calculations
Equation : Sales = 50.7231271792463 -0.795026440875324* Price
Suppose increasing the price of Rocinante36 and Marengo32 by ₹1 lakh each. Then the below chan
Profit =Sales
Price( -inManufacturing
1000 units ) cost 227.897226125466
( in Lakhs)
[ sales * profit ]
Profit = Price - Manufacturing cost 2
New Overall Projected Profit = sales * profit
( in crores ) 4557.94452250931
Droping the insignificant regression variables ( Top speed ) and rebuilding the regression model using onl
Regression Statistics
Multiple R 0.997855373536924
R Square 0.995715346496514
Adjusted R Square 0.995447555652547
Standard Error 2.28493913026155
Observations 35
ANOVA
df SS MS
Regression 2 38825.6506746148 19412.8253373074
Residual 32 167.070298528013 5.2209468290004
Total 34 38992.7209731429
OBSERVATIONS :
adjusted R-squared value for old regression model
adjusted R-squared value for new regression model
f those of the predictor vaariables is linear. Observations are independent of each other.
vel = 95% ( Significance Level is 0.05 )
F Significance F
2430.11229552 7.4472712258E-37 => The value is less than 0.05, hence it is significant and the regression is possib
that the fitness of the regression line is really good with the model.
else there wont be any impact, considering that the Intercept, Price and Milage are the only variables having
be that : The independent variable ( X1, X2, X3 ) has a significant influence on the
Rejected Significant
Rejected Significant
Fail to Reject Not Significant
vidually with the critical value ( 0.05 ) it can be said that the intercept, Price and Milage are significant.
0.99535603941
0.99544755565
Assumptions : The relationship between depndent variable and of those of the predictor vaariables is linear. Observation
Confidence Level = 95% ( Significance Level is 0.05 )
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.929025133175836
R Square 0.863087698072379
Adjusted R Square 0.847875220080421
Standard Error 2.30673039512172
Observations 31
ANOVA
df SS
Regression 3 905.669817744951
Residual 27 143.667138126017
Total 30 1049.33695587097
Observations
The R-square and the adjusted R Square valures are high which sh
For the p-value less than 0.05 the variables will be significant and
an impact on the Sales.
Sales = I
Sales = -13.44764888834
Hypothesis Testing
Intercept
Price
(in lakh rupees)
Top speed (Km/hr)
Mileage
(Km/ltr)
On comparing each of the p-values for the independent var
Calculations
Equation :
Regression Statistics
Multiple R 0.92891610424611
R Square 0.86288512872776
Adjusted R Square 0.85309120935118
Standard Error 2.26683937947473
Observations 31
ANOVA
df
Regression 2
Residual 28
Total 30
Coefficients
Intercept -12.97013136159
Price
(in lakh rupees) -0.1824738201848
Top speed (Km/hr) 0.22013989044002
OBSERVATIONS :
(in lakh rupees) , Mileage (Km/ltr), Top speed (Km/hr)
b3*Top speed
MS F Significance F
301.889939248317 56.735510054881 8.7448816E-12 => The value is less than 0.05, hence it is significant and th
5.32100511577839
are valures are high which shows that the fitness of the regression line is really good with the model.
iables will be significant and impactful else there wont be any impact, considering that the Intercept, Price and Top Speed are the only
Hypothesis H0 would be: The independent variable( X1, X2, X3 ) doesn’t have a significant influence on
endent variable.
rnate Hypothesis Ha would be that : The independent variable ( X1, X2, X3 ) has a significant influence on the
ent variab
e increasing the price of Rocinante36 and Marengo32 by ₹1 lakh each. Then the below changes can be observed.
ficant regression variables ( Mileage ) and rebuilding the regression model using only significant variables.
insignificant Variables )
SS MS F Significance F
905.45725424552 452.72862712276 88.104169082 8.30202130911125E-13
143.879701625447 5.1385607723374
1049.33695587097
There is a increase by 0.01 in the Adjusted R square value in the new regression model this can be due to the
dropping of the insignificant variables.
han 0.05, hence it is significant and the regression is possible
-0.3004782976 -0.0729780465914833
0.1854626971 0.256140666971782
-0.3826896887 0.465292062727021
del.
ept, Price and Top Speed are the only variables having
Null hypothesis at 95% confidence level.
nce on
ence on the
can be observed.
variables.
Upper 95% Lower 95.0% Upper 95.0%
-5.4038074814 -20.5364552417575 -5.403807