Standardized Coefficients
To determine the relative importance of the significant predictors, look at the standardized
coefficients. Even though Price in thousands has a smaller unstandardized coefficient than Vehicle
type, it actually contributes more to the model because it has the larger absolute standardized
coefficient.
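Outside of SPSS, you can reproduce a standardized coefficient by rescaling the unstandardized coefficient by the ratio of the predictor's standard deviation to the dependent variable's. Here is a minimal Python sketch, using made-up data and hypothetical column names rather than the actual vehicle file:

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm

rng = np.random.default_rng(0)
# Made-up stand-in data; the column names are hypothetical.
df = pd.DataFrame(rng.normal(size=(100, 3)), columns=["price", "horsepow", "wheelbas"])
df["sales"] = 5 - 0.8 * df["price"] + 0.3 * df["wheelbas"] + rng.normal(size=100)

cols = ["price", "horsepow", "wheelbas"]
fit = sm.OLS(df["sales"], sm.add_constant(df[cols])).fit()

# Standardized coefficient: unstandardized b times s_x / s_y.
betas = fit.params.drop("const") * df[cols].std() / df["sales"].std()
print(betas.sort_values(key=abs, ascending=False))
```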
The tolerance is the percentage of the variance in a given predictor that cannot be explained by the
other predictors. Thus, the small tolerances show that 70%-90% of the variance in a given predictor
can be explained by the other predictors. When the tolerances are close to 0, there is high
multicollinearity and the standard errors of the regression coefficients will be inflated. A variance
inflation factor (the reciprocal of the tolerance) greater than 2 is usually considered problematic,
and the smallest VIF in the table is 3.193. The collinearity diagnostics confirm that there are
serious problems with multicollinearity.
Several eigenvalues are close to 0, indicating that the predictors are highly intercorrelated and that
small changes in the data values may lead to large changes in the estimates of the coefficients.
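If you want to verify the tolerance and VIF values by hand, each predictor's tolerance is 1 minus the R-square obtained when regressing that predictor on all the others. A small Python sketch with fabricated, deliberately collinear data:

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 100
base = rng.normal(size=n)
# Made-up predictors: 'price' and 'horsepow' share a common component,
# so each is largely explained by the other.
X = pd.DataFrame({
    "price":    base + 0.2 * rng.normal(size=n),
    "horsepow": base + 0.2 * rng.normal(size=n),
    "length":   rng.normal(size=n),
})

# Tolerance of a predictor = 1 - R^2 from regressing it on the rest;
# the variance inflation factor is its reciprocal.
for col in X.columns:
    r2 = sm.OLS(X[col], sm.add_constant(X.drop(columns=col))).fit().rsquared
    print(f"{col}: tolerance={1 - r2:.3f}, VIF={1 / (1 - r2):.3f}")
```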
The condition indices are computed as the square roots of the ratios of the largest eigenvalue to each
successive eigenvalue. Values greater than 15 indicate a possible problem with collinearity; greater
than 30, a serious problem. Six of these indices are larger than 30, suggesting a very serious problem
with collinearity. Now try to fix the collinearity problems by rerunning the regression using z scores of
the independent variables.
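As an aside, the eigenvalues and condition indices described above are straightforward to reproduce outside SPSS: they come from the cross-products matrix of the design matrix after each column, including the constant, is scaled to unit length. A sketch with fabricated, nearly collinear data:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 100
base = rng.normal(loc=5.0, size=n)
# Made-up design matrix: a constant plus two nearly collinear predictors.
X = np.column_stack([np.ones(n), base, base + 0.05 * rng.normal(size=n)])

# Scale each column to unit length before forming the cross-products matrix.
Xs = X / np.linalg.norm(X, axis=0)
eigvals = np.linalg.eigvalsh(Xs.T @ Xs)[::-1]   # eigenvalues, descending

# Condition index i = sqrt(largest eigenvalue / eigenvalue i);
# > 15 suggests a possible problem, > 30 a serious one.
print("eigenvalues:      ", np.round(eigvals, 4))
print("condition indices:", np.round(np.sqrt(eigvals[0] / eigvals), 1))
```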
► To run a Linear Regression on the standardized variables, recall the Linear Regression dialog box.
► Select Zscore: Vehicle type through Zscore: Fuel efficiency as independent variables.
► Click OK.
The eigenvalues and condition indices are vastly improved relative to the original model. However,
the collinearity statistics reported in the Coefficients table are unimproved. This is because the z-
score transformation does not change the correlation between two variables. As a multicollinearity
diagnostic, the condition index is useful for flagging datasets that could cause numerical estimation
problems in algorithms that do not internally rescale the independent variables. The z-score
transformation solves this problem, but we need another tactic for reducing the variance inflation.
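That invariance is easy to verify directly; in this throwaway Python example, z-scoring leaves the correlation, and hence the variance inflation, exactly where it was:

```python
import numpy as np

rng = np.random.default_rng(7)
a = rng.normal(size=100)
b = a + 0.3 * rng.normal(size=100)   # strongly correlated with a

def z(v):
    # Standard z-score transformation.
    return (v - v.mean()) / v.std()

print(np.corrcoef(a, b)[0, 1])        # correlation of the raw variables
print(np.corrcoef(z(a), z(b))[0, 1])  # identical after z-scoring
```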
Using the Factor Analysis procedure, we can create a set of independent variables that are
uncorrelated and fit the dependent variable as well as the original independent variables.
► To run a Factor Analysis on the standardized variables, open the Factor Analysis dialog box.
► Select Zscore: Vehicle type through Zscore: Fuel efficiency as analysis variables.
► Click Extraction.
► In the Extract group, select Fixed number of factors and type 10 as the number of factors to
extract.
► Click Continue, then click Rotation in the Factor Analysis dialog box.
► In the Method group, select Varimax.
► Click Continue, then click Scores in the Factor Analysis dialog box.
► Select Save as variables.
► Click Continue, then click OK in the Factor Analysis dialog box.
► To run a Linear Regression on the factor scores, recall the Linear Regression dialog box.
► Select REGR factor score 1 for analysis 1 [FAC1_1] through REGR factor score 10 for analysis 1
[FAC10_1] as independent variables.
► Click OK.
As expected, the model fit is the same for the model built using the factor scores as for the model
using the original predictors. Also as expected, the collinearity statistics show that the factor
scores are uncorrelated. Note that since the variability of the coefficient estimates is not
artificially inflated by collinearity, the coefficient estimates are larger, relative to their standard
errors, in this model than in the original model. This means that more of the factors are identified
as statistically significant, which can affect your final results if you want to build a model that
includes only significant effects.
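You can verify both claims outside SPSS. In this sketch, scikit-learn's PCA stands in for the Factor Analysis procedure; extracting as many components as there are predictors leaves the fit unchanged while producing uncorrelated scores:

```python
import numpy as np
import statsmodels.api as sm
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(3)
n = 100
base = rng.normal(size=n)
noise = rng.normal(size=(n, 3))
# Three correlated predictors sharing the common component 'base'.
X = base[:, None] + 0.3 * noise
y = X @ np.array([1.0, -0.5, 0.2]) + rng.normal(size=n)

# Replace the correlated predictors with uncorrelated component scores.
scores = PCA().fit_transform(StandardScaler().fit_transform(X))

fit_orig = sm.OLS(y, sm.add_constant(X)).fit()
fit_pca = sm.OLS(y, sm.add_constant(scores)).fit()
print(round(fit_orig.rsquared, 6), round(fit_pca.rsquared, 6))  # identical fit
print(np.round(np.corrcoef(scores, rowvar=False), 6))           # ~identity matrix
```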
► For example, to run a stepwise Linear Regression on the factor scores, recall the Linear
Regression dialog box.
Note that because stepwise methods select models based solely on statistical merit, they may choose
predictors that have no practical significance. While stepwise methods are a convenient way to
focus on a smaller subset of predictors, you should take care to examine the results to see whether
they make sense.
► Click Statistics.
► Click Continue.
► Click Plots in the Linear Regression dialog box.
► Select Histogram.
► Click Continue.
► Click Save in the Linear Regression dialog box.
► Click Continue, then click OK in the Linear Regression dialog box.
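SPSS's stepwise method enters and removes predictors based on F-statistic probabilities. As a rough stand-in outside SPSS, this sketch uses scikit-learn's forward sequential selection, which picks predictors by cross-validated fit, on synthetic "factor scores":

```python
import numpy as np
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(4)
n, p = 100, 10
# Made-up uncorrelated "factor scores"; only the first three drive sales.
F = rng.normal(size=(n, p))
y = -1.2 * F[:, 0] + 0.8 * F[:, 1] + 0.5 * F[:, 2] + rng.normal(size=n)

# Forward selection by cross-validated fit, a rough stand-in for
# SPSS's F-probability entry/removal criterion.
sel = SequentialFeatureSelector(
    LinearRegression(), n_features_to_select=3, direction="forward"
).fit(F, y)
print("selected columns:", np.flatnonzero(sel.get_support()))
```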
The new model's ability to explain sales compares favorably with that of the previous model. Look in
particular at the adjusted R-square statistics, which are nearly identical. A model with extra
predictors will always have at least as large an R-square value, but the adjusted R-square
compensates for model complexity to provide a fairer comparison of model performance.
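For reference, the adjusted R-square is 1 - (1 - R²)(n - 1)/(n - p - 1), where n is the number of cases and p the number of predictors. A quick illustration:

```python
def adjusted_r2(r2: float, n: int, p: int) -> float:
    """Adjusted R-square: 1 - (1 - R^2) * (n - 1) / (n - p - 1),
    where n is the number of cases and p the number of predictors."""
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)

# The same raw R-square is penalized more as predictors are added.
print(adjusted_r2(0.60, n=100, p=3))    # ~0.5875
print(adjusted_r2(0.60, n=100, p=10))   # ~0.5551
```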
The stepwise algorithm chooses factor scores 1, 2, 3, 5, and 6 as predictors; in order to interpret
these results, you'll need to look at the rotated component matrix in the Factor Analysis output.
• The first component (factor score 1) loads most strongly on price and horsepower. Since the
regression coefficient is negative for factor score 1, you can conclude that more expensive, higher-
horsepower cars can be expected to have lower sales.
• The second component loads most strongly on wheelbase and length. Since the regression
coefficient is positive for factor score 2, this suggests that larger vehicles are expected to have
higher sales.
• The third component loads most strongly on vehicle type. The positive coefficient for factor score 3
suggests that trucks are expected to have higher sales.
• The sixth component loads most strongly on engine size. Note that engine size also loads almost as
strongly on the first component, so the positive coefficient for factor score 6 partially offsets the
negative association between engine size and sales implied by the negative coefficient for factor
score 1.
• The fifth component loads most strongly on fuel efficiency; the negative component loading
combined with the negative coefficient for factor score 5 suggests that more fuel-efficient cars are
expected to have higher sales, all other things being equal.
Checking Normality
The shape of the histogram follows the shape of the normal curve fairly well, but there are one or two
large negative residuals. For more information on these cases, see the casewise diagnostics.
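The casewise diagnostics flag cases whose standardized residuals fall outside 3 standard deviations by default. The same check is easy to reproduce; this sketch uses synthetic data with one planted outlier:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(5)
n = 100
x = rng.normal(size=(n, 2))
y = x @ np.array([1.0, 0.5]) + rng.normal(size=n)
y[10] -= 6                       # plant one badly "underperforming" case

fit = sm.OLS(y, sm.add_constant(x)).fit()
std_resid = fit.get_influence().resid_studentized_internal

# Flag cases whose standardized residuals fall outside 3 standard
# deviations, mirroring the default casewise-diagnostics cutoff.
print("flagged cases:", np.flatnonzero(np.abs(std_resid) > 3))
```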
The casewise diagnostics table identifies the cases with large negative residuals as the 3000GT and the Cutlass. This
means that, based on the expected sales predicted by the regression model, these two models
underperformed in the market. The Breeze and SW also appear to have underperformed to a
lesser extent. The plot of residuals by predicted values clearly shows the two most
underperforming vehicles. Additionally, you can see that the Breeze and SW are quite close to the
majority of cases. This suggests that the apparent underperformance of the Breeze and SW could
be due to random chance. What is of greater concern in this plot are the clusters of cases far to
the left of the general cluster of cases. While the vehicles in these clusters do not have large
residuals, their distance from the general cluster may have given these cases undue influence in
determining the regression coefficients.
► Select Standardized Residual as the y variable and REGR factor score 1 for analysis 1 as
the x variable.
► Click OK.
The resulting scatterplot reveals that the points in the unusual grouping noted in the residuals by
predicted values scatterplot have large values for factor score 1; that is, they are high-priced
vehicles. Since the distribution of prices is right-skewed, it might be a good idea to use log-
transformed prices in future analyses. By recalling the Chart Builder, you can produce similar plots
for the other factor scores. The charts for factor scores 2 and 3 don't reveal anything interesting,
but the plot of residuals by factor score 5 reveals that the Metro may be an influential point
because it has a much higher fuel efficiency than any other vehicle in the dataset and lies far
outside the main cluster of points. The residuals by factor score 6 chart reveals that the Viper may
also be an influential point because it has an unusually large engine size and lies outside the main
cluster of points.
► To plot Cook's distance against the centered leverage values, recall the Chart Builder.
► Click OK.
The resulting scatterplot shows a few unusual points. The 3000GT has a large Cook's distance, but it
does not have a high leverage value, so while it adds a lot of variability to the regression
estimates, it likely did not affect the slope of the regression equation. The Viper has a high
leverage value, but does not have a large Cook's distance, so it is not likely to have exerted
undue influence on the model. The most worrisome case is the Metro, which has both a high
leverage and a large Cook's distance. The next step would be to run the analysis without this
case, but we will not pursue this here.
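A similar diagnostic plot can be produced outside SPSS with statsmodels, which exposes Cook's distances and the hat-matrix diagonal; subtracting 1/n from the hat diagonal gives the centered leverage that SPSS reports (that relationship is assumed here). A sketch with one planted high-leverage case:

```python
import matplotlib.pyplot as plt
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(6)
n = 100
x = rng.normal(size=n)
x[0] = 6.0                       # planted extreme predictor value (high leverage)
y = 2 * x + rng.normal(size=n)
y[0] += 5                        # ...that also misses the regression line

infl = sm.OLS(y, sm.add_constant(x)).fit().get_influence()
cooks_d = infl.cooks_distance[0]            # first element holds the distances
leverage = infl.hat_matrix_diag - 1 / n     # assumed: centered leverage = h - 1/n

plt.scatter(leverage, cooks_d)
plt.xlabel("Centered leverage")
plt.ylabel("Cook's distance")
plt.show()
```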
Summary
Using stepwise methods in Linear Regression, you have selected a "best" model for predicting motor-
vehicle sales. With this model, you found two vehicle models that were underperforming in the
market, while no vehicle was clearly overperforming.
Diagnostic plots of residuals and influence statistics indicated that your regression model may be
adversely affected by the Metro. Removing this case and rerunning the analysis to see the difference
in the results would be a good next step.
The Linear Regression procedure is useful for modeling the relationship between a scale dependent
variable and one or more scale independent variables.
• Use the Correlations procedure to study the strength of relationships between the variables before
fitting a model.
• If you have categorical predictor variables, try the GLM Univariate procedure.
• If you have a lot of predictors and want to reduce their number, use the Factor Analysis procedure.