Chapter 3 - Polynomial Regression and Interaction
Chapter 3 - Polynomial Regression and Interaction
INTERACTION
y=β0+β1x+β2x2+β3x3+⋯+βnxn+ϵ
y is the dependent variable.
DISCUSSION
yi = b0 + b1xi + b2xi2 + ei
may be represented as a linear model:
yi = b0 + b1xi1 + b2xi2 + ei
with the transformed variables xi1= xi and xi2 = xi2 .
• The order of the polynomial function is the highest exponent of x; the model
above is a second-order model.
• To estimate a polynomial function x is often first deviated from its mean (or
median) to reduce collinearity between x and higher powers of x. A variable
deviated from its mean is called centered. The transformation is x = X -
X. where x (lower case) represents the centered variable and X (uppercase)
the original (uncentered) variable.
A polynomial function can be used when
• the true response function is polynomial
• the true response function is unknown but a polynomial is a good
approximation of its shape
2. Graphic Representation of the Model
• The response function E{y} for any polynomial model with one predictor
variable can be represented on a 2-dimensional plot of y against x.
• A second-degree polynomial implies a parabolic relationship. The signs of the
coefficients determine the shape of the response function:
• when b2 is positive, y increases as the value of x increases
• when b2 is negative, y eventually decreases as the value of x increases
NOTES
• when evaluating the shape of a polynomial response function, it is necessary
to keep within the range of x in the data, as extrapolating beyond this range
may lead to misleading predictions
• it is possible to convert from the coefficients of the centered model (involving
x) to the non-centered model involving the original X however, the conversion
is rarely needed for substantive purposes.
• with a second-order polynomial the coefficient of x 2 (x centered) is the same
as that of X2 (X uncentered)
• fitting a polynomial regression with powers higher than three is rarely done
as the interpretation of the coefficients becomes difficult and interpolation
tends to become erratic. (A polynomial of order n-1 can always be fitted
exactly to n points.)
• polynomial regression models are often fitted with the hierarchical
approach in which higher powers are introduced one at a time and tested for
significance, and if a term of a high order is included (say, x 3) then all terms
of lower order (x and x2) are also included.
which, in other words, represent the following polynomials of 2nd., 4th. and 5th.
order, respectively.