Summary - Topic 8: Response and Explanatory Variables
Summary - Topic 8: Response and Explanatory Variables
When describing the association between two numeric variables, comment on:
– the direction (positive or negative)
– the strength (strong, moderate or weak, depending on the value of r)
– the form (linear or non-linear)
– if an outlier is clear (include that when describing the association).
– Pearson’s product-moment correlation coefficient, r, which measures the strength of a linear
trend and associates it with a numerical value between −1 and +1.
The value of the correlation coefficient, r, is calculated using the statistics functions on your CAS
calculator. In order to calculate this value, we assume that:
– the data is numeric
– there are no outliers in the data
– the association is linear.
Pearson’s product-moment correlation coefficient formula
The reliability of making predictions depends heavily on the initial data. A larger sample and a
strong correlation will make for a reliable prediction. In general, interpolation is more reliable
than extrapolation.
Interpreting the least squares line of best fit:
– Interpretation of the intercept (a): On average, when the value of the explanatory variable is
0, it is predicted that the value of the response variable will be a.
– Interpretation of the slope (b): On average, for each 1 unit increase in the explanatory
variable, the response variable will increase/decrease by b units.
Note: A negative slope indicates that the response variable will decrease with an increase in the
explanatory variable.