Lab2-Markdown XFL (CLEAN)
For this lab exercise, we will use the Carseats data. Make sure you have loaded all the
required packages and the dataset before starting with the questions.
library(ISLR)
library(MASS)
data("Carseats")
#?Carseats #use this command to learn more about the data
1. Let’s start by looking at the relationships between all the variables. Remember, we use
the pairs() command to view the scatter-plot matrix of all the variables. Can you identify
which variables are factors? (There are 3 factor variables.)
[Scatter-plot matrix from pairs(Carseats) with panels for Sales, CompPrice, Income, Advertising, Population, Price, ShelveLoc, Age, Education, Urban, and US.]
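A minimal sketch of this step; the check of which columns are factors is an addition to the original instructions, not part of the lab:
pairs(Carseats)  # scatter-plot matrix of every variable
# One way to confirm which variables are factors
names(Carseats)[sapply(Carseats, is.factor)]
# Expect: "ShelveLoc" "Urban" "US"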
2. Suppose you are interested in predicting the Sales of child car seats. You identify
Price, Advertising, and Age as the key variables. Conduct a linear regression model
looking at this relationship. In other words, estimate:
Sales = β0 + β1 Price + β2 Advertising + β3 Age + ε
If you coded properly, the coefficient for Price should be: -0.058.
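A minimal sketch of the call that produces the summary below (fit1 is just a placeholder object name):
fit1 <- lm(Sales ~ Price + Advertising + Age, data = Carseats)
summary(fit1)  # full regression output, including the Price coefficient of -0.058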
##
## Call:
## lm(formula = Sales ~ Price + Advertising + Age, data = Carseats)
##
## Residuals:
## Min 1Q Median 3Q Max
## -6.6247 -1.5288 0.0148 1.5220 6.2925
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 16.003472 0.718754 22.266 < 2e-16 ***
## Price -0.058028 0.004827 -12.022 < 2e-16 ***
## Advertising 0.123106 0.017095 7.201 3.02e-12 ***
## Age -0.048846 0.007047 -6.931 1.70e-11 ***
## ---
## Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1
##
## Residual standard error: 2.269 on 396 degrees of freedom
## Multiple R-squared: 0.3595, Adjusted R-squared: 0.3547
## F-statistic: 74.1 on 3 and 396 DF, p-value: < 2.2e-16
3. Using the plot() command, graph the diagnostic plots for the linear regression you just
fit. Make sure you include par(mfrow=c(2,2)) to ensure that all four plots are ordered
properly. Do you think a linear regression model is adequate for this relationship?
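A sketch of this step, assuming the Q.2 model is stored as fit1 (the placeholder name used in the sketch above):
par(mfrow = c(2, 2))  # arrange the four diagnostic plots in a 2x2 grid
plot(fit1)            # Residuals vs Fitted, Q-Q, Scale-Location, Residuals vs Leverage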
[Diagnostic plots for the Q.2 model: Residuals vs Fitted, Q-Q Residuals, Scale-Location, and Residuals vs Leverage; observations 26, 51, 144, 166, and 353 are labelled.]
4. Are all the variables significant? What does being significant really mean? Write down
the interpretation of the Intercept and Age coefficients.
Yes, all variables are significant (p-values less than 0.05). Being significant means that the
coefficient is statistically different from 0. When Price, Advertising, and Age are all 0, Sales is 16.00 on
average (Intercept). As Age increases by 1, Sales is associated with a decline of about 0.049 on
average (Age coefficient).
5. Just for practice, try to perform the linear regression:
Sales = β0 + β1 Price + ε
Then, using abline(), plot the linear regression line. You should be getting a plot
like this:
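A minimal sketch of this step (fit2 is just a placeholder object name):
fit2 <- lm(Sales ~ Price, data = Carseats)
plot(Carseats$Price, Carseats$Sales, xlab = "Price", ylab = "Sales")
abline(fit2)  # add the fitted regression line to the scatter plot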
[Scatter plot of Sales against Price with the fitted regression line added by abline().]
6. Using the predict() command, try to calculate the confidence and prediction intervals
for the regression model in Q.5 when Price is 30, 60, and 90. If you coded this
properly, you should be getting:
Confidence intervals:
Prediction intervals:
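A sketch of the predict() calls, assuming the Q.5 model is stored as fit2 (the placeholder name used above); the interval values themselves are not reproduced here:
new_prices <- data.frame(Price = c(30, 60, 90))
predict(fit2, newdata = new_prices, interval = "confidence")  # confidence intervals
predict(fit2, newdata = new_prices, interval = "prediction")  # prediction intervals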
7. Let’s try to add a few more parameters to our regression model. Let us estimate this
model:
Sales = β0 + β1 log(Price) + β2 Income + β3 Income² + ε
Try to only output the coefficients this time. The solution is provided below. Make
sure your coefficients match the solution.
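A sketch of one way to fit this model and print only the coefficients (fit3 is a placeholder name; the solution values are not reproduced here):
fit3 <- lm(Sales ~ log(Price) + Income + I(Income^2), data = Carseats)
coef(fit3)  # coefficients only, rather than the full summary()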
8. Now, let’s estimate this model:
Sales = β0 + β1 Price + β2 Income + β3 Urban + β4 (Income × Urban) + ε
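A minimal sketch of the call that produces the output below (fit4 is a placeholder name):
fit4 <- lm(Sales ~ Price + Income * Urban, data = Carseats)
summary(fit4)  # Income * Urban expands to Income + Urban + Income:Urban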
##
## Call:
## lm(formula = Sales ~ Price + Income * Urban, data = Carseats)
##
## Residuals:
## Min 1Q Median 3Q Max
## -6.6964 -1.8640 -0.0938 1.6930 7.5875
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 11.968842 0.839695 14.254 < 2e-16 ***
## Price -0.052415 0.005318 -9.856 < 2e-16 ***
## Income 0.023487 0.007811 3.007 0.00281 **
## UrbanYes 1.082502 0.703561 1.539 0.12470
## Income:UrbanYes -0.015931 0.009546 -1.669 0.09594 .
## ---
## Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1
##
## Residual standard error: 2.507 on 395 degrees of freedom
## Multiple R-squared: 0.2196, Adjusted R-squared: 0.2117
## F-statistic: 27.79 on 4 and 395 DF, p-value: < 2.2e-16
9. Finally, let’s estimate Sales = β0 + β1 Education + ε. Note, Education is formatted
as a numeric variable. In this model, try to convert this variable to a factor. If done
correctly, your regression output should show coefficients for 8 levels of Education (the
lowest level serves as the baseline absorbed into the intercept).
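A sketch of this step, converting Education to a factor in place so the coefficient names match the output below (fit5 is a placeholder name):
Carseats$Education <- as.factor(Carseats$Education)
fit5 <- lm(Sales ~ Education, data = Carseats)
summary(fit5)  # one dummy coefficient per Education level above the baseline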
##
## Call:
## lm(formula = Sales ~ Education, data = Carseats)
##
## Residuals:
## Min 1Q Median 3Q Max
## -8.1317 -1.9725 -0.0588 1.8270 8.7988
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 8.2458 0.4077 20.223 <2e-16 ***
## Education11 -0.7746 0.5766 -1.343 0.1800
## Education12 -0.7781 0.5737 -1.356 0.1758
## Education13 -1.1935 0.5932 -2.012 0.0449 *
## Education14 -1.1706 0.6048 -1.936 0.0536 .
## Education15 -0.1142 0.6228 -0.183 0.8547
## Education16 -1.0014 0.5797 -1.727 0.0849 .
## Education17 -0.7430 0.5737 -1.295 0.1960
## Education18 -0.9693 0.6048 -1.603 0.1098
## ---
## Signif. codes: 0 ’***’ 0.001 ’**’ 0.01 ’*’ 0.05 ’.’ 0.1 ’ ’ 1
##
## Residual standard error: 2.825 on 391 degrees of freedom
## Multiple R-squared: 0.0195, Adjusted R-squared: -0.0005622
## F-statistic: 0.972 on 8 and 391 DF, p-value: 0.4574