0% found this document useful (0 votes)

3 views9 pages

Ex1 R Solution

The document outlines ITEC 621 Exercise 1, which serves as a refresher on R programming concepts, data manipulation, descriptive analytics, and predictive analytics. It includes detailed instructions on using Quarto for coding and formatting, as well as specific exercises involving R functions, data frames, statistical analysis, and linear regression modeling. Students are required to submit their work in a professional format, highlighting the importance of clarity and accuracy in their coding and reporting.

Uploaded by

kaushik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views9 pages

Ex1 R Solution

Uploaded by

kaushik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

ITEC 621 Exercise 1 - R Refresher

J. Alberto Espinosa

2025-01-04

This Quarto file contains ITEC 621 Exercise 1.

Table of contents
General Instructions........................................................................................................................................ 1
Quarto Overview (please read carefully)................................................................................................ 1
1. Basic R Concepts .......................................................................................................................................... 2
2. Data Manipulation ....................................................................................................................................... 3
3. Basic Descriptive Analytics ...................................................................................................................... 4
4. Basic Predictive Analytics ........................................................................................................................ 7

General Instructions
Download the Quarto template for this exercise Ex1_R_YourLastName.Qmd and save it
with your own last name exactly. Then open it in R Studio and complete all the exercises
and answer the questions below in the template. Run the code to ensure everything is
working fine. When done, knit your R Markdown file into a Word document and submit it.
No need to submit the .Qmd, file just the Word or PDF knitted file. If for some reason you
can’t knit a Word or PDF file, you can knit to an HTML file and then save it as a PDF. Some
LMS systems don’t accept HTML submissions and your HTML file may not display well
without its companion files folder.
This exercise is somewhat similar to HW0 in KSB-999, which you were required to
complete before starting this course. So, if you already did that, this should be an easy
exercise and a good warm up refresher. If you didn’t do it, this is you opportunity to catch
up. This course moves fast and it assumes that you have some familiarity with R.

Quarto Overview (please read carefully)

See full instructions on how to use Quarto in ITEC_Quarto.Qmd.
When you create a Quarto file, it will look like text commingled with R code. You can edit
your Quarto file in either Source or Visual mode, but clicking on the corresponding top left
button. Visual is OK for demos, but I highly encourage you to write your code in Source
view. You will also see a button option named Render in your tool bar (it will only show if
your file has the .Qmd extension. Once you are done with all the coding, click on the Render
button and Quarto will knit your document in the format specified in the YAML, with all
your marked up text and R results.
Important: This is a business course and, as such you are required to submit all exercises,
homework and project reports with a professional, businesslike appearance, free of
grammatical errors and typos, and with well articulated interpretation narratives. No
knitting, improper knitting and submissions with writing and formatting issues will
have up to 3-point (out of 10) deductions for exercises and up to 10-point (out of
100) deductions in homework.
Quarto contains three main types of content:
1. The YAML (YAML Ain’t Markup Language) header, which is where you place the
title, author, date, type of output, etc. It is at the top of the R Markdown file and
starts and ends with ---. I suggest using the format docx.

2. Markup sections, which is where you type any text you wish, which will show up as
typed text. You will learn these later.

3. Code chunks: which is where you write your R code. An R code chunk starts with a
```{r} and ends with a ```.

Your knitted file must:

• Display all your R commands (leave echo: true in the YAML. FYI, echo: false
suppresses the R code)
• Display the resulting R output results
• Contain any necessary text and explanations, as needed; and
• Be formatted for good readability and in a businesslike manner
• Be in the same order as the questions and with the corresponding question numbers

1. Basic R Concepts
1.1 Write a simple R function named area() that takes 2 values as parameters (x and y,
representing the two sides of a rectangle) and returns the product of the two values
(representing the rectangle’s area). Then use this function to display the area of a rectangle
of sides 6x4. Then, use the functions paste(), print() and area() to output this result:
The area of a rectangle of sides 6x4 is 24, where 24 is calculated with the area()
function you just created.
area <- function(x,y) {return(x*y)}
area(4,6)

[1] 24

print(paste("The area of a 4x6 rectanlge is", area(4,6)))

[1] "The area of a 4x6 rectanlge is 24"

1.2 Write a simple for loop for i from 1 to 10. In each loop cycle, compute the area of a
rectangle of sides i and i*2 (i.e., all rectangles have one side double the lenght than the
other) and for each of the 10 rectangles display “The area of an 1 x 2 rectangle is 2” for i=1,
“The area of an 2 x 4 rectangle is 8”, and so on.
for (i in 1:10) {
print(paste("The area of a", i, "x", i * 2,
"rectangle is", area(i, 2 * i)))
}

[1] "The area of a 1 x 2 rectangle is 2"

[1] "The area of a 2 x 4 rectangle is 8"
[1] "The area of a 3 x 6 rectangle is 18"
[1] "The area of a 4 x 8 rectangle is 32"
[1] "The area of a 5 x 10 rectangle is 50"
[1] "The area of a 6 x 12 rectangle is 72"
[1] "The area of a 7 x 14 rectangle is 98"
[1] "The area of a 8 x 16 rectangle is 128"
[1] "The area of a 9 x 18 rectangle is 162"
[1] "The area of a 10 x 20 rectangle is 200"

2. Data Manipulation
2.1 Copy the Credit.csv data file to your working directory (if you haven’t done this yet).
Then read the Credit.csv data file into a data frame object named Credit (Tip: use the
read.table() function with the parameters header=T, sep=",", row.names=1). Then, list
the first 5 columns of the top 5 rows (Tip: use Credit[1:5, 1:5])
Credit <- read.table("Credit.csv",
header = T,
sep = ",",
row.names = 1)
Credit[1:5, 1:5]

Income Limit Rating Cards Age

1 14.891 3606 283 2 34
2 106.025 6645 483 3 82
3 104.593 7075 514 4 71
4 148.924 9504 681 3 36
5 55.882 4897 357 2 68

2.2 Using the class() function, display the object class for the Credit data set, and for
Gender (i.e., Credit$Gender), Income and Cards
class(Credit)

[1] "data.frame"

class(Credit$Gender)

[1] "character"
class(Credit$Income)

[1] "numeric"

class(Credit$Cards)

[1] "integer"

2.3 Create a vector named income.vect with data from the Income column. Then use the
head() function to display the first 6 values of this vector.
income.vect <- Credit$Income
head(income.vect)

[1] 14.891 106.025 104.593 148.924 55.882 80.180

3. Basic Descriptive Analytics

3.1 Compute the mean, minimum, maximum, standard deviation and variance for all
the values in the income.vect vector. Store the respective results in variables name
mean.inc, min.inc, etc. Then, use the c() function to create a vector called income.stats
with 5 values you computed above. Then use the names() function to give the
corresponding names “Mean”, “Min”, “Max”, “StDev”, and “Var”. Then display the
income.stats vector, but wrap it within the round() function with a parameter digits = 2
to display only 2 decimals.
Technical Note: The names() function needs to create a vector with the respective names
above, which need to correspond to the values in income.vect. Therefore, you need to use
the c() function to create a vector with these 5 names.
mean.inc <- mean(income.vect)
min.inc <- min(income.vect)
max.inc <- max(income.vect)
sd.inc <- sd(income.vect)
var.inc <- var(income.vect)

income.stats <- c(mean.inc, min.inc, max.inc, sd.inc, var.inc)

names(income.stats) <- c("Mean","Min","Max","StDev", "Var")
round(income.stats, digits = 2)

Mean Min Max StDev Var

45.22 10.35 186.63 35.24 1242.16

3.2 Display a boxplot for the predictor Income. Tip: you can do this 2 ways. First you can
attach() the Credit data set (which loads the data set in the work environment) and then
do a boxplot() for Income. Or, do it without attaching, but using the table prefix (i.e.,
Credit$Income). Use the xlab = attribute to name include the label “Income”. Then display
similar boxplots but this time broken down by Gender (i.e., Credit$Income ~
Credit$Gender).
boxplot(Credit$Income,
xlab = "Income")

boxplot(Credit$Income ~ Credit$Gender)

3.3 Display a histogram for the variable Rating, with the main title “Credit Rating
Histogram” (main =) and X label “Rating” (xlab =). Then draw a QQ Plot for Rating (Tip:
use the qqnorm() function first to draw the data points and then use the qqline() function
to layer the QQ Line on top).
hist(Credit$Rating,
main = "Credit Rating Histogram",
xlab="Rating")

qqnorm(Credit$Rating)
qqline(Credit$Rating)
3.4 Briefly answer in your own words: Do you think that this data is somewhat normally
distributed? Why or why not? In your answer, please refer to both, the Histogram and the
QQ Plot.
# The data is somewhat normal in the middle, but the qqplot deviates from the
qqline providing some indication of non-normality at the tails. The histogram
shows some skewness to the right indicating some departure from normality, bu
t it has a bell shape in the center of the data, which is consistent with the
QQ Plot.

4. Basic Predictive Analytics

4.1 First, enter the command options(scipen = 4) to minimize the display values with
scientific notation. Then, create a simple linear regression model object with the lm()
function to fit credit Rating as a function of Income and save the results in an object
named lm.rating. Then display the model summary results with the summary() function.
Tip: use the formula Rating ~ Income, data = Credit inside the lm() function.
options(scipen = 4)
lm.rating <- lm(Rating ~ Income,
data = Credit)
summary(lm.rating)

Call:
lm(formula = Rating ~ Income, data = Credit)

Residuals:
Min 1Q Median 3Q Max
-173.855 -79.417 -0.384 79.747 171.955

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 197.8411 7.7089 25.66 <2e-16 ***
Income 3.4742 0.1345 25.83 <2e-16 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 94.71 on 398 degrees of freedom

Multiple R-squared: 0.6263, Adjusted R-squared: 0.6253
F-statistic: 667 on 1 and 398 DF, p-value: < 2.2e-16

4.2 Now, plot Credit Rating (Y axis) against Income (X axis), with respective labels “Income”
and “Credit Rating”. Tip: feed the same formula you used in the lm() function above, but
using the plot() function instead. Then draw a regression line by feeding lm.rating into
the abline() function.
Note: notice that I added the parameters #| fig-width: 8 and #| fig-height = 8 to
control the size of the figure. Notice that the YAML has fig-width: 10 and fig-height: 6,
which are global parameters affecting the entire document. You can change any parameters
withing a code cell as I did below to override a global parameter, just for the specific code
cell. The rest of the script is unaffected.
plot(Rating ~ Income, data = Credit)
abline(lm.rating)

4.3 Write a simple linear model to predict credit ratings using these predictors: Income,
Limit, Cards, Married and Balance. Name the resulting model lm.rating.5. Then display
the regression using the summary() function.
lm.rating.5 <- lm(Rating ~ Income + Limit + Cards + Married + Balance,
data = Credit)
summary(lm.rating.5)

Call:
lm(formula = Rating ~ Income + Limit + Cards + Married + Balance,
data = Credit)

Residuals:
Min 1Q Median 3Q Max
-24.0051 -7.0024 -0.9291 6.3789 26.2751

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 27.1070066 2.1867611 12.396 < 2e-16 ***
Income 0.0975008 0.0335195 2.909 0.00383 **
Limit 0.0641536 0.0009004 71.247 < 2e-16 ***
Cards 4.7108256 0.3762419 12.521 < 2e-16 ***
MarriedYes 2.1217503 1.0441007 2.032 0.04281 *
Balance 0.0084355 0.0031308 2.694 0.00735 **
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 10.14 on 394 degrees of freedom

Multiple R-squared: 0.9958, Adjusted R-squared: 0.9957
F-statistic: 1.85e+04 on 5 and 394 DF, p-value: < 2.2e-16

4.4 Question: what do you think are the most influential predictors of credit rating?
# All predictors are statistically significant (i.e., they have asterisks nex
t to them and the p-values are smaller than 0.05). Also, all predictors are p
ositive, so they all have a positive influence on credit rating. Limit and Ca
rds are the most significant and the number of Cards seems to have the strong
est effect.

Pmbok 6th Edition Free Download PDF
No ratings yet
Pmbok 6th Edition Free Download PDF
3 pages
Assignment 2: Introduction To R: Text Like This Will Be Problems For You To Do and Turn In. (There Are 7 in All.)
No ratings yet
Assignment 2: Introduction To R: Text Like This Will Be Problems For You To Do and Turn In. (There Are 7 in All.)
15 pages
Photography Proposal Example
No ratings yet
Photography Proposal Example
7 pages
Introduction To Linear Algebra With Applications
0% (3)
Introduction To Linear Algebra With Applications
7 pages
Unit 2
No ratings yet
Unit 2
32 pages
ITEC 621 Exercise 1 - R Refresher: General Instructions
No ratings yet
ITEC 621 Exercise 1 - R Refresher: General Instructions
8 pages
Introduction To R For Business Analytics
No ratings yet
Introduction To R For Business Analytics
7 pages
Lecture 10 R
No ratings yet
Lecture 10 R
117 pages
An R Tutorial Starting Out
No ratings yet
An R Tutorial Starting Out
9 pages
Chapter - 03 - Review of Basic Data
No ratings yet
Chapter - 03 - Review of Basic Data
92 pages
Basics of Data Analysis and Graphics in
No ratings yet
Basics of Data Analysis and Graphics in
103 pages
Apunts BLOC 1 Estadística
No ratings yet
Apunts BLOC 1 Estadística
15 pages
R Commands
No ratings yet
R Commands
18 pages
DSCI Key Terms and Ideas For Review
No ratings yet
DSCI Key Terms and Ideas For Review
98 pages
DA Lab Week-2
No ratings yet
DA Lab Week-2
22 pages
Data in R
No ratings yet
Data in R
7 pages
Data Science Lab Manual
No ratings yet
Data Science Lab Manual
40 pages
First Course On R
No ratings yet
First Course On R
26 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
Experiment # 4
No ratings yet
Experiment # 4
10 pages
R Software Project
No ratings yet
R Software Project
42 pages
Data Science Using R - Lab Manual-Complete Ver 2.0 - Nov 2024
No ratings yet
Data Science Using R - Lab Manual-Complete Ver 2.0 - Nov 2024
36 pages
N2 Data in R
No ratings yet
N2 Data in R
7 pages
MIT R For Machine Learning
No ratings yet
MIT R For Machine Learning
9 pages
Unit 2 Notes - Data Analysis Using R
No ratings yet
Unit 2 Notes - Data Analysis Using R
19 pages
MDPN460 Lecture05
No ratings yet
MDPN460 Lecture05
32 pages
Brief R Tutorial
No ratings yet
Brief R Tutorial
8 pages
Intro To Statistic Using R - Session 2
No ratings yet
Intro To Statistic Using R - Session 2
1 page
DS Lab
No ratings yet
DS Lab
31 pages
Introduction To R
No ratings yet
Introduction To R
20 pages
Advance R Prog.-1
No ratings yet
Advance R Prog.-1
24 pages
Practical 3 Intro To R
No ratings yet
Practical 3 Intro To R
10 pages
R Manual
No ratings yet
R Manual
10 pages
R Programming Notes
No ratings yet
R Programming Notes
23 pages
Introduction To R
No ratings yet
Introduction To R
23 pages
R Software - Notes
No ratings yet
R Software - Notes
18 pages
MIS 4.hafta (Introduction To R)
No ratings yet
MIS 4.hafta (Introduction To R)
52 pages
MTech R Notes
No ratings yet
MTech R Notes
14 pages
CT Queston Solution
No ratings yet
CT Queston Solution
4 pages
Data - Analysis - With - R - 24
No ratings yet
Data - Analysis - With - R - 24
47 pages
A Brief Guide To R For Beginners in Econometrics: Department of Economics, Stockholm University
No ratings yet
A Brief Guide To R For Beginners in Econometrics: Department of Economics, Stockholm University
31 pages
R Socialscience
No ratings yet
R Socialscience
62 pages
R Short Tutorial
No ratings yet
R Short Tutorial
5 pages
Da Session 4
No ratings yet
Da Session 4
75 pages
R Lab
No ratings yet
R Lab
114 pages
Untitled
No ratings yet
Untitled
59 pages
R
No ratings yet
R
13 pages
Practical 1 - Data Frame Manipulation - 072502
No ratings yet
Practical 1 - Data Frame Manipulation - 072502
16 pages
P6ADBMS
No ratings yet
P6ADBMS
34 pages
SSMDA Expt 7
No ratings yet
SSMDA Expt 7
16 pages
Introduction To R, Version 2
No ratings yet
Introduction To R, Version 2
51 pages
S24 Stats10 Lab1-1
No ratings yet
S24 Stats10 Lab1-1
8 pages
Module 1 Rprogramming Introduction Part A
No ratings yet
Module 1 Rprogramming Introduction Part A
20 pages
Muthayammal College of Arts and Science Rasipuram: Assignment No - 1
No ratings yet
Muthayammal College of Arts and Science Rasipuram: Assignment No - 1
10 pages
R Programming Slides
No ratings yet
R Programming Slides
73 pages
Gd Script
From Everand
Gd Script
Marijo Trkulja
No ratings yet
Profound Python Libraries
From Everand
Profound Python Libraries
Onder Teker
No ratings yet
Mastering Node.js Web Development: Go on a comprehensive journey from the fundamentals to advanced web development with Node.js
From Everand
Mastering Node.js Web Development: Go on a comprehensive journey from the fundamentals to advanced web development with Node.js
Adam Freeman
No ratings yet
More on C# in Front Office
From Everand
More on C# in Front Office
Xing Zhou
No ratings yet
Microsoft Office Productivity Pack: Microsoft Excel, Microsoft Word, and Microsoft PowerPoint
From Everand
Microsoft Office Productivity Pack: Microsoft Excel, Microsoft Word, and Microsoft PowerPoint
Steven Bright
No ratings yet
Programming PowerPoint With VBA Straight to the Point
From Everand
Programming PowerPoint With VBA Straight to the Point
Eduardo N Sanchez
No ratings yet
Excel 101: A Beginner's Guide for Mastering the Quintessence of Excel 2010-2019 in no time!
From Everand
Excel 101: A Beginner's Guide for Mastering the Quintessence of Excel 2010-2019 in no time!
Johannes Wild
No ratings yet
Learning Open Office: Calc & Base
From Everand
Learning Open Office: Calc & Base
Durgesh
No ratings yet
Icom IC-T90A Instruction Manual
100% (1)
Icom IC-T90A Instruction Manual
100 pages
Benchmarking Edge For Successful Sales Execution1
No ratings yet
Benchmarking Edge For Successful Sales Execution1
14 pages
MSS-SP-25 (2013) PDF
67% (3)
MSS-SP-25 (2013) PDF
31 pages
Ie4-1le7 Simotics Motor Brochure - 06.24
No ratings yet
Ie4-1le7 Simotics Motor Brochure - 06.24
4 pages
Viewsonic-Manuals N3235w-1M SM 1a
No ratings yet
Viewsonic-Manuals N3235w-1M SM 1a
100 pages
6FM9Y
No ratings yet
6FM9Y
2 pages
KehuaFrance 3kW
No ratings yet
KehuaFrance 3kW
2 pages
Lista de Accesorios Nueva
No ratings yet
Lista de Accesorios Nueva
11 pages
ASD Applicaton Format1
No ratings yet
ASD Applicaton Format1
3 pages
2022-23 B.C.A (CBCS) Syllabus
No ratings yet
2022-23 B.C.A (CBCS) Syllabus
28 pages
Dictionary of Mahratta Language
No ratings yet
Dictionary of Mahratta Language
664 pages
Panel Kapasitor Bank-Model - PDF 1
No ratings yet
Panel Kapasitor Bank-Model - PDF 1
1 page
Agarwal Dhar 2014 Editorial Big Data Data Science and Analytics The Opportunity and Challenge For Is Research
No ratings yet
Agarwal Dhar 2014 Editorial Big Data Data Science and Analytics The Opportunity and Challenge For Is Research
6 pages
Fourier Series Project BY.A. SELECTED
No ratings yet
Fourier Series Project BY.A. SELECTED
12 pages
Code
No ratings yet
Code
4 pages
(Electrical Power Systems) (By: C.L. Wadhwa) (Published: July, 2009)
No ratings yet
(Electrical Power Systems) (By: C.L. Wadhwa) (Published: July, 2009)
5 pages
UPDATED - HGDML - ALL QUIZ QUESTIONS and ANSWERS v2.3.1
100% (1)
UPDATED - HGDML - ALL QUIZ QUESTIONS and ANSWERS v2.3.1
15 pages
Wpq-105-03 Gmaw 3g Jose A. Rivas
No ratings yet
Wpq-105-03 Gmaw 3g Jose A. Rivas
1 page
Ai Class 10th
No ratings yet
Ai Class 10th
8 pages
SRTPV Solar Application Form
No ratings yet
SRTPV Solar Application Form
30 pages
Uiet 2009 Cutoff
No ratings yet
Uiet 2009 Cutoff
17 pages
Database Management System - Practical File
No ratings yet
Database Management System - Practical File
11 pages
PNZ Series
No ratings yet
PNZ Series
2 pages
Lab Experiment 07 Logical Operations
No ratings yet
Lab Experiment 07 Logical Operations
6 pages
Template For GigaByte Journal Data Report Submissions
No ratings yet
Template For GigaByte Journal Data Report Submissions
10 pages
Official Glossary - ISC 2 CC Preparation
No ratings yet
Official Glossary - ISC 2 CC Preparation
12 pages
Copyright Project
No ratings yet
Copyright Project
11 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Ex1 R Solution

Uploaded by

Ex1 R Solution

Uploaded by

ITEC 621 Exercise 1 - R Refresher

This Quarto file contains ITEC 621 Exercise 1.

Quarto Overview (please read carefully)

Your knitted file must:

print(paste("The area of a 4x6 rectanlge is", area(4,6)))

[1] "The area of a 4x6 rectanlge is 24"

[1] "The area of a 1 x 2 rectangle is 2"

Income Limit Rating Cards Age

[1] 14.891 106.025 104.593 148.924 55.882 80.180

3. Basic Descriptive Analytics

income.stats <- c(mean.inc, min.inc, max.inc, sd.inc, var.inc)

Mean Min Max StDev Var

4. Basic Predictive Analytics

Residual standard error: 94.71 on 398 degrees of freedom

Residual standard error: 10.14 on 394 degrees of freedom

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.