Hdat9200ch1 Rcorner

© Copyright 2025 UNSW Sydney. All rights reserved except where otherwise stated.

R CORNER
Introduction to R
The origins of R
The origins of R lie in a language called S, which was developed by the AT&T
telecommunications company (formerly the Bell Telephone Company) in the US in the late 1970s
and early 1980s to facilitate statistical analysis of telephone call and customer data. The S
language became a commercial software product called S-PLUS, which rapidly became the
language of choice for many statisticians, particularly those developing new statistical methods.

In the early 1990s, two statisticians at the University of Auckland in New Zealand, Ross Ihaka and
Robert Gentleman, decided to create an alternative to the S language that was closely modelled
on it but could be freely used on computers in their university without having to pay licence
fees. They named this language R (after their shared first initial, and as a pun on the
name "S"). Over the course of the 1990s and 2000s, R rapidly became very popular with
statisticians, particularly those who were already using S or S-PLUS, and eventually came to be
arguably the dominant programming language for statistics and statistical graphics, and one of
the most popular programming languages for data science, currently challenged only by Python.

R is a fully-featured, 'industrial-strength' programming language which is mature and highly
trusted, and is very suitable for most data science and data analysis tasks.

Free, open-source software

Open-source software is software that is made available under a licence that allows others to
use and/or modify the software code. Although this sounds like a recipe for chaos and anarchy,
in practice it works very well and open-source software is now very widely used almost
everywhere, including in mission-critical areas. Indeed, much of the internet runs on computers
running the Linux operating system: Linux is open-source software.

The free, open-source licensing for R means that anyone can install it on as many computers as
they wish without having to pay any licensing fees. This is a tremendous advantage: it means
that skills are highly portable between jobs or projects, because the software on which those
skills have been acquired can be installed anywhere. It also facilitates the use of R with 'big data',
which may require the use of many computers simultaneously, all running the same program
code on different parts of the data in parallel.

What is R?
R is a high-level programming language, and as such it can be used for a very wide variety of
tasks.

'High-level' means that the language abstracts away (hides) many of the messy details of writing
program code to run on a computer. This allows the coder to concentrate on the task rather than
on the characteristics of the computer on which it will run. Thus, R code tends to be highly
portable: code written on one type of computer, say, under macOS, will run unchanged (or almost
unchanged) on, say, a Windows- or Linux-based computer.

R requires you to write code to get most things done. There are no point-and-click interfaces
that will do everything that you need to do: writing code is inescapable. But that's a Good
Thing™, because actions performed through code are repeatable and reproducible, and the code,
coupled with a good source control system, automatically provides an audit trail of how the
data have been manipulated and of the process of arriving at the final analysis.

R is an interpreted language
This means that R code does not have to be compiled into a separate executable program before
it can be run. Instead, when you run R code, a special program called an interpreter translates
the R code on-the-fly into a lower-level internal representation (which isn't designed to be
written by humans), and then carries out the corresponding "machine language" instructions (one
level below even assembly language) as the program is actually executing on the computer hardware.

R is a functional language
The sense of functional here is not that "it works well" (it does!), but rather that every operation
in the R language is performed by calling a function, usually with one or more arguments passed
as parameters to the function.

The function operates on those arguments and returns a value, or some data, or another
function.
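A minimal sketch of what this means in practice is shown below. The names sum_infix, sum_prefix, squares, make_adder and add_two are made up for this illustration; only `+`, sapply() and function() come from R itself.

```r
# Even the + operator is a function; backticks let us call it by name.
sum_infix  <- 3 + 4
sum_prefix <- `+`(3, 4)

# A function can be passed as an argument to another function.
# Here sapply() applies an unnamed squaring function to each value.
squares <- sapply(c(1, 2, 3), function(x) x^2)

# A function can also return another function.
make_adder <- function(n) {
  function(x) x + n
}
add_two <- make_adder(2)
add_two_result <- add_two(5)

print(sum_prefix)
print(squares)
print(add_two_result)
```

The infix form 3 + 4 and the prefix form `+`(3, 4) are the same function call written in two ways.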

R is vector-oriented
R is vector-oriented, by default. This means that all data in R are stored in vectors (or
two-dimensional matrices, or multi-dimensional arrays, both of which are actually vectors with
two or more dimensions imposed on them, whereas a vector has just one dimension).

A vector is a container for data which can store zero or more values of a particular data type, and
these values can be accessed by an index, where the index is an integer or a name, e.g. the third
value in a vector can be accessed, or the value named "Robert" in a vector can be accessed.
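As a rough illustration of both points, the sketch below indexes a vector by position and by name, and then imposes dimensions on a plain vector to turn it into a matrix. The object names and the values in scores are invented for the example.

```r
# A named numeric vector (names and values are hypothetical).
scores <- c(Ross = 10, Robert = 20, Alice = 30)

third_value  <- scores[3]         # access by integer position
robert_value <- scores["Robert"]  # access by name

# A matrix is a vector with a dim attribute imposed on it.
m <- 1:6
dim(m) <- c(2, 3)   # 2 rows, 3 columns, filled column by column

print(third_value)
print(robert_value)
print(m)
print(is.matrix(m))
```

Note that indexing in R starts at 1, so scores[3] is the third value.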

Even when a single value (sometimes referred to as a scalar) is assigned to a "variable" in R, in
fact it is being assigned to a vector of length one. For example:
a <- 3

length(a)

In fact, "variables" in R are actually just names which refer to vectors or matrices or arrays, or
lists, or other objects, in R.

R stores data as vectors (or matrices or arrays) because it enables almost all arithmetic
operations and functions in R to be vectorised, meaning that a single operation will operate on
every value in the vector, without the need for an explicit loop in your code to process each
value.

An example makes this clear. Notice that comments are preceded with # and that referring to an
object by itself will print that object's value (or a summary of its contents if it is a more complex
object):

# assign a vector of three numbers to b
# (the c() function concatenates or combines its arguments into a vector)

b <- c(3, 5, 6)

# show the contents of b

b

# now add 2 to b and show the result

b <- b + 2
b

Notice that 2 has been added to each of the three elements in the vector named b, without the
need for an explicit loop.
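For comparison, here is a sketch of the explicit loop that vectorisation saves you from writing. The names vectorised and looped are chosen just for this example.

```r
b <- c(3, 5, 6)

# Vectorised: one operation acts on every element.
vectorised <- b + 2

# The equivalent explicit loop, processing one element at a time.
looped <- numeric(length(b))
for (i in seq_along(b)) {
  looped[i] <- b[i] + 2
}

print(vectorised)
print(looped)
print(identical(vectorised, looped))
```

Both approaches give the same result, but the vectorised form is shorter and easier to read.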

R is strongly typed and uses dynamic variable declaration

This means that atomic vectors (including matrices and arrays) can only hold one type of data,
e.g. integer numbers, or floating point numbers, or character strings, and that a name does not
need to be declared with a type before an object is assigned to it.
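The sketch below shows how this looks in practice, using typeof() to report the type of a vector. The object names are invented for the example. Note that when values of different types are combined with c(), R converts them all to a single common type rather than storing a mixture.

```r
# Each atomic vector has exactly one type.
type_integer   <- typeof(3L)      # the L suffix marks an integer
type_double    <- typeof(3.5)     # floating point numbers are "double"
type_character <- typeof("three")

# Mixing types in c() converts everything to one common type.
mixed <- c(1, "a", TRUE)

# No declaration is needed: the same name can refer to a number,
# and later to a character string.
x <- 3
x <- "three"

print(c(type_integer, type_double, type_character))
print(mixed)
print(typeof(x))
```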

Object names in R
A syntactically valid object name in R consists of letters, numbers and the dot or underscore
characters, and starts with a letter, or with a dot that is not followed by a number. Thus, names
such as ".2way" are not valid. Neither are the following reserved words: if, else, repeat,
while, function, for, in, next, break, TRUE, FALSE, NULL, Inf, NaN,
NA, NA_integer_, NA_real_, NA_complex_, NA_character_.
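One way to check a candidate name, sketched below, is to compare it with the output of the base R function make.names(), which returns a name unchanged if it is already syntactically valid and repairs it otherwise. The vector candidates and the name is_valid_name are made up for this illustration.

```r
candidates <- c("toss_outcome", ".2way", "if", "my var", ".hidden")

# make.names() leaves valid names alone and alters invalid ones,
# so a name is valid exactly when it comes back unchanged.
repaired      <- make.names(candidates)
is_valid_name <- repaired == candidates

print(data.frame(candidates, repaired, is_valid_name))
```

Here ".2way" fails because the dot is followed by a number, "if" fails because it is a reserved word, and "my var" fails because it contains a space.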

Note that object names can include dots (periods), and many function and argument names in R
take this form, such as data.frame() or na.rm. Current best practice in R is to avoid the use
of a dot in object names, and to use an underscore instead. Object names may not have spaces in
them.
White space
R mostly doesn't care about "whitespace" (spaces and tabs, and indenting).

Program code is usually written one complete statement to a line. If a line of code contains an
incomplete statement, the R interpreter will look for the completion of the statement on the
next line.
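A short sketch of these rules follows; the object names are chosen only for the example. The last case is worth noting: a line that already forms a complete statement ends there, so where you break a long line does matter.

```r
# Spaces within a statement make no difference.
x<-c(1,2,3)
y <- c( 1,  2,   3 )
same_vectors <- identical(x, y)

# The first line ends with +, so the statement is incomplete
# and R continues reading on the next line.
total <- 1 + 2 +
  3

# Here the first line is already complete, so partial is 3,
# and the second line is a separate statement.
partial <- 1 + 2
+ 3

print(same_vectors)
print(total)
print(partial)
```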

Comments
Comments are written by starting the text with a hash symbol # (also known as a pound
symbol); R ignores everything from the hash to the end of the line. There is no specific
provision for multiline comments in R - just prepend each line of the comment with a hash.

# this is a comment, and is ignored by R

# so is this

c <- 3 + 4 # this is a line of code containing a complete expression

Accessing the R help system

R has an extensive help system that documents almost every aspect of the R language and
analysis ecosystem. All the documentation is available online via the R web site, as well as from
third-party providers. Be warned, the documentation is very, very extensive - it runs to about
3500 pages just for the core R system and standard packages alone, and thus is best treated as a
reference resource rather than a manual that can be read from cover-to-cover.

Individual help pages can also be accessed offline while using R by typing a question mark
followed by a function or command name in a code cell in Jupyter Notebook e.g. ?order will
display the help page for the order() function.

One caveat about the official R help pages: they are written with experienced, highly technical
readers in mind, and can often seem inscrutable or almost deliberately difficult to understand.
However, most help pages include example code to help demonstrate what they are
documenting.

Additional resource for learning R programming

These R Corners will build weekly into a complete set, providing all the R knowledge required for
this course. However, if you wish to learn about R programming in greater depth, then the
following resource is recommended:

• the Safari book (freely available through UNSW library in e-book online format) Learning
R by Richard Cotton.

Hints for completion of Phase I R exercises

Let's assume we wish to use R to simulate the toss of a fair coin. For each activity, the first step is
to devise an appropriate vector of possible outcomes from which to randomly allocate. The
second step is to then use the sample() function to generate the random allocation.
toss_outcome <- c(0, 1)

The above code creates an R object 'toss_outcome', which is a vector [0,1].

Note the assignment operator <- (a less-than sign followed by a hyphen, often read as 'gets') in
R. This assigns whatever is on the right to the object on the left. We do not favour the use of
'=' for assignment in R; many R users consider it to be bad practice!

We use c() (concatenate) to join the elements of the vector.

White space is not important in R; notice how I leave a space after a comma to aid readability of
the code. Equally, you can split long lines of code over multiple lines.

Running the code cell creates toss_outcome. If we wish to see the object, we can just call it thus:

toss_outcome

However, we can wrap the object in the print() function:

print(toss_outcome)

This is preferable, as it makes it explicit that we wish to view the object 'toss_outcome'.

Note that we are using 0/1 to denote Tails/Heads. This is standard practice for binary (No/Yes)
outcomes. However, we could have used strings (enclosed in quotation marks), if we so desired:

toss_outcome_string <- c("Tails", "Heads")

print(toss_outcome_string)

Now that we have our vector of possible outcomes from tossing a coin, let's simulate a single
coin toss in R:

sample(toss_outcome, size = 1)

How did I know to use the sample() function? The easiest way to discover the required function is
a simple Google search with 'R' included in the search terms. The R help can be accessed by '?'
(or '??' for a general search) e.g.:

? sample

We now have the R help for the sample() function. Let's take a closer look.

sample(x, size, replace = FALSE, prob = NULL)

The function is sample(). The things inside the brackets are known as the function arguments, or
just the arguments. Think of these as the tuning parameters that can be varied to provide the
required flexibility. For example, suppose we wish to simulate 10 coin tosses. We simply specify
the 'size' argument to be 10, thus:
sample(toss_outcome, size = 10)

Hmm, an error. Notice the third argument, 'replace = FALSE', in the help page. This means that if
we don't specify the 'replace' argument in our function call, it defaults to FALSE, so each value
can be drawn only once. After the 1st coin toss, our vector 'toss_outcome' has only 1 value left
to sample (the opposite of what was tossed in the first toss), and after the 2nd coin toss there
are no options left, so R cannot draw 10 values from a vector of 2. Thus, we need to specify
'replace=TRUE'. This means we put the first toss back into the list of possible outcomes for the
2nd toss, and so on...

sample(toss_outcome, size = 10, replace = TRUE)
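If you would like to see the error without stopping your code, one option is to wrap the call in tryCatch(), as in the sketch below. The names error_message and with_replacement are made up for this example, and tryCatch() is not otherwise covered in this R Corner.

```r
toss_outcome <- c(0, 1)

# Without replacement, 10 values cannot be drawn from a vector of 2.
# tryCatch() captures the error message instead of halting.
error_message <- tryCatch(
  sample(toss_outcome, size = 10),
  error = function(e) conditionMessage(e)
)

# With replacement, each toss draws from both outcomes again.
with_replacement <- sample(toss_outcome, size = 10, replace = TRUE)

print(error_message)
print(with_replacement)
```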

Note that if we specify the arguments in the default order, we do not need to name the
argument. Thus:

sample(toss_outcome, 10, TRUE)

However, naming the arguments is useful if we wish to pass the arguments 'out of order' e.g.:

sample(toss_outcome, replace = TRUE, size = 10)
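The argument names, their default order and their default values can also be inspected from code rather than from the help page. The sketch below uses the base R function formals() for this; the names arg_names and default_replace are chosen for the example.

```r
# The names of the arguments of sample(), in their default order.
arg_names <- names(formals(sample))

# The default value of the 'replace' argument.
default_replace <- formals(sample)$replace

print(arg_names)
print(default_replace)
```

This is the order in which unnamed arguments are matched, which is why sample(toss_outcome, 10, TRUE) treats 10 as 'size' and TRUE as 'replace'.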

Notice how the last 3 simulations of 10 coin tosses give different results. set.seed() allows us to
set the starting point, or seed, for the pseudo-random number generator, or PRNG, e.g.:

set.seed(1010) # For reproducibility

sample(toss_outcome, 10, TRUE)

This is good news for reproducible coding! I can now tell you that you just got the result:

1 1 1 1 1 1 0 1 1 0
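You can check the effect of the seed for yourself with a sketch like the one below, which resets the seed before each of two runs and compares them. The names first_run and second_run are made up for this example. The exact values drawn may depend on your version of R, so the sketch only checks that the two runs agree with each other.

```r
toss_outcome <- c(0, 1)

set.seed(1010)
first_run <- sample(toss_outcome, 10, TRUE)

set.seed(1010)   # reset to the same starting point
second_run <- sample(toss_outcome, 10, TRUE)

print(first_run)
print(second_run)
print(identical(first_run, second_run))
```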

Finally, we can use table() to sum the number of tails and heads:

set.seed(1010)
tosses <- sample(toss_outcome, 10, TRUE)
table(tosses) # the calls could also be nested, i.e. table(sample(toss_outcome, 10, TRUE))

The result is 2 tails and 8 heads.
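To make the table easier to read, the 0/1 values can be given labels before counting. The sketch below types in the ten results quoted above directly, rather than re-running the simulation, so that the counts do not depend on the random number generator. The names counts, labelled and proportions are chosen for the example, and factor() and prop.table() are base R functions not otherwise covered in this R Corner.

```r
# The ten tosses quoted in the text, entered by hand.
tosses <- c(1, 1, 1, 1, 1, 1, 0, 1, 1, 0)

# Counts of each value.
counts <- table(tosses)

# The same counts with 0/1 labelled as Tails/Heads.
labelled <- table(factor(tosses, levels = c(0, 1),
                         labels = c("Tails", "Heads")))

# Counts converted to proportions of all tosses.
proportions <- prop.table(labelled)

print(counts)
print(labelled)
print(proportions)
```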
