EXERCISE10

The document contains a series of practical exercises using R programming, covering basic arithmetic operations, vector assignments, data manipulation, and statistical analysis. It includes step-by-step solutions for various exercises involving calculations, data frames, and functions. The exercises are designed to help users gain familiarity with R programming and its applications in data analysis.

Uploaded by

23stcs21
PRACTICAL 10: EXERCISES USING R PROGRAMMING

EXERCISE 1

1. 67 + 78 - 45
>67+78-45
[1] 100

2. 4^3 + 5^2 - 3×81
>4^3+5^2-3*81
[1] -154

3. √28+∛547- 47/53
>sqrt(28)+547^(1/3)-47/53
[1] 12.583

4. e^3 + 12% of 75
solution

>exp(3)+0.12*75
[1] 29.08554

5. ∛729+log(23/42)
solution

>729^(1/3)+log(23/42)
[1] 8.397825


6. (1.01)^6 + (2.67)^3.4 - (3.2)^(-2.1)
solution

>(1.01)^6+(2.67)^3.4-(3.2)^(-2.1)
[1] 29.16739

7. 23^3 + 456^2 - 56


solution
>23^3+456^2-56
[1] 220047


EXERCISE-2
1. Assign single values to X and Y as 3 and 4. Then find Z = X + Y; W = X*Y; A = Z + W;
B = A^2 + √Y; C = X^3 + Y^3
SOLUTION
>x=3;y=4
>z=x+y;z
[1] 7
>w=x*y;w
[1] 12
>a=z+w;a
[1] 19
>b=a^2+sqrt(y);b
[1] 363
>c=x^3+y^3;c
[1] 91

2. Assign a combination of values (of equal length) to X and Y and repeat the above calculations,
for example X = [2, 3, 5, 7] and Y = [11, 13, 17, 19]
SOLUTION
>x=c(2,3,5,7);y=c(11,13,17,19)
>z=x+y;z
[1] 13 16 22 26
>w=x*y;w
[1] 22 39 85 133
>a=z+w;a
[1] 35 55 107 159
>b=a^2+sqrt(y);b
[1] 1228.317 3028.606
[3] 11453.123 25285.359


>c=x^3+y^3;c
[1] 1339 2224 5038 7202

3. For problem 2 obtain the values for X/2, Y/3, X/Y


SOLUTION
>x/2
[1] 1.0 1.5 2.5 3.5
>y/3
[1] 3.666667 4.333333 5.666667
[4] 6.333333
>x/y
[1] 0.1818182 0.2307692
[3] 0.2941176 0.3684211

4. Assign a vector of character strings (“Bob”, “Jack”, “Jill”) for names.


>names=c("Bob","Jack","Jill");names
[1] "Bob" "Jack" "Jill"


EXERCISE-3

1.Use sequence operator to get a sequence


i.from 1 to 20
ii.from 20 to 10
iii.From 2 to 30 of width 2

i.1:20
[1] 1 2 3 4 5 6 7 8
[9] 9 10 11 12 13 14 15 16
[17] 17 18 19 20
ii.20:10
[1] 20 19 18 17 16 15 14 13
[9] 12 11 10
iii.2*1:15
[1] 2 4 6 8 10 12 14 16
[9] 18 20 22 24 26 28 30

2.Assign value 15 to n and find the difference between 1: n-1 and 1:(n-1)
SOLUTION
>n=15
>1:n-1
[1] 0 1 2 3 4 5 6 7
[9] 8 9 10 11 12 13 14
>1:(n-1)
[1] 1 2 3 4 5 6 7 8


[9] 9 10 11 12 13 14

EXERCISE-4
Enter the following data using rep function
a) 1,1,1,1,2,2,3,3,3,3,3
b) 4,4,4,4,5,5,6,6,6,6,7,8,8,8
c) 1,1,2,2,3,3,4,4,5,5,6,6
d) 10,10,10,10,11,11,11,11,12,12,12,12
SOLUTION
>a=c(rep(1,4),rep(2,2),rep(3,5));a
[1] 1 1 1 1 2 2 3 3 3 3 3
>b=c(rep(4,4),rep(5,2),rep(6,4),7,rep(8,3));b
[1] 4 4 4 4 5 5 6 6 6 6 7 8 8 8
>c=rep(1:6,each=2);c
[1] 1 1 2 2 3 3 4 4 5 5 6 6
>d=rep(10:12,each=4);d
[1] 10 10 10 10 11 11 11 11 12 12 12 12


EXERCISE-5

1. Using the data.frame function, make the following frequency distributions.

AGE  FREQ    MARKS  FREQ    VARIABLE  FREQ
11   5       15     2       13        1
12   10      20     2       17        1
13   120     25     3       19        2
14   22      30     3       24        2
15   13      35     3       29        3
16   5       40     4       33        3

SOLUTION
>age=11:16;freq=c(5,10,120,22,13,5);d1=data.frame(age,freq);d1
  age freq
1  11    5
2  12   10
3  13  120
4  14   22
5  15   13
6  16    5
> marks=seq(15,40,by=5);freq=c(2,2,3,3,3,4);d2=data.frame(marks,freq);d2
  marks freq
1    15    2
2    20    2
3    25    3
4    30    3
5    35    3
6    40    4
> variable=c(13,17,19,24,29,33);freq=c(1,1,2,2,3,3);d3=data.frame(variable,freq);d3
  variable freq
1       13    1
2       17    1
3       19    2
4       24    2
5       29    3
6       33    3

2.For the above data change the names of the columns


Mid age and No of cases.
Score and No of students
Income in ‘000 and No of families
SOLUTION

>colnames(d1)=c("mid age","no of cases");d1
  mid age no of cases
1      11           5
2      12          10
3      13         120
4      14          22
5      15          13
6      16           5
>colnames(d2)=c("score","no of students");d2
  score no of students
1    15              2
2    20              2
3    25              3
4    30              3
5    35              3
6    40              4
>colnames(d3)=c("income in '000","no of families");d3
  income in '000 no of families
1             13              1
2             17              1
3             19              2
4             24              2
5             29              3
6             33              3


EXERCISE-6

1.Following is the data set: 5, 12, 21, 25, 25, 30, 25, 40, 42, 38, 50, 45, 60, 65, 50,70, 80, 50,13.
Use the built-in functions discussed above, on the data set x.
SOLUTION

>x=scan()
1: 5 12 21 25 25 30 25 40 42 38 50 45 60 65 50 70 80 50 13
20:
Read 19 items
>length(x)
[1] 19
>max(x)
[1] 80
>min(x)
[1] 5
>range(x)
[1] 5 80
>quantile(x)
0% 25% 50% 75% 100%
5 25 40 50 80
> IQR(x)
[1] 25
>mean(x)
[1] 39.26316
>median(x)
[1] 40
>var(x)

[1] 428.9825
>sort(x)
[1] 5 12 13 21 25 25 25 30 38 40
[11] 42 45 50 50 50 60 65 70 80


EXERCISE-7

For each of the given data sets:

Enter the data set using either the scan function or the c function.
Find the index of its maximum and minimum values.
Find the summary.
Apply the other built-in functions to the data set.
Construct the discrete frequency distribution.

Data set I: 13, 17, 24, 21, 28, 28, 13, 27, 17, 23, 17, 24, 21, 17, 23, 21
SOLUTION
>x=scan()
1: 13 17 24 21 28 28 13 27 17 23 17 24 21 17 23 21
17:
Read 16 items
>max(x)
[1] 28

>min(x)
[1] 13
>summary(x)
Min. 1st Qu. Median Mean
13.00 17.00 21.00 20.88
3rd Qu. Max.
24.00 28.00
>quantile(x)
0% 25% 50% 75% 100%


13 17 21 24 28
>names(x)
NULL
>table(x)
x
13 17 21 23 24 27 28
2 4 3 2 2 1 2
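
The exercise also asks for the index of the maximum and minimum, which max() and min() do not give. A minimal sketch using base R's which.max(), which.min() and which(); the vector is re-entered with c() so the block runs without interactive input, and the variable names i_max, i_min, all_max and all_min are chosen here for illustration only.

```r
# Data set I, entered with c() instead of scan() so the block is self-contained
x <- c(13, 17, 24, 21, 28, 28, 13, 27, 17, 23, 17, 24, 21, 17, 23, 21)

# which.max()/which.min() return the position of the FIRST extreme value
i_max <- which.max(x)
i_min <- which.min(x)

# which() returns every position that attains the extreme (ties included)
all_max <- which(x == max(x))
all_min <- which(x == min(x))

print(c(first_max = i_max, first_min = i_min))
print(all_max)
print(all_min)
```

Because 28 and 13 each occur twice in this data set, the which() form is the safer choice when all positions are wanted.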

Data set II: 0, 1, 2, 3, 4, 5, 6, 6, 5, 4, 4, 5, 5, 4, 4, 3, 3, 3, 3, 2, 2, 2, 3, 2, 3, 2, 2, 2, 1, 1, 1, 0, 0, 1,
0, 3, 3, 2, 2, 2, 3, 2, 3, 2, 2, 2, 1, 1, 1, 0, 0, 1, 0
SOLUTION
>x=scan()
1: 0 1 2 3 4 5 6 6 5 4 4 5 5 4 4 3 3 3 3 2 2 2 3 2 3 2 2 2 1 1 1 0 0 1 0 3 3 2 2 2 3 2 3 2 2 2 1 1 1 0 0 1 0
54:
Read 53 items
>max(x)
[1] 6
>min(x)
[1] 0
>summary(x)
Min. 1st Qu. Median Mean
0.00 1.00 2.00 2.34
3rd Qu. Max.
3.00 6.00
>quantile(x)
0% 25% 50% 75% 100%
0 1 2 3 6
>table(x)
x
 0  1  2  3  4  5  6
 7  9 15 11  5  4  2


EXERCISE-8

1.A psychologist estimates the I.Q. of 60 children. The values are as follows :103, 98, 87, 85, 67,
96, 115, 109, 127, 103, 95, 123, 94, 88, 102, 76, 73, 80, 84, 102, 115, 93, 76, 81, 132, 90, 119,
84, 97, 120, 114, 101, 153, 98, 99, 105, 110, 107, 110, 128, 89, 112, 118, 101, 122, 146, 96, 109,
72, 97, 94, 94, 79, 79, 100, 54, 102, 89, 43, 111.
SOLUTION
>x=c(103, 98, 87, 85, 67, 96, 115, 109, 127, 103, 95, 123, 94, 88, 102, 76, 73, 80, 84, 102, 115,
93, 76, 81, 132, 90, 119, 84, 97, 120, 114, 101, 153, 98, 99, 105, 110, 107, 110, 128, 89, 112,
118, 101, 122, 146, 96, 109, 72, 97, 94, 94, 79, 79, 100, 54, 102, 89, 43, 111)
>summary(x)
Min. 1st Qu. Median Mean 3rd Qu. Max.
43.00 87.75 98.50 99.10 110.25 153.00
>(153-43)/5
[1] 22
>seq(43,160,by=22)
[1] 43 65 87 109 131 153
>ci=seq(43,160,by=22)
>length(x)
[1] 60
>range(x)
[1] 43 153
>y=cut(x,ci,right=F);y

[1] [87,109) [87,109) [87,109) [65,87)


[5] [65,87) [87,109) [109,131) [109,131)
[9] [109,131) [87,109) [87,109) [109,131)
[13] [87,109) [87,109) [87,109) [65,87)
[17] [65,87) [65,87) [65,87) [87,109)


[21] [109,131) [87,109) [65,87) [65,87)


[25] [131,153) [87,109) [109,131) [65,87)
[29] [87,109) [109,131) [109,131) [87,109)
[33] <NA> [87,109) [87,109) [87,109)
[37] [109,131) [87,109) [109,131) [109,131)
[41] [87,109) [109,131) [109,131) [87,109)
[45] [109,131) [131,153) [87,109) [109,131)
[49] [65,87) [87,109) [87,109) [87,109)
[53] [65,87) [65,87) [87,109) [43,65)
[57] [87,109) [87,109) [43,65) [109,131)

5 Levels: [43,65) [65,87) [87,109) ... [131,153)


>fd=cbind(table(y));fd
[,1]
[43,65) 2
[65,87) 12
[87,109) 27
[109,131) 16
[131,153) 2
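
Note that the table above accounts for only 59 of the 60 children: with right=F the last class [131,153) is open on the right, so the maximum value 153 becomes NA. One possible remedy, sketched here under the assumption that the same five classes are wanted, is the include.lowest argument of cut(), which closes the outermost interval.

```r
# I.Q. values of the 60 children, as given in the exercise
x <- c(103, 98, 87, 85, 67, 96, 115, 109, 127, 103, 95, 123, 94, 88, 102,
       76, 73, 80, 84, 102, 115, 93, 76, 81, 132, 90, 119, 84, 97, 120,
       114, 101, 153, 98, 99, 105, 110, 107, 110, 128, 89, 112, 118, 101,
       122, 146, 96, 109, 72, 97, 94, 94, 79, 79, 100, 54, 102, 89, 43, 111)

# Same class limits as in the solution: 43, 65, 87, 109, 131, 153
ci <- seq(43, 153, by = 22)

# right = FALSE gives [a,b) classes; include.lowest = TRUE additionally
# closes the last class, so the maximum (153) is counted instead of dropped
y <- cut(x, breaks = ci, right = FALSE, include.lowest = TRUE)
fd <- cbind(frequency = table(y))
print(fd)

# The class frequencies should now add up to the number of observations
print(sum(fd) == length(x))
```

An alternative with the same effect is to extend the breaks past the maximum, for example seq(43, 175, by = 22), at the cost of a wider final class.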

2.The following data regarding weight of new born babies is obtained from the office records of
a hospital. Weight (kgs.) 3.7, 3.4, 4.1, 4.0, 3.7, 4.7, 3.3, 2.4, 3.1, 4.2, 3.8, 3.6, 4.2, 4.3, 2.9, 3.6,
3.3, 4.8, 4.0, 3.9, 3.5, 3.5, 3.8, 3.8, 4.2, 3.9, 4.9, 3.2, 4.0, 3.8, 3.2, 2.7, 3.4, 3.3, 3.0, 3.1, 3.5, 3.7,
3.9, 4.3, 3.8, 3.7, 3.0, 4.4, 4.1, 3.6, 3.7, 3.4, 3.7, 3.3, 3.5, 3.7, 3.0, 2.9, 3.1, 3.3, 4.2.
SOLUTION

>x=c(3.7, 3.4, 4.1, 4.0, 3.7, 4.7, 3.3, 2.4, 3.1, 4.2, 3.8, 3.6, 4.2, 4.3, 2.9, 3.6, 3.3, 4.8, 4.0, 3.9, 3.5,
3.5, 3.8, 3.8, 4.2, 3.9, 4.9, 3.2, 4.0, 3.8, 3.2, 2.7, 3.4, 3.3, 3.0, 3.1, 3.5, 3.7, 3.9, 4.3, 3.8, 3.7, 3.0,
4.4, 4.1, 3.6, 3.7, 3.4, 3.7, 3.3, 3.5, 3.7, 3.0, 2.9, 3.1, 3.3, 4.2)


>summary(x)
Min. 1st Qu. Median Mean 3rd Qu. Max.
2.400 3.300 3.700 3.651 4.000 4.900
>(4.9-2.4)/5
[1] 0.5
>ci=seq(2.4,5.5,by=0.5)
>y=cut(x,ci,right=F);y
[1] [3.4,3.9) [3.4,3.9) [3.9,4.4) [3.9,4.4) [3.4,3.9)
[6] [4.4,4.9) [2.9,3.4) [2.4,2.9) [2.9,3.4) [3.9,4.4)
[11] [3.4,3.9) [3.4,3.9) [3.9,4.4) [3.9,4.4) [2.9,3.4)
[16] [3.4,3.9) [2.9,3.4) [4.4,4.9) [3.9,4.4) [3.9,4.4)
[21] [3.4,3.9) [3.4,3.9) [3.4,3.9) [3.4,3.9) [3.9,4.4)
[26] [3.9,4.4) [4.9,5.4) [2.9,3.4) [3.9,4.4) [3.4,3.9)
[31] [2.9,3.4) [2.4,2.9) [3.4,3.9) [2.9,3.4) [2.9,3.4)
[36] [2.9,3.4) [3.4,3.9) [3.4,3.9) [3.9,4.4) [3.9,4.4)
[41] [3.4,3.9) [3.4,3.9) [2.9,3.4) [4.4,4.9) [3.9,4.4)
[46] [3.4,3.9) [3.4,3.9) [3.4,3.9) [3.4,3.9) [2.9,3.4)
[51] [3.4,3.9) [3.4,3.9) [2.9,3.4) [2.9,3.4) [2.9,3.4)
[56] [2.9,3.4) [3.9,4.4)
6 Levels: [2.4,2.9) [2.9,3.4) [3.4,3.9) ... [4.9,5.4)
>fd=cbind(table(y));fd
[,1]
[2.4,2.9) 2
[2.9,3.4) 15
[3.4,3.9) 22
[3.9,4.4) 14
[4.4,4.9) 3
[4.9,5.4) 1


EXERCISE-9

1.Access the data set treering containing tree-ring widths in dimensionless unit, from the base
package of R. Use R-commands to answer the following
How many observations are in the data set?
What are the minimum and maximum observations?
List the observations greater than 1.8.
Find the quartiles of the data set.
Find the index of the maximum and minimum values of the data set.
Construct an appropriate frequency distribution table.
SOLUTION:

>data(treering);d=treering;
>length(d)
[1] 7980
>summary(d)
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.0000 0.8370 1.0340 0.9968 1.1970 1.9080
>d[d>1.8]
[1] 1.844 1.850 1.856 1.820 1.884 1.908 1.826 1.802
>length(d[d>1.8])
[1] 8
>d[1:5]
[1] 1.345 1.077 1.545 1.319 1.413
>d[7976:7980]
[1] 1.027 1.173 1.471 1.444 1.160
>which(d==.0000)
[1] 1395
>which(d==1.9080)

[1] 2185
>ci=seq(0,2,0.2);ci
[1] 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0
>y=cut(d,ci,right=F);fd=cbind(table(y));fd
[,1]
[0,0.2) 121
[0.2,0.4) 254
[0.4,0.6) 473
[0.6,0.8) 914
[0.8,1) 1795
[1,1.2) 2457
[1.2,1.4) 1459
[1.4,1.6) 430
[1.6,1.8) 69
[1.8,2) 8

2.Access the data set rivers, from the base package of R. Use R-commands to answer the
following
How many observations are in the data set?
What are the minimum and maximum observations?
List the observations greater than the median.
Find the quartiles of the data set.
Find the index of the maximum and minimum values of the data set.
Construct an appropriate frequency distribution table.

SOLUTION:
>data(rivers);d=rivers
>length(d)

[1] 141
>summary(d)
Min. 1st Qu. Median Mean 3rd Qu. Max.
135.0 310.0 425.0 591.2 680.0 3710.0
>d[d>425.0]
[1] 735 524 450 1459 465 600 870 906 1000 600
[11] 505 1450 840 1243 890 525 720 850 630 730
[21] 600 710 470 680 570 560 900 625 2348 1171
[31] 3710 2315 2533 780 460 431 760 618 981 1306
[41] 500 696 605 1054 735 435 490 460 1270 545
[51] 445 1885 800 538 1100 1205 610 540 1038 444
[61] 620 652 900 525 529 500 720 430 671 1770
>d[1:5]
[1] 735 320 325 392 524
>which(d==135.0)
[1] 8
>which(d==3710.0)
[1] 68
>(3710-135)/5
[1] 715
>ci=seq(135,3710,by=715)
>y=cut(d,ci,right=F,include.lowest=T);fd=cbind(table(y));fd
[,1]
[135,850) 117
[850,1.56e+03) 18
[1.56e+03,2.28e+03) 2
[2.28e+03,3e+03) 3
[3e+03,3.71e+03] 1


3.For the given data


X 0 1 2 3 4
F 6 28 36 25 5
Add a column of cumulative frequency(cf)
Add a column of relative frequency(rf) (frequency/total frequency)
Add a column of relative cumulative frequency (cf/total frequency)
SOLUTION:

>x=0:4
>f=c(6,28,36,25,5)
>d1=data.frame(x,f);d1
  x  f
1 0  6
2 1 28
3 2 36
4 3 25
5 4  5
>cf=transform(d1,cfreq=cumsum(f));cf
  x  f cfreq
1 0  6     6
2 1 28    34
3 2 36    70
4 3 25    95
5 4  5   100
>cf1=transform(d1,rf=f/sum(f));cf1
  x  f   rf
1 0  6 0.06
2 1 28 0.28
3 2 36 0.36
4 3 25 0.25
5 4  5 0.05
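
The third part of the question, the relative cumulative frequency (cf/total frequency), is not worked above. A minimal sketch that builds all three columns in one data frame; the column names cf, rf and rcf are chosen here for illustration.

```r
# Frequency distribution from the exercise
x <- 0:4
f <- c(6, 28, 36, 25, 5)
d1 <- data.frame(x, f)

# Cumulative frequency and relative frequency in one transform() call
d1 <- transform(d1, cf = cumsum(f), rf = f / sum(f))

# Relative cumulative frequency = cumulative frequency / total frequency
d1$rcf <- d1$cf / sum(d1$f)

print(d1)
```

The last value of rcf is 1 by construction, which gives a quick check that no class was missed.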

4. Access the data set swiss (variables include Fertility, Agriculture, Examination, Education, Catholic), from the base package of R. Use R-commands to answer the following:
Find the mean and variance for Agriculture
Construct a continuous frequency distribution for either Examination or Education
Find the number of observations that have Catholic less than 60
Get all the information with respect to the 6th row
Get all the information with respect to the 6th column
Get all the information with respect to the 5th, 10th, ..., 45th observations.
Get all the information with respect to the 1st, 17th, 29th, 33rd, 47th observations.

SOLUTION:
>data("swiss")
>mean(swiss$Agriculture)
[1] 50.65957
>var(swiss$Agriculture)
[1] 515.7994
>summary(swiss$Examination)
Min. 1st Qu. Median Mean 3rd Qu. Max.
3.00 12.00 16.00 16.49 22.00 37.00
>(37-3)/5
[1] 6.8
>ci=seq(3,45,by=5)
>x=cut(swiss$Examination,ci,right=F)
>fd=cbind(table(x));fd
[,1]
[3,8) 8
[8,13) 6
[13,18) 15
[18,23) 10
[23,28) 4
[28,33) 2
[33,38) 2
[38,43) 0
>sum(swiss$Catholic< 60)
[1] 31
>swiss[6, ]
Fertility Agriculture Examination Education
Porrentruy 76.1 35.3 9 7
Catholic Infant.Mortality
Porrentruy 90.57 26.6
>swiss[, 6]
[1] 22.2 22.2 20.2 20.3 20.6 26.6 23.6 24.9 21.0 24.4
[11] 24.5 16.5 19.1 22.7 18.7 21.2 20.0 20.2 10.8 20.0
[21] 18.0 22.4 16.7 15.3 21.0 23.8 18.0 16.3 20.9 22.5
[31] 15.1 19.8 18.3 19.4 20.2 17.8 16.3 18.1 20.3 20.5
[41] 18.9 23.0 20.0 19.5 18.0 18.2 19.3

>observations_indices<- c(5, 10, seq(15, 45, by = 5))


>observations_info<- swiss[observations_indices, ]


>print("Information of specified observations:")


[1] "Information of specified observations:"
>print(observations_info)

Fertility Agriculture Examination


Neuveville 76.9 43.5 17
Sarine 82.9 45.2 16
Cossonay 61.7 69.3 22
Lavaux 65.1 73.0 19
Oron 72.5 71.2 12
Yverdon 65.4 49.5 15
Monthey 79.4 64.9 7
La Chauxdfnd 65.7 7.7 29
V. De Geneve 35.0 1.2 37
Education Catholic Infant.Mortality
Neuveville 15 5.16 20.6
Sarine 13 91.38 24.4
Cossonay 5 2.82 18.7
Lavaux 9 2.84 20.0
Oron 1 2.40 21.0
Yverdon 8 6.10 22.5
Monthey 3 98.22 20.2
La Chauxdfnd 11 13.79 20.5
V. De Geneve 53 42.34 18.0

>specified_indices<- c(1, 17, 29, 33, 47)


>specified_observations_info<- swiss[specified_indices, ]
>print("Information of specified observations:")


[1] "Information of specified observations:"


>print(specified_observations_info)

Fertility Agriculture Examination Education


Courtelary 80.2 17.0 15 12
Grandson 71.7 34.0 17 8
Vevey 58.3 26.8 25 19
Herens 77.3 89.7 5 2
Rive Gauche 42.8 27.7 22 29

Catholic Infant.Mortality
Courtelary 9.96 22.2
Grandson 3.30 20.0
Vevey 18.46 20.9
Herens 100.00 18.3
Rive Gauche 58.33 19.3


EXERCISE-10

1.Access the data set cars from the base library of R


Construct Boxplot for the variables in it.
Obtain the summary of the variables

>data(cars);d1=cars;attach(d1)
>dim(d1)
[1] 50 2
>names(d1)
[1] "speed" "dist"
>s=speed
>boxplot(s,xlab="speed")
>d=dist
>boxplot(d,xlab="distance")
>identify(rep(1,length(d)),d);

>d1[49,]


speed dist
49 24 120

2. Access the data cats from the library MASS and plot sexwise boxplot for the variable
Hwt(heart weight)
>library(MASS)
> head(cats)
  Sex Bwt Hwt
1 F 2.0 7.0
2 F 2.0 7.4
3 F 2.0 9.5
4 F 2.1 7.2
5 F 2.1 7.3
6 F 2.1 7.6
> data(cats);
>attach(cats);names(cats);
The following objects are masked from cats (pos = 3):

Bwt, Hwt, Sex

[1] "Sex" "Bwt" "Hwt"


>boxplot(Hwt~Sex);
>identify(as.numeric(Sex),Hwt)
>boxplot(Hwt~Sex,col=c("blue","red"));
>identify(as.numeric(Sex),Hwt)

> cats[c(47,144),]
    Sex Bwt  Hwt
47    F 3.0 13.0
144   M 3.9 20.5

>

3.Access the data set InsectSprays from the base package of R. Construct parallel boxplots for
different sprays.
Hint: >boxplot(count~spray)

>data("InsectSprays")
>attach(InsectSprays);names(InsectSprays);
[1] "count" "spray"

>boxplot(count~spray);


4. Following are the body mass index values (kg/m2) for 14 subjects in sample
24.4, 3.04, 21.4, 25.4, 21.3, 23.8, 20.8, 22.9, 23.2, 21.1, 23.0, 20.6, 26.0, 20.9
i) compute mean, median, variance, standard deviation and coefficient of variation
ii) construct box and whisker plot. If outliers are found identify them.
iii)Compute Bowley’s measure of skewness

> x=c(24.4, 3.04, 21.4, 25.4, 21.3, 23.8, 20.8, 22.9, 23.2, 21.1, 23.0, 20.6, 26.0, 20.9)
> mean(x)
[1] 21.27429
> median(x)
[1] 22.15
> var(x)
[1] 30.62987


>sd(x)
[1] 5.534426
> cv=sd(x)/mean(x)*100
> cv
[1] 26.01463
> Q3=quantile(x,0.75)
> Q1=quantile(x,0.25)
> Q2=quantile(x,0.50)
>bowleys_skewness=(Q3+Q1-2*Q2)/(Q3-Q1);bowleys_skewness
75%
0.1111111
>boxplot(x)
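
Part ii) also asks to identify any outliers, which the plot alone does not list. A minimal sketch using boxplot.stats() from grDevices, which returns the statistics behind a box plot without drawing it; the variable names bs and outliers are chosen here for illustration.

```r
# BMI values exactly as entered in the solution
x <- c(24.4, 3.04, 21.4, 25.4, 21.3, 23.8, 20.8, 22.9, 23.2, 21.1,
       23.0, 20.6, 26.0, 20.9)

# boxplot.stats() computes the hinges, whisker ends and outlying points
bs <- boxplot.stats(x)

# $out holds the values lying beyond 1.5 times the hinge spread
outliers <- bs$out
print(outliers)

# Positions of those values in the original vector
print(which(x %in% outliers))
```

Here the only flagged value is 3.04, the second observation. It may be a data-entry slip, but the source gives no correction, so it is kept as given.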


EXERCISE-11

1.Following are the number of accidents that occurred at 60 major intersections in a certain city
during a weekend: 0 1 0 2 4 2 5 0 3 0
2 0 1 4 4 4 1 2 1 2 5 0
4 1 0 2 1 1 4 2 5 3 2 0
5 1 1 0 6 3 1 5 0 3 0 0
6 3 2 2 3 1 4 0 3 0 0 1
2 4
Prepare a frequency distribution table and draw a bar chart. Comment on the nature of the
distribution.
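SOLUTION
>x=c(0,1,0,2,4,2,5,0,3,0,2,0,1,4,4,4,1,2,1,2,5,0,4,1,0,2,1,1,4,2,5,3,2,0,5,1,1,0,6,3,1,5,0,3,0,0,6,3,2,2,3,1,4,0,3,0,0,1,2,4)
>t=table(x)
>t
x
 0  1  2  3  4  5  6
15 12 11  7  8  5  2
>barplot(t,xlab="number of accidents",ylab="number of intersections")

This indicates positive skewness: most major intersections experienced few or no accidents
during the weekend, while a few had higher counts, up to 6 accidents.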


2.From the information obtained in Q1 draw a pie diagram


>accidents=c(0:6)
>frequency=c(15,12,11,7,8,5,2)
>sum(frequency)
[1] 60
>angles=((frequency/60)*360);angles
[1] 90 72 66 42 48 30 12
>pie(frequency,labels=accidents,main="Pie chart of accidents at intersections")
>pie(frequency,labels=accidents,main="Pie chart of accidents at intersections",col=rainbow(7))

EXERCISE-12
1. Draw a histogram and frequency polygon for the following data.
Height:        0-7  7-14  14-21  21-28  28-35  35-42  42-49  49-56
No. of people: 26   31    35     42     82     71     54     19
> mid=seq(3.5,52.5,7)


> freq=c(26,31,35,42,82,71,54,19)
> y=rep(mid,freq)
> brk=seq(0,56,7)
> hist(y,breaks=brk)
> hist(y,breaks=brk,col="green")
> plot(mid,freq,type="b")
> h=hist(y,breaks=brk,col="light blue");h
$breaks
[1] 0 7 14 21 28 35 42 49 56

$counts
[1] 26 31 35 42 82 71 54 19

$density
[1] 0.010317460 0.012301587 0.013888889 0.016666667 0.032539683 0.028174603
[7] 0.021428571 0.007539683

$mids
[1] 3.5 10.5 17.5 24.5 31.5 38.5 45.5 52.5

$xname
[1] "y"

$equidist
[1] TRUE

attr(,"class")
[1] "histogram"


> lines(c(min(h$breaks),h$mids,max(h$breaks)),lwd=2,c(0,h$counts,0),type="b")
>

2. Plot the histogram and frequency polygon on the same graph for the given data
Class interval: 20-30 30-40 40-50 50-60 60-70 70-80 80-90
Frequency:      10    24    18    12    8     5     3

> lower_limits=seq(20,80,by=10)
> frequencies=c(10,24,18,12,8,5,3)
> midpoints=lower_limits+5
> data=rep(midpoints,times=frequencies)
> class_intervals=seq(20,90,by=10)
> hist(data,breaks=class_intervals,main="histogram",xlab="class intervals",ylab="frequency",col="light pink",border="black")
> points(midpoints,frequencies,type="b",col="black",pch=19,lwd=2,cex=1.5)


EXERCISE-13

1. Draw the scatter plot and compute both correlation coefficients for the following
data
i)
X 0 4 8 12
Y 8.34 8.89 9.16 9.50

ii)
A 11.1 10.3 12.0 15.1 13.7 18.5 17.3 14.2 14.8 15.3
B 10.9 14.2 13.8 21.5 13.2 21.1 16.4 19.3 17.4 19.0

iii)
C 5.12 6.18 6.77 6.65 6.36 5.90 5.48 6.02 10.34 8.51
D 2.30 2.54 2.95 3.77 4.18 5.31 5.53 8.83 9.48 14.20

>x=c(0,4,8,12)
>y=c(8.34,8.89,9.16,9.50)
>p=plot(x,y)
>cor(x,y,method="spearman")
[1] 1


>a=c(11.1,10.3,12.0,15.1,13.7,18.5,17.3,14.2,14.8,15.3)
>b=c(10.9,14.2,13.8,21.5,13.2,21.1,16.4,19.3,17.4,19.0)
>p=plot(a,b)
>cor(a,b,method="spearman")
[1] 0.6969697

>c=c(5.12,6.18,6.77,6.65,6.36,5.90,5.48,6.02,10.34,8.51)
>d=c(2.30,2.54,2.95,3.77,4.18,5.31,5.53,8.83,9.48,14.20)
>p=plot(c,d)
>cor(c,d,method="spearman")
[1] 0.4181818
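
The solutions above report only Spearman's rank correlation. Assuming that "both" in the question means Pearson's product-moment coefficient alongside Spearman's, a minimal sketch follows; the helper name both_cor is hypothetical and not part of the original practical.

```r
# Hypothetical helper: returns Pearson's and Spearman's coefficients together
both_cor <- function(u, v) {
  c(pearson  = cor(u, v, method = "pearson"),   # linear association
    spearman = cor(u, v, method = "spearman"))  # monotone (rank) association
}

# Data set i)
x <- c(0, 4, 8, 12)
y <- c(8.34, 8.89, 9.16, 9.50)

# Data set ii)
a <- c(11.1, 10.3, 12.0, 15.1, 13.7, 18.5, 17.3, 14.2, 14.8, 15.3)
b <- c(10.9, 14.2, 13.8, 21.5, 13.2, 21.1, 16.4, 19.3, 17.4, 19.0)

r_xy <- both_cor(x, y)
r_ab <- both_cor(a, b)
print(round(r_xy, 4))
print(round(r_ab, 4))
```

For data set i) the ranks agree perfectly, so Spearman's coefficient is 1, while Pearson's is slightly lower because the points are not exactly on a straight line.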


2.
X1 Y1 X2 Y2
10 8.04 10 9.14
8 6.95 8 8.14
13 7.58 13 8.74
9 8.81 9 8.77
11 8.33 11 9.26
14 9.96 14 8.10
6 7.24 6 6.13
4 4.26 4 3.10
12 10.84 12 9.13
7 4.82 7 7.26
5 5.68 5 4.78

For the above two data sets verify the following:

Mean of x1 is the same as mean of x2
Mean of y1 is the same as mean of y2
Correlation coefficient between (x1,y1) is the same as between (x2,y2)
Draw the scatter plots and comment on the findings

>x1=c(10,8,13,9,11,14,6,4,12,7,5)
>x2=c(10,8,13,9,11,14,6,4,12,7,5)
>mean(x1)
[1] 9
>mean(x2)
[1] 9
>y1=c(8.04,6.95,7.58,8.81,8.33,9.96,7.24,4.26,10.84,4.82,5.68)
>y2=c(9.14,8.14,8.74,8.77,9.26,8.10,6.13,3.10,9.13,7.26,4.78)
>mean(y1)
[1] 7.500909
>mean(y2)
[1] 7.504545
>cor(x1,y1,method = "spearman")
[1] 0.8181818
>cor(x2,y2,method="spearman")
[1] 0.6909091
> p=plot(x1,y1,main="scatter plot 1")
> p=plot(x2,y2,main="scatter plot 2")
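
The claim that the two correlation coefficients agree holds for Pearson's coefficient rather than for the rank-based one. A minimal sketch of the check, using all eleven (x2, y2) pairs from the table (including 6.13); the variable names means, r1 and r2 are chosen here for illustration.

```r
# The two data sets from the table (eleven pairs each)
x1 <- c(10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5)
x2 <- x1
y1 <- c(8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68)
y2 <- c(9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.78)

# Means of all four variables, side by side
means <- c(x1 = mean(x1), x2 = mean(x2), y1 = mean(y1), y2 = mean(y2))
print(round(means, 2))

# Pearson's correlation (the default method of cor())
r1 <- cor(x1, y1)
r2 <- cor(x2, y2)
print(round(c(r1 = r1, r2 = r2), 2))
```

The summary figures agree to two decimals, yet the scatter plots differ: the first is roughly linear and the second is clearly curved, which is the point of drawing them.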


EXERCISE-14

1. The table shows the scores of 10 students on a maths (X) test and a stats (Y) test. The maximum
score in each test was 50.
Obtain the line of regression of X on Y.
Print this equation on the graph.
If it is known that a student gets 28 in stats, what would be his/her score in maths?
X 34 37 36 32 32 36 35 34 29 35
Y 37 37 34 34 33 40 39 37 36 35

x=c(34,37,36,32,32,36,35,34,29,35)
y=c(37,37,34,34,33,40,39,37,36,35)
plot(x,y)
fit=lm(y~x);abline(fit);fit

Call:
lm(formula = y ~ x)

Coefficients:
(Intercept) x
23.7769 0.3654

> text(locator(1),"y=0.3654*x+23.7769")
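
The call above regresses y on x, whereas the question asks for the regression of X on Y and a maths score for a given stats score. A minimal sketch of that fit with lm() and predict(); the names fit_xy and pred are chosen here for illustration.

```r
x <- c(34, 37, 36, 32, 32, 36, 35, 34, 29, 35)  # maths scores (X)
y <- c(37, 37, 34, 34, 33, 40, 39, 37, 36, 35)  # stats scores (Y)

# Regression of X on Y: X is the response, Y the predictor
fit_xy <- lm(x ~ y)
print(coef(fit_xy))

# Estimated maths score for a student who scores 28 in stats
pred <- predict(fit_xy, newdata = data.frame(y = 28))
print(pred)
```

To draw this line, plot(y, x) followed by abline(fit_xy) keeps the predictor on the horizontal axis. The estimate is an extrapolation, since 28 lies below every observed stats score.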


2. Calculate Pearson's coefficient of correlation for the following data.


X: 45 55 56 58 60 65 68 70 75 80 85
Y: 56 50 48 60 62 64 65 70 74 82 90
Plot the line of best fit and Estimate Y when X = 78

>x=c(45,55,56,58,60,65,68,70,75,80,85)
>y=c(56,50,48,60,62,64,65,70,74,82,90)
>cor(x,y)
[1] 0.9188406
>plot(x,y)
>fit=lm(y~x);abline(fit);fit

Call:
lm(formula = y ~ x)

Coefficients:
(Intercept) x
0.9044 0.9917
> text(locator(1),"y=0.9917*x+0.9044")
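
The question also asks for an estimate of Y when X = 78. A minimal sketch using predict() on the fitted model; the name y_hat is chosen here for illustration.

```r
x <- c(45, 55, 56, 58, 60, 65, 68, 70, 75, 80, 85)
y <- c(56, 50, 48, 60, 62, 64, 65, 70, 74, 82, 90)

# Line of best fit of y on x
fit <- lm(y ~ x)

# Estimate Y at X = 78; the newdata column name must match the predictor
y_hat <- predict(fit, newdata = data.frame(x = 78))
print(y_hat)
```

The same value follows by hand from the printed coefficients: 0.9044 + 0.9917 * 78 is approximately 78.26.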


3. Calculate the coefficient of correlation by Karl Pearson's method from the following data
relating to overhead expenses and cost of production
Overhead expenses (Rs. 1000):  80 90 100 110 120 130 140 150 160
Cost of production (Rs. 1000): 15 15 16  19  17  18  16  18  19
Plot the line of best fit and estimate X when Y = 22

> x=10*8:16
> y=c(15,15,16,19,17,18,16,18,19)
>cor(x,y)
[1] 0.6928203
>plot(x,y)
> fit=lm(y~x);abline(fit);fit

Call:
lm(formula = y ~ x)

Coefficients:
(Intercept) x


12.20 0.04

> text(locator(1),"y=0.04*x+12.20")
>


EXERCISE-15

1. The incidence of an occupational disease is such that workers have a 20% chance of
catching it. What is the probability that out of 6 workers chosen (i) 4 or more catch the disease,
(ii) at most 2 catch the disease?
> n=6
> p=0.20
> prob_4_or_more=1-pbinom(3,n,p);prob_4_or_more
[1] 0.01696
> prob_at_most_2=pbinom(2,n,p);prob_at_most_2
[1] 0.90112

2. The probability that a patient recovers from a rare blood disease is 0.21. If 15 people are known to
have contracted this disease, what is the probability that: a) at least 10 survive? b) from 3 to 8
survive?
>n=15
> p=0.21
> prob_atleast_10_survive=1-pbinom(9,n,p);prob_atleast_10_survive
[1] 0.0001745072
> prob_3_to_8_survive=pbinom(8,n,p)-pbinom(2,n,p);prob_3_to_8_survive
[1] 0.6373935

3.Find the probability that seven of ten persons will recover from a tropical disease, given that
the probability is 0.8, that any one of these will recover from the disease.
> n=10
> p=0.8
> prob_7_recovers=dbinom(7,n,p);prob_7_recovers
[1] 0.2013266

4. A basketball player hits on seventy-five percent of his shots from the free throw line. What is the
probability that he makes exactly two of his next four free shots?
> n=4
> p=0.75
> prob_making_2=dbinom(2,n,p);prob_making_2
[1] 0.2109375

5. In a certain city, incompatibility is given as the legal reason in 70% of all divorce cases. Find
the probability that 5 of the next 6 divorce cases in this city will cite incompatibility.
> n=6
> p=0.70
> prob_five_blame=dbinom(5,n,p);prob_five_blame
[1] 0.302526

6. An automobile safety engineer claims that one in ten automobile accidents is due to driver
fatigue. What is the probability that at least three of five automobile accidents are due to driver
fatigue?
> n=5
> p=0.1
> prob_atleast_three=1-pbinom(2,n,p);prob_atleast_three
[1] 0.00856
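
The problems in this exercise mix "exactly k", "at most k" and "at least k" events, which map to different base R functions: dbinom() gives P(X = k) and pbinom() gives P(X <= k). A minimal sketch of the three cases, using n = 10 and p = 0.8 from problem 3; the variable names are chosen here for illustration.

```r
n <- 10
p <- 0.8

p_exact    <- dbinom(7, n, p)        # P(X = 7), probability mass at 7
p_at_most  <- pbinom(7, n, p)        # P(X <= 7), cumulative up to 7
p_at_least <- 1 - pbinom(6, n, p)    # P(X >= 7) = 1 - P(X <= 6)

# The same upper tail, computed directly
p_at_least_alt <- pbinom(6, n, p, lower.tail = FALSE)

print(c(exact = p_exact, at_most = p_at_most, at_least = p_at_least))

# A mass value is the difference of two neighbouring cumulative values
print(all.equal(p_exact, pbinom(7, n, p) - pbinom(6, n, p)))
```

Using lower.tail = FALSE instead of subtracting from 1 is usually preferable for very small tail probabilities, because it avoids cancellation error.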

7. Seven unbiased coins are tossed and the number of heads is noted. The experiment is repeated
128 times and the following results are obtained. Fit a binomial distribution and obtain the
expected frequencies.
No. of heads (x): 0 1 2  3  4  5  6 7
Frequency:        7 6 17 35 30 23 7 3

> x=c(0,1,2,3,4,5,6,7)
> f=c(7,6,17,35,30,23,7,3)
> fx=f*x
> fx
[1] 0 6 34 105 120 115 42 21
> sum(fx)
[1] 443
> sum(f)
[1] 128
> mean=(sum(fx)/sum(f));mean
[1] 3.460938
> prob=mean/7;prob
[1] 0.4944196
> p=dbinom(x,7,prob,log=FALSE);p
[1] 0.008443672 0.057800941
[3] 0.169574947 0.276385951
[5] 0.270284716 0.158590899
[7] 0.051696666 0.007222208
> exp_freq=(p*sum(f));exp_freq
[1] 1.0807900 7.3985205
[3] 21.7055932 35.3774017
[5] 34.5964436 20.2996351


[7] 6.6171732 0.9244427


> expected_frequency=round(exp_freq);expected_frequency
[1] 1 7 22 35 35 20 7 1
> y=cbind(p,expected_frequency);y
p expected_frequency
[1,] 0.008443672 1
[2,] 0.057800941 7
[3,] 0.169574947 22
[4,] 0.276385951 35
[5,] 0.270284716 35
[6,] 0.158590899 20
[7,] 0.051696666 7
[8,] 0.007222208 1
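
The fitting steps used above (estimate p from the sample mean, evaluate the binomial probabilities, scale by the total frequency) repeat in the next problem, so they can be wrapped in a small function. This is a sketch only; the function name fit_binomial and its column names are hypothetical.

```r
# Hypothetical helper: fit a binomial distribution to a frequency table
#   x    : observed values 0, 1, ..., size
#   f    : observed frequency of each value
#   size : number of trials per experiment (number of coins here)
fit_binomial <- function(x, f, size) {
  p_hat <- sum(f * x) / (sum(f) * size)   # sample mean divided by size
  prob  <- dbinom(x, size, p_hat)         # fitted probabilities
  data.frame(x = x, observed = f, prob = prob,
             expected = prob * sum(f))    # expected frequencies
}

# Data of problem 7: seven coins tossed 128 times
x <- 0:7
f <- c(7, 6, 17, 35, 30, 23, 7, 3)
fit <- fit_binomial(x, f, size = 7)
print(round(fit, 4))
print(round(fit$expected))
```

Keeping the unrounded expected frequencies in the data frame and rounding only for display avoids carrying rounding error into any later calculation.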

8. A set of six similar coins is tossed 640 times and the following results are obtained
No. of heads (x): 0 1  2   3   4   5  6
Frequency:        7 64 140 210 130 75 12
Fit a binomial distribution assuming that the nature of the coin is unknown
> x=c(0,1,2,3,4,5,6)
> f=c(7,64,140,210,130,75,12)
> fx=f*x;fx
[1] 0 64 280 630 520 375 72
> mean=(sum(fx)/sum(f));mean
[1] 3.04232
> prob=mean/6;prob
[1] 0.5070533
> p=dbinom(x,6,prob,log=FALSE);p
[1] 0.01434828 0.08855329
[3] 0.22771852 0.31231348
[5] 0.24093818 0.09913323
[7] 0.01699502
> exp_freq=(p*sum(f));exp_freq
[1] 9.154202 56.497000
[3] 145.284417 199.255999
[5] 153.718559 63.247000
[7] 10.842822
> expected_frequency=round(exp_freq);expected_frequency
[1] 9 56 145 199 154 63 11
> y=cbind(p,expected_frequency);y
p expected_frequency
[1,] 0.01434828 9
[2,] 0.08855329 56
[3,] 0.22771852 145
[4,] 0.31231348 199
[5,] 0.24093818 154
[6,] 0.09913323 63
[7,] 0.01699502 11


EXERCISE-16

1. A hospital switchboard receives an average of 4 emergency calls in a 10-minute interval.
What is the probability that there are
a) at most 2 emergency calls in a 10-minute interval
b) exactly 3 emergency calls in a 10-minute interval
c) at least 4 calls in a 10-minute interval

>ppois(2,4,lower.tail = TRUE,log.p = FALSE)


[1] 0.2381033
>dpois(3,4,log=F)
[1] 0.1953668
>1-ppois(3,4,lower.tail = TRUE,log.p = FALSE)
[1] 0.5665299

2.Assuming that the chance of a traffic accident in a City of Delhi is 0.001 on how many days
out of 1000 days can we expect no accidents and more than 3 accidents.
> lambda=1
> prob_no_accidents=dpois(0,lambda);prob_no_accidents
[1] 0.3678794
> x=prob_no_accidents*1000;x
[1] 367.8794
> prob_more_than_3_accidents=1-ppois(3,lambda);prob_more_than_3_accidents
[1] 0.01898816
> x=prob_more_than_3_accidents*1000;x
[1] 18.98816

3. Fit a Poisson distribution to the following data w.r.t. the number of R.B.C.s per cell
No. of R.B.C.: 0   1   2  3  4
No. of cells:  142 156 69 27 50
> x=0:4
> f=c(142,156,69,27,50)
> fx=f*x;fx


[1] 0 156 138 81 200


> mean=(sum(fx)/sum(f));mean
[1] 1.295045
> p=dpois(x,mean);p
[1] 0.27388553 0.35469409
[3] 0.22967241 0.09914537
[5] 0.03209943
> exp_freq=p*sum(f);exp_freq
[1] 121.60517 157.48418 101.97455
[4] 44.02055 14.25215
> expected_frequency=round(exp_freq);expected_frequency
[1] 122 157 102 44 14
> y=cbind(p,expected_frequency);y
p expected_frequency
[1,] 0.27388553 122
[2,] 0.35469409 157
[3,] 0.22967241 102
[4,] 0.09914537 44
[5,] 0.03209943 14

4.If the number of mistakes made by a typist follows a Poisson distribution with mean 3, what is
the chance that he/she
i) makes 2 mistakes, ii) makes atleast 2 mistakes
>dpois(2,3,log=F)
[1] 0.2240418
>1-ppois(1,3,lower.tail = T,log.p = F)
[1] 0.8008517
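
"At least k" and "more than k" are easy to confuse with a discrete distribution: P(X >= k) = 1 - P(X <= k - 1), while P(X > k) = 1 - P(X <= k). A minimal sketch with two small wrappers around ppois(); the function names at_least and more_than are hypothetical.

```r
# Hypothetical wrappers around ppois() for upper-tail Poisson probabilities
at_least  <- function(k, lambda) ppois(k - 1, lambda, lower.tail = FALSE)  # P(X >= k)
more_than <- function(k, lambda) ppois(k, lambda, lower.tail = FALSE)      # P(X > k)

# Typist with mean 3 mistakes
p_at_least_2  <- at_least(2, 3)    # at least 2 mistakes
p_more_than_2 <- more_than(2, 3)   # more than 2 mistakes

print(c(at_least_2 = p_at_least_2, more_than_2 = p_more_than_2))

# The two differ by exactly P(X = 2)
print(all.equal(p_at_least_2 - p_more_than_2, dpois(2, 3)))
```

With mean 3, the two probabilities are about 0.80 and 0.58, so picking the wrong cut-off changes the answer substantially.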

5. The number of accidents occurring in a factory in a year is a Poisson variate with mean 5. Find
the probability that
i) more than 2 accidents take place
ii) more than 4 accidents occur in 1 year
>1-ppois(2,5,lower.tail = T,log.p=F)
[1] 0.875348
>1-ppois(4,5,lower.tail = T,log.p=F)
[1] 0.5595067


6.A receptionist at an office receives on an average 3 telephone calls between 10 a.m. and 10.05
a.m. Find the probability that on a particular day
i) she does not receive any call
ii) she receives atleast 2 calls

>ppois(0,3,lower.tail = T,log.p = F)
[1] 0.04978707
>1-ppois(1,3,lower.tail = T,log.p=F)
[1] 0.8008517
7. At 10.00 a.m. there is a city bus service. The number of passengers getting in at the 1st stop is a
Poisson variate with parameter 6. What is the probability that on a particular day none of them
gets on the bus at that stop? On how many days of a year would you expect this to happen?
> lambda=6
> prob_no_passengers=dpois(0,lambda);prob_no_passengers
[1] 0.002478752
> day_in_year=365
> x=prob_no_passengers*day_in_year;x
[1] 0.9047445

8.On an average 3 street lights of a municipality fails every day. Find the standard deviation of
number of failure per day and probability that atleast one light fails per day.
>sqrt(3)
[1] 1.732051
>1-ppois(0,3,lower.tail = T,log.p = F)
[1] 0.9502129

9.On an average 1% of the pins are defective. If the box contains 300 pins, find the probability
that the box has
i) atleast 1 defective pin
ii) more than 3 defective pins
>1-ppois(0,3,lower.tail = T,log.p = F)
[1] 0.9502129


>1-ppois(3,3,lower.tail = T,log.p = F)
[1] 0.3527681
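
The solution above treats the number of defective pins as Poisson with mean 300 * 0.01 = 3, which approximates a binomial count with n = 300 and p = 0.01. As a rough check, and assuming each pin is defective independently of the others, the exact binomial values can be compared with the Poisson ones (the variable names here are illustrative only):

```r
n <- 300          # pins in the box
p <- 0.01         # proportion of defective pins
lambda <- n * p   # mean used for the Poisson approximation

# Exact binomial probabilities (assumes independent pins)
exact_at_least_1  <- 1 - pbinom(0, n, p)
exact_more_than_3 <- 1 - pbinom(3, n, p)

# Poisson approximations, as used in the solution
approx_at_least_1  <- 1 - ppois(0, lambda)
approx_more_than_3 <- 1 - ppois(3, lambda)

# Side-by-side comparison
rbind(binomial = c(exact_at_least_1, exact_more_than_3),
      poisson  = c(approx_at_least_1, approx_more_than_3))
```

With n large and p small the two rows should agree closely, which is why the Poisson form is acceptable here.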

10.On average 1 in every 50 valves manufactured by a firm is substandard. If the valves are
supplied in packets of 20 each
i) Find the probability that a packet will contain at least 1 substandard valve
ii) In how many of a lot of 1000 packets would you expect substandard valves?
>lambda=20*(1/50)
> prob_atleast_1_substandard=1-dpois(0,lambda);prob_atleast_1_substandard
[1] 0.32968
> total_packets=1000
> x=total_packets*prob_atleast_1_substandard;x
[1] 329.68

11.Using the following data fit a Poisson distribution and find the expected frequencies
No. of printing mistakes   0   1   2   3   4   5
No. of days               42  33  14   6   4   1
> x=c(0,1,2,3,4,5)
> f=c(42,33,14,6,4,1)
> fx=f*x;fx
[1] 0 33 28 18 16 5
> sum(f)
[1] 100
> mean=(sum(fx)/sum(f));mean
[1] 1
> p=dpois(x,mean);p
[1] 0.367879441 0.367879441
[3] 0.183939721 0.061313240
[5] 0.015328310 0.003065662
> exp_freq=p*sum(f);exp_freq
[1] 36.7879441 36.7879441
[3] 18.3939721 6.1313240
[5] 1.5328310 0.3065662
> expected_frequency=round(exp_freq);expected_frequency
[1] 37 37 18 6 2 0
> y=cbind(p,expected_frequency);y
               p expected_frequency
[1,] 0.367879441                 37
[2,] 0.367879441                 37
[3,] 0.183939721                 18
[4,] 0.061313240                  6
[5,] 0.015328310                  2
[6,] 0.003065662                  0

12.The following is the distribution of daily sales of television sets in a shop. Fit a
Poisson distribution and hence find the theoretical frequencies.
No. of sets sold   0   1   2   3   4   5   6
No. of days       18  43  45  28  12   5   0
> x=0:6
> f=c(18,43,45,28,12,5,0)
> fx=f*x;fx
[1] 0 43 90 84 48 25 0
> sum(f)
[1] 151
> mean=(sum(fx)/sum(f));mean
[1] 1.92053
> p=dpois(x,mean);p
[1] 0.14652931 0.28141391
[3] 0.27023190 0.17299614
[5] 0.08306106 0.03190425
[7] 0.01021218
> exp_freq=p*sum(f);exp_freq
[1] 22.125926 42.493500 40.805016
[4] 26.122417 12.542220 4.817541
[7] 1.542039
> expected_frequency=round(exp_freq);expected_frequency
[1] 22 42 41 26 13 5 2
> y=cbind(p,expected_frequency);y
              p expected_frequency
[1,] 0.14652931                 22
[2,] 0.28141391                 42
[3,] 0.27023190                 41
[4,] 0.17299614                 26
[5,] 0.08306106                 13
[6,] 0.03190425                  5
[7,] 0.01021218                  2
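
The fitting steps used in questions 11 and 12 (estimate the mean from the frequency table, compute Poisson probabilities, scale by the total frequency) can be collected into one function. This is only a sketch; the name fit_poisson and the returned column names are assumptions made for illustration.

```r
# Hypothetical helper: fit a Poisson distribution to a frequency table
# x = observed values, f = their frequencies
fit_poisson <- function(x, f) {
  lambda <- sum(f * x) / sum(f)   # sample mean, used as the Poisson parameter
  p <- dpois(x, lambda)           # theoretical probability of each value of x
  data.frame(x = x,
             observed = f,
             p = p,
             expected = round(p * sum(f)))
}

# Data from question 11 (printing mistakes per day)
fit_poisson(0:5, c(42, 33, 14, 6, 4, 1))
```

The probabilities need not add up to exactly 1, since values of x beyond the largest observed value are ignored, so the rounded expected frequencies may not sum to the observed total.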

EXERCISE-17

1.Given a normal distribution with mean = 50 and standard deviation = 8. Find the probability
that X assumes a value between 34 and 62.
> mean=50
> sd=8
> x1=34
> x2=62
> z1=(x1-mean)/sd;z1
[1] -2
> z2=(x2-mean)/sd;z2
[1] 1.5
> p1=pnorm(z1);p1
[1] 0.02275013
> p2=pnorm(z2);p2
[1] 0.9331928
> probability=p2-p1;probability
[1] 0.9104427
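
The standardisation step above is optional in R, since pnorm accepts the mean and standard deviation directly. The sketch below repeats question 1 both ways; the names mu and sigma are chosen here (an assumption, not from the original) so that the built-in functions mean() and sd() are not shadowed.

```r
mu <- 50      # mean of the distribution
sigma <- 8    # standard deviation

# Direct route: pnorm(q, mean, sd) works on the original scale
direct <- pnorm(62, mu, sigma) - pnorm(34, mu, sigma)

# Standardised route: convert to z-scores first, as in the solution
standardised <- pnorm((62 - mu) / sigma) - pnorm((34 - mu) / sigma)

direct
standardised
```

Both routes give the same probability; in either case the value at the lower limit is subtracted from the value at the upper limit.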

2.For a normal distribution with mean = 200 and S.D. = 25, find the probability that X assumes a
value between 200 and 260. Find the probability that X is greater than 240.
> mean=200;
> sd=25
> x1=200
> x2=260
> z1=(x1-mean)/sd;z1
[1] 0
> z2=(x2-mean)/sd;z2
[1] 2.4
> p1=pnorm(z1);p1
[1] 0.5
> p2=pnorm(z2);p2
[1] 0.9918025
> probability=p2-p1;probability
[1] 0.4918025

ii)
> x=240
> z=(x-mean)/sd;z
[1] 1.6
> p=pnorm(z);p
[1] 0.9452007
> probability=1-p;probability
[1] 0.05479929

3.Given a normal distribution with mean = 50 and S.D. = 13, find the value of X that has a)
13% of the area to its left; b) 14% of the area to its right.
a)
> mean=50
> sd=13
> pleft=0.13
> xleft=qnorm(pleft,mean,sd,lower.tail = TRUE);xleft
[1] 35.35692

b)
> mean=50
> sd=13
> pright=0.14
> xright=qnorm(pright,mean,sd,lower.tail=FALSE);
> xright
[1] 64.04415
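
For an area to the right, qnorm can be called either with lower.tail = FALSE or with the complementary probability. A short sketch of the equivalence, followed by a check that feeds the result back into pnorm (the names mu, sigma and p_right are illustrative only):

```r
mu <- 50
sigma <- 13
p_right <- 0.14   # area required to the right of X

# Two equivalent ways of asking for the upper-tail quantile
x_upper  <- qnorm(p_right, mu, sigma, lower.tail = FALSE)
x_upper2 <- qnorm(1 - p_right, mu, sigma)

# Check: the area to the right of x_upper should come back as p_right
area_right <- pnorm(x_upper, mu, sigma, lower.tail = FALSE)

x_upper
x_upper2
area_right
```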

4.The accounts of a certain departmental store have an average balance of Rs. 120/- and S.D. = Rs.
40/-. Assuming that the account balances are normally distributed, a) what proportion of
accounts is over Rs. 150/-; b) what proportion is between Rs. 100 and Rs. 150; c) between Rs. 60 and Rs. 90?
a)
> mean=120
> sd=40
> x=150
> z=(x-mean)/sd;z
[1] 0.75
> p=pnorm(z);p
[1] 0.7733726
> probability=1-p;probability
[1] 0.2266274

b)
> x1=100
> x2=150

> z1=(x1-mean)/sd;z1
[1] -0.5
> z2=(x2-mean)/sd;z2
[1] 0.75
> p1=pnorm(z1);p1
[1] 0.3085375
> p2=pnorm(z2);p2
[1] 0.7733726
> probability=p2-p1;probability
[1] 0.4648351

c)
> x1=60
> x2=90
> z1=(x1-mean)/sd;z1
[1] -1.5
> z2=(x2-mean)/sd;z2
[1] -0.75
> p1=pnorm(z1);p1
[1] 0.0668072
> p2=pnorm(z2);p2
[1] 0.2266274
> probability=p2-p1;probability
[1] 0.1598202

5.The distribution of monthly income of 3000 workers of a factory follows normal law with
mean = 900 and S.D. = 100. Find
a) percentage of workers with income greater than Rs. 800
b) percentage of workers having an income less than Rs. 600.
a)
> mean=900
> sd=100
> x=800
> z=(x-mean)/sd;z
[1] -1
> p=pnorm(z);p
[1] 0.1586553
> probability=1-p;probability
[1] 0.8413447
b)
> x=600
> z=(x-mean)/sd;z
[1] -3
> p=pnorm(z);p
[1] 0.001349898
> probability=p;probability
[1] 0.001349898
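
The values above are probabilities, whereas the question asks for percentages. A minimal sketch of the conversion for part a), together with the corresponding expected number of workers out of 3000 (the variable names are assumptions made for illustration):

```r
mu <- 900          # mean monthly income
sigma <- 100       # standard deviation
n_workers <- 3000  # total number of workers

# Probability that income exceeds Rs. 800
p_above_800 <- pnorm(800, mu, sigma, lower.tail = FALSE)

pct_above_800   <- 100 * p_above_800        # as a percentage
count_above_800 <- n_workers * p_above_800  # expected number of workers

pct_above_800
count_above_800
```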

6.1200 students took an exam. The mean marks is 53% and S.D. = 15%. Assume normal
distribution of marks.
a) if 50% marks are required for passing, find how many students are expected to
score greater than 50%
b) if only 40% of students are required to be promoted what are the marks for
promotion.
a)
> mean=0.53
> sd=0.15
> x=0.5
> z=(x-mean)/sd;z

[1] -0.2
> p=pnorm(z);p
[1] 0.4207403
> probability=1-p;probability
[1] 0.5792597
> students=1200*probability;students
[1] 695.1117

b)
> x=0.4
> marks=qnorm(1-x,0.53,0.15);marks
[1] 0.5680021
