CSE 336 (Full 25) - Mid


Register Number:

Name:
Branch & Section:
SRM UNIVERSITY – AP, ANDHRA PRADESH
Mid Term Examinations - March 2023.

Degree        : B.Tech              Max Marks  : 25 marks
Branch        : CSE
Subject Code  : CSE 336             Duration   : 60 min
Subject Title : Machine Learning    QP Set No. :

Descriptive Questions
Answer ANY TWO of the following (2 x 12.5 = 25 Marks)

1. Consider the following table that represents the value of Indian currency (INR) with
respect to USD. (5+3+2+1.5)

Year    INR value per USD
1995    33
2000    45
2005    44
2010    45
2015    65

a. Design a linear regression model to find the value of Indian currency and
calculate the β0 and β1 values.
b. Using the regression model calculate the R-square value for the training
dataset.
c. Given that the value of INR was 74 in 2020, recalculate the R-square
using your previous regression model.
d. Further, predict the value of INR in 2024.
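
One way to work parts a to d can be sketched in Python. This is a minimal sketch, assuming ordinary least squares on the raw year values. The names fit_line and r_squared are illustrative and not part of the question paper. Part c is read here as scoring the unchanged model on the five training points plus the 2020 point, which is only one possible interpretation of the question.

```python
# Training data from the table in Question 1.
years = [1995, 2000, 2005, 2010, 2015]
inr = [33, 45, 44, 45, 65]


def fit_line(xs, ys):
    """Ordinary least squares fit of y = beta0 + beta1 * x."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    beta1 = sxy / sxx
    beta0 = y_mean - beta1 * x_mean
    return beta0, beta1


def r_squared(xs, ys, beta0, beta1):
    """R-square = 1 - SSE / SST for a fixed line on the given points."""
    y_mean = sum(ys) / len(ys)
    sse = sum((y - (beta0 + beta1 * x)) ** 2 for x, y in zip(xs, ys))
    sst = sum((y - y_mean) ** 2 for y in ys)
    return 1 - sse / sst


# Part a: coefficients of the fitted line.
beta0, beta1 = fit_line(years, inr)

# Part b: R-square on the training data.
r2_train = r_squared(years, inr, beta0, beta1)

# Part c (assumed reading): same model, scored with the 2020 point added.
r2_with_2020 = r_squared(years + [2020], inr + [74], beta0, beta1)

# Part d: prediction for 2024.
pred_2024 = beta0 + beta1 * 2024

print(beta0, beta1, r2_train, r2_with_2020, pred_2024)
```

Under these assumptions the sketch gives β1 = 1.28 and β0 ≈ -2520, an R-square of about 0.765 on the training data and about 0.832 with the 2020 point included, and a predicted value of about 70.72 for 2024.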

2. a) We have data from a questionnaire survey (to ask people's opinions) and
objective testing with two attributes (acid durability and strength) to classify whether a
special paper tissue is good or not. Here are four training samples:

X1 = Acid Durability (seconds)    X2 = Strength (kg/square meter)    Y = Classification
7                                 7                                  Bad
7                                 4                                  Bad
3                                 4                                  Good
1                                 4                                  Good

Now the factory produces a new paper tissue that passes the laboratory test with X1 = 3
and X2 = 7. Without another expensive survey, can we guess what the classification
of this new tissue is? Use K = 3 and Euclidean distance to calculate the
distance from the training samples. (8 marks)
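
The K-nearest-neighbour procedure asked for here can be sketched as follows. This is a minimal sketch, assuming a simple majority vote among the K closest training samples; the function name knn_classify is illustrative and not part of the question paper.

```python
import math
from collections import Counter

# Training samples from the table: ((X1, X2), label).
samples = [
    ((7, 7), "Bad"),
    ((7, 4), "Bad"),
    ((3, 4), "Good"),
    ((1, 4), "Good"),
]


def euclidean(p, q):
    """Euclidean distance between two 2-D points."""
    return math.sqrt((p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2)


def knn_classify(query, training, k):
    """Return the majority label among the k nearest training samples."""
    by_distance = sorted(training, key=lambda s: euclidean(query, s[0]))
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# New tissue with X1 = 3 and X2 = 7, classified with K = 3.
distances = [euclidean((3, 7), point) for point, _ in samples]
prediction = knn_classify((3, 7), samples, k=3)
print(distances, prediction)
```

The four distances are 4, 5, 3 and √13 ≈ 3.61, so the three nearest neighbours are (3, 4) Good, (1, 4) Good and (7, 7) Bad, and the majority vote under this sketch is Good.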

b) A data set of cancer diagnoses contains details of 100000 patients with various
features, labelled as cancer detected (positive) or not detected (negative). Cancer is
detected in 10% of the total diagnoses. Your machine learning model flags 15000 patients
in the dataset as cancer patients, with a recall (TP rate) of 80%. Draw the confusion
matrix and calculate the accuracy, error, precision and FP rate of your model. (4.5
marks)
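
The confusion-matrix arithmetic can be checked with a short Python sketch. This is a minimal sketch, assuming the standard definitions recall = TP / (TP + FN), precision = TP / (TP + FP) and FP rate = FP / (FP + TN); the variable names are illustrative.

```python
# Figures stated in the question.
total = 100000
actual_pos = round(0.10 * total)   # 10% of diagnoses are positive
actual_neg = total - actual_pos
predicted_pos = 15000              # patients flagged by the model
recall = 0.80                      # TP rate

# Fill in the four cells of the confusion matrix.
tp = round(recall * actual_pos)    # true positives
fn = actual_pos - tp               # false negatives
fp = predicted_pos - tp            # false positives
tn = actual_neg - fp               # true negatives

# Metrics derived from the cells.
accuracy = (tp + tn) / total
error = 1 - accuracy
precision = tp / (tp + fp)
fp_rate = fp / (fp + tn)

print("              Pred. positive  Pred. negative")
print(f"Actual pos.   {tp:>14}  {fn:>14}")
print(f"Actual neg.   {fp:>14}  {tn:>14}")
print(accuracy, error, precision, fp_rate)
```

With these definitions the cells are TP = 8000, FN = 2000, FP = 7000 and TN = 83000, giving an accuracy of 0.91, an error of 0.09, a precision of about 0.533 and an FP rate of about 0.078.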

3. For the given dataset, calculate the information gain and design the decision tree.
Conclude which attribute in the dataset is irrelevant. (11+1.5)
