
Last Name: First Name: Student ID:

AIDI 1002: Machine Learning Programming — Final Exam Fall 2023


Due Date: December 15, 2023, 1:00 PM - 3:00 PM

Note: Submit two files in the submission folder: first, your Colab notebook including your code and outputs, and second, the PDF of the Colab notebook. Use the following naming convention for both files.
(File name: Lastname_Firstname_FinalExam.pdf / .ipynb)

1. (30 Points) Increasing Training Set Size Experiment: Consider the iris dataset for multiclass classification and perform
the following steps.

1. Divide the data into 80% training and 20% testing.

2. From the training set, take only 5% of the data, train the supervised learning models (Logistic Regression, Decision Trees, Random Forest, and Naive Bayes), and test them on the test set created in the previous step.

3. Repeat the training, adding 5% more training data each time, until the whole training set is used.

4. In every round of training, test your models on the 20% test set and store the accuracy and F1-score of each model.

5. Plot accuracy and F1-score against the training-set size, following the sample graph provided below (a code sketch of the full experiment follows this list).
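A minimal sketch of this experiment, assuming scikit-learn and matplotlib are available (as in Colab); the random_state values, max_iter setting, and macro-averaged F1 are illustrative assumptions rather than requirements of the question.

import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score, f1_score

X, y = load_iris(return_X_y=True)

# Step 1: 80% training / 20% testing split.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y)

models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Decision Tree": DecisionTreeClassifier(random_state=42),
    "Random Forest": RandomForestClassifier(random_state=42),
    "Naive Bayes": GaussianNB(),
}

fractions = np.linspace(0.05, 1.0, 20)  # 5%, 10%, ..., 100% of the training set
results = {name: {"accuracy": [], "f1": []} for name in models}

for frac in fractions:
    # Steps 2-3: take a stratified subset of the training data so every
    # class is represented even in the smallest 5% subset.
    if frac < 1.0:
        X_sub, _, y_sub, _ = train_test_split(
            X_train, y_train, train_size=float(frac),
            random_state=42, stratify=y_train)
    else:
        X_sub, y_sub = X_train, y_train
    # Step 4: train each model and evaluate on the fixed 20% test set.
    for name, model in models.items():
        model.fit(X_sub, y_sub)
        y_pred = model.predict(X_test)
        results[name]["accuracy"].append(accuracy_score(y_test, y_pred))
        results[name]["f1"].append(f1_score(y_test, y_pred, average="macro"))

# Step 5: plot each metric against the fraction of training data used.
for metric, ylabel in [("accuracy", "Accuracy"), ("f1", "Macro F1-score")]:
    for name in models:
        plt.plot(fractions * 100, results[name][metric], marker="o", label=name)
    plt.xlabel("Training data used (%)")
    plt.ylabel(ylabel)
    plt.legend()
    plt.show()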

2. (30 Points) Binary Classification with Discriminant: Consider the following 15 data points with two features, i.e., X and Y, and their associated classes:

X = [5, 1, 9, 6, 5, 6, 1, 9, 10, 11, 8, 7, 13, 8, 19]

Y = [14, 16, 17, 10, 9, 17, 15, 3, 3, 1, 4, 5, 1, 3, 15]

C = [c1, c1, c1, c1, c1, c1, c1, c2, c2, c2, c2, c2, c2, c2, c2].

Note that these data points are ordered so that (x1, y1, label1) = (5, 14, c1) and (x15, y15, label15) = (19, 15, c2).

A researcher defined a discriminant function for binary classification as g(x, y) = −x + 2y + xy, where x ∈ X and y ∈ Y. Accordingly, the classes are selected as follows:


c1 if g(x, y) ≥ 35,
c2 otherwise.

Report the accuracy of the predicted labels using the researcher’s discriminant function. (In your Jupyter Notebook,
show how you find these numbers and print them.)

Answer:

Number of misclassified in c1 =
Number of misclassified in c2 =
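A minimal sketch of the computation in plain Python; encoding the class labels as the strings "c1" and "c2" is an illustrative choice, not part of the question.

# The data points and labels as given in the question.
X = [5, 1, 9, 6, 5, 6, 1, 9, 10, 11, 8, 7, 13, 8, 19]
Y = [14, 16, 17, 10, 9, 17, 15, 3, 3, 1, 4, 5, 1, 3, 15]
C = ["c1"] * 7 + ["c2"] * 8

def g(x, y):
    # The researcher's discriminant: g(x, y) = -x + 2y + xy.
    return -x + 2 * y + x * y

# Predict c1 when g(x, y) >= 35, and c2 otherwise.
predicted = ["c1" if g(x, y) >= 35 else "c2" for x, y in zip(X, Y)]

accuracy = sum(p == t for p, t in zip(predicted, C)) / len(C)
mis_c1 = sum(1 for p, t in zip(predicted, C) if t == "c1" and p != t)
mis_c2 = sum(1 for p, t in zip(predicted, C) if t == "c2" and p != t)

print("Accuracy:", accuracy)
print("Misclassified in c1:", mis_c1)
print("Misclassified in c2:", mis_c2)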

3. (40 Points) K-Means Clustering: Consider the 30 data points and their corresponding class labels stored in a dictionary named “data_dict”.

data_dict = {(2.0, 3.43, 4.37): 2, (2.49, 4.28, 4.83): 2, (2.58, 4.36, 4.48): 2, (2.66, 4.45, 5.95): 2,
             (2.82, 3.66, 4.51): 2, (3.03, 4.37, 5.07): 2, (3.27, 4.54, 4.57): 2, (3.41, 3.94, 5.35): 2,
             (3.53, 4.32, 5.41): 2, (3.53, 4.6, 6.8): 1, (3.61, 4.25, 5.21): 1, (3.61, 4.78, 5.47): 1,
             (3.72, 5.44, 5.88): 1, (3.87, 4.96, 4.52): 2, (4.13, 5.29, 6.6): 1, (4.25, 5.97, 5.48): 1,
             (4.61, 4.9, 5.11): 1, (4.73, 4.4, 6.78): 1, (4.97, 4.25, 5.0): 1, (4.98, 5.27, 6.79): 1,
             (5.08, 3.51, 4.69): 3, (5.15, 3.58, 4.2): 3, (5.67, 2.27, 4.65): 3, (5.67, 3.81, 5.75): 3,
             (5.94, 2.34, 4.12): 3, (6.06, 3.16, 4.36): 3, (6.09, 3.19, 4.02): 3, (6.43, 3.42, 4.18): 3,
             (6.56, 2.7, 4.03): 3, (6.79, 3.46, 4.81): 3}

For instance, the first point has coordinates (x1, x2, x3) = (2.0, 3.43, 4.37) and belongs to class 2. In total we have three classes: 1, 2, and 3.

As a discriminant function, consider a distance function based on the center coordinates below (encoded as a dictionary of values) for each class label.

centers_dict = {}
centers_dict[(4, 5, 6)] = 1  # center coordinates for class 1, i.e., x1=4, x2=5, x3=6
centers_dict[(3, 4, 5)] = 2  # center coordinates for class 2, i.e., x1=3, x2=4, x3=5
centers_dict[(6, 3, 5)] = 3  # center coordinates for class 3, i.e., x1=6, x2=3, x3=5

Note that a discriminant function based on cosine distance can be written as follows: the cosine distance between points a = [a1, ..., an] and b = [b1, ..., bn] is

d(a, b) = 1 − (a · b) / (∥a∥ ∥b∥), where a · b = Σ_{i=1}^{n} a_i b_i and ∥a∥ = √(Σ_{i=1}^{n} a_i^2).

Based on the above discriminant function, perform a K-Means clustering task over the 30 points in data_dict and then compare the result with the true labels. Print the number of correctly classified instances in your answer (a code sketch follows below).
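One reasonable reading of the task, sketched below: the given centers serve as initial cluster centers, points are assigned to the nearest center under cosine distance, and each center is then updated to the mean of its assigned points (a k-means-style loop). Because each center in centers_dict already carries its class label, the predicted cluster labels can be compared directly with the true labels; data_dict and centers_dict refer to the dictionaries defined above.

import numpy as np

def cosine_distance(a, b):
    # d(a, b) = 1 - (a . b) / (||a|| ||b||)
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return 1.0 - np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

points = list(data_dict.keys())
true_labels = list(data_dict.values())

# Initialize each cluster center from centers_dict; the dictionary value
# is the class label, so clusters are already aligned with true classes.
centers = {label: np.array(c, dtype=float) for c, label in centers_dict.items()}

for _ in range(10):  # a handful of iterations suffices for 30 points
    # Assignment step: each point joins the cluster of its nearest center.
    assigned = {label: [] for label in centers}
    for p in points:
        nearest = min(centers, key=lambda lab: cosine_distance(p, centers[lab]))
        assigned[nearest].append(p)
    # Update step: each center moves to the mean of its assigned points.
    for label, pts in assigned.items():
        if pts:  # keep the previous center if a cluster ends up empty
            centers[label] = np.mean(pts, axis=0)

# Final assignment, compared against the true labels.
predictions = [min(centers, key=lambda lab: cosine_distance(p, centers[lab]))
               for p in points]
correct = sum(pred == true for pred, true in zip(predictions, true_labels))
print("Correctly classified instances:", correct, "/", len(points))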
