2023-24 ML End-Semester Make-Up QP Answer-Keys


QP Pattern:

- Instance based learning: 4 - 8 marks
- SVM: 4 - 8 marks
- Bayesian Learning: 4 - 8 marks
- Ensemble Learning: 4 - 8 marks
- Unsupervised Learning: 4 - 8 marks
- Model Evaluation and Comparison: 4 - 8 marks
- Pre-midterm topics: 5 - 10 marks

--------------------------------------------------------------------------------------------------------------------------
Course No. : DSECLZG565/ AIMLCLZ565
Course Title : Machine Learning
Nature of Exam : Open Book
Weightage : 40%
Duration : 2 Hours
Date of Exam :
Note:
1. Please follow all the Instructions to Candidates given on the cover page of the answer book.
2. All parts of a question should be answered consecutively. Each answer should start from a fresh
page.
3. Assumptions made if any, should be stated clearly at the beginning of your answer.

Question 1:
Consider the following dataset for text classification, where three training instances are given with their corresponding classifications into the '+' or '-' category: [5]

    Hindi India India          +
    India Kannada Hindi        +
    Chinese Hindi India        -

Showing all intermediate calculations, find the appropriate classification for the test instance "Chinese Kannada Chinese" using the Multinomial NB text classification approach.
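No worked answer for this question appears in this extract. As a hedged sketch, assuming the standard Multinomial NB recipe with add-one (Laplace) smoothing over the 4-word vocabulary {Hindi, India, Kannada, Chinese}:

    from collections import Counter

    # Training documents and labels from the question.
    docs = [("Hindi India India", "+"),
            ("India Kannada Hindi", "+"),
            ("Chinese Hindi India", "-")]
    test = "Chinese Kannada Chinese".split()

    vocab = {w for d, _ in docs for w in d.split()}      # |V| = 4
    for c in ("+", "-"):
        tokens = [w for d, y in docs if y == c for w in d.split()]
        counts, n = Counter(tokens), len(tokens)
        prior = sum(y == c for _, y in docs) / len(docs)
        score = prior
        for w in test:
            score *= (counts[w] + 1) / (n + len(vocab))  # add-one smoothing
        print(c, score)

    # '+': (2/3) * (1/10)^2 * (2/10) ~ 0.00133
    # '-': (1/3) * (2/7)^2 * (1/7)  ~ 0.00389  ->  classify as '-'

Under that smoothing convention the test instance is classified as '-'; a different smoothing choice would change the numbers.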
Question 2:
You have been tasked to create a discriminative model using a linear Support Vector Machine
method. Consider the below training dataset for training the model, where X1, and X2 are
independent features and Y is the target variable. [4]

X1 X2 Y
3 2 Positive
5 3 Positive
-2 -2 Negative
4 4 Positive
3 -1 Positive
1 0 Negative
-1 -1 Negative
0 2 Negative
1 4 Negative
-1 2 Negative
4 5 Positive

Answer the following questions:
A. Find the support vectors. [1]
B. Determine the equation of the hyperplane. [3]
Solution:
A. The support vectors are (3,2), (1,0) and (3,-1). [1M; if any of the SVs is wrong, 0M]
B. The solution obtained using the Lagrange method is equally acceptable.
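The hyperplane equation itself does not survive in this extract. As a hedged reconstruction: solving the Lagrangian system sum_j alpha_j (s_j . s_i) = y_i over the three listed support vectors, each augmented with a bias component of 1, gives w = (1, 0) and b = -2, i.e. the hyperplane x1 = 2. A minimal numpy sketch of that computation:

    import numpy as np

    # Support vectors from part A, augmented with a bias feature of 1.
    S = np.array([[1.0,  0.0, 1.0],   # (1, 0),  label -1
                  [3.0,  2.0, 1.0],   # (3, 2),  label +1
                  [3.0, -1.0, 1.0]])  # (3, -1), label +1
    y = np.array([-1.0, 1.0, 1.0])

    # Solve sum_j alpha_j (s_j . s_i) = y_i for the alpha coefficients.
    alpha = np.linalg.solve(S @ S.T, y)   # -> [-3.5, 0.5, 1.0]

    # Augmented weight vector; the last entry is the bias b.
    print(alpha @ S)                      # -> [ 1.  0. -2.], i.e. x1 - 2 = 0

The margin boundaries x1 = 1 and x1 = 3 pass through the listed support vectors, which is consistent with part A.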

Question 3:
Use case: The Glasgow Coma Scale assesses patients according to three aspects of responsiveness: eye-
opening, motor, and verbal responses. Reporting each of these separately provides a clear,
communicable picture of a patient's state.
[3 + 5 = 8 Marks]

Note: Wherever applicable, use only Manhattan distance; no scaling is required. Round all calculations to 4 decimal places where needed. Use the average as the aggregation function for the final estimation unless a specific function is recommended. Show all steps. Calculation errors will also be penalized.
Predict the risk factor of a new patient with the observation below, using both of the following independent experiments on the above training data.
Query Instance: <Eye Opening = 5, Verbal Responses = 5, Motor Responses = 5>
i) Predict the risk factor using a 3-NN model.
ii) If the initial estimate is instead proposed as a locally weighted regression model, use Patient Risk = 10 - 0.1*X_VerbalResponses - 0.1*X_MotorResponses with 2-NN and the kernel K(d(x_q, x_i)) = -1 / d(x_q, x_i). Apply gradient descent for only one iteration with learning rate = 0.1, and predict the risk factor for the query instance.
-------------------------------
Answer Key:
a) Manhattan distances from the query to the training instances: 7, 5, 4, 8, 10.
Prediction with the average of the 3-NN (distances 4, 5, 7) = 5
b) Calculated only for the top 2-NN:
Y-Pred = {9.5, 9.4}
Delta gradient for W_VerbalResponses = {3, 0.35}
Delta gradient for W_MotorResponses = {4.5, 1.75}
Delta gradient for W_0 = {1.5, 0.35}
New (W_VerbalResponses, W_MotorResponses, W_0) = (0.435, 0.725, 10.185)
Prediction with the new regression equation = 4.385
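The training table for this question does not survive in this extract, so the figures above cannot be re-derived here. Purely as a hedged illustration of the mechanics, below is one kernel-weighted gradient-descent step in the textbook locally weighted regression form delta_w_j = eta * sum_i K(d(x_q, x_i)) * (y_i - y_hat_i) * x_ij; the two neighbor rows are hypothetical placeholders, not the exam's data:

    # HYPOTHETICAL 2-NN rows: ((x_verbal, x_motor), y, manhattan_distance_to_query)
    neighbors = [((4.0, 1.0), 7.0, 4.0),
                 ((3.0, 3.0), 6.0, 5.0)]
    eta = 0.1
    w = [10.0, -0.1, -0.1]            # [w0, w_verbal, w_motor] from the question

    delta = [0.0, 0.0, 0.0]
    for (xv, xm), y, d in neighbors:
        k = -1.0 / d                  # kernel K(d) = -1/d, as given
        y_hat = w[0] + w[1] * xv + w[2] * xm
        err = k * (y - y_hat)
        delta[0] += eta * err * 1.0   # bias input is 1
        delta[1] += eta * err * xv
        delta[2] += eta * err * xm

    w = [wi + di for wi, di in zip(w, delta)]
    print(w)                          # updated (w0, w_verbal, w_motor)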
Marking Scheme:
a)
2 mark: Distance calculation between test and all other instances. Order the results of the distance.
1 mark: Results of 3-NN average
b)
1 mark: Y-Pred calculation
0.5 mark: Delta gradient for W_VerbalResponses calculation
0.5 mark: Delta gradient for W_MotorResponses calculation
1 mark: Delta gradient for W_0 calculation
1 mark: New (W_VerbalResponses, W_MotorResponses, W_0)
1 mark: Prediction with the new regression equation
----------------------------------------------------------------------
Question 4:
Use case: The Glasgow Coma Scale assesses patients according to three aspects of responsiveness: eye-
opening, motor, and verbal responses. Reporting each of these separately provides a clear,
communicable picture of a patient's state. Quantified values of the attributes are discretized in the data below.
[4 + 1 + 2 = 7 Marks]

a) Use the following distance measure to cluster the given patients into three clusters using the k-modes clustering algorithm for only one iteration. Show the step-by-step working of the Expectation and Maximization steps. The centroids are marked in the given table. Assume all features are categorical in nature and use only the following distance metric for your calculation. Round off all proximity values to two decimal places.
distance(data1, data2) = 10 * (1 - Number of matching categorical attributes / Total number of categorical attributes)

Median of categorical attributes within a cluster =
    { Mode of the attribute's values, if cluster size is odd
    { t, otherwise
where t = least frequent attribute value observed in the entire training data

b) Calculate the new centroids.

c) State whether the statement below is true or false w.r.t. the given data, whose centroids are sampled with replacement. Justify your answer in your own words.
"If the number of clusters expected is equal to the number of data points/instances given for training, then the algorithm is guaranteed to converge in at most one iteration of Expectation & Maximization."
-------------------------------
Answer Key:
a) Only the three non-centroid points need to be compared against the centroids. Cluster members after the Expectation step are shown in the Assigned column.

Eye Opening   Verbal Responses   Motor Responses   Dist. C1   Dist. C2   Dist. C3   Assigned
Bad           Clear              Weak              (Centroid-1)
Bad           Unclear            Weak              3.33       6.67       10         C1
Good          Others             Weak              (Centroid-2)
Worst         Unclear            Strong            10         10         6.67       C3
Bad           Unclear            Weak              3.33       6.67       10         C1
Good          Clear              Strong            (Centroid-3)

Maximization:
New Centroid 1 = (Bad, Unclear, Weak)
Centroid 2 : No Change
New Centroid 3 = (Worst, Others, Strong)
c) FALSE. Because the centroids are sampled with replacement, the same data point may be chosen as the initial centroid of multiple clusters. Hence convergence may need more than one EM iteration in the worst case.
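The patient rows can be recovered from the answer table above; a minimal sketch of the Expectation step with the question's matching-based distance (assignments follow the minimum distance):

    def distance(a, b):
        # k-modes dissimilarity from the question: 10 * (1 - matches / total)
        matches = sum(x == y for x, y in zip(a, b))
        return round(10 * (1 - matches / len(a)), 2)

    centroids = [("Bad", "Clear", "Weak"),     # Centroid-1
                 ("Good", "Others", "Weak"),   # Centroid-2
                 ("Good", "Clear", "Strong")]  # Centroid-3
    points = [("Bad", "Unclear", "Weak"),
              ("Worst", "Unclear", "Strong"),
              ("Bad", "Unclear", "Weak")]

    for p in points:                           # Expectation step
        d = [distance(p, c) for c in centroids]
        print(p, d, "-> cluster", d.index(min(d)) + 1)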
Marking Scheme:
a)
1 mark: Distance Calculation w.r.t C1
1 mark: Distance Calculation w.r.t C2
0.5 mark: Distance Calculation w.r.t C3
0.5 mark: New Centroid using mode of member’s value for centroid 1
1 mark: New Centroid using given formula for centroid 3
b)
1 mark: No new calculations are required; all the values used in part a) have to be correctly referenced to find the new centroid values.
c)
1 mark: Answer “False”
1 mark: Right Justification
----------------------------------------------------------------------

Question 5:
In a single iteration of AdaBoost on three sample points, we initiate the process with uniform weights
assigned to the sample points. The ground truth labels and predictions are binary, taking values of either
+1 or −1. The table provided below contains some missing values. [3]

Answer the following:
a) Find the updated weight (before normalization) of instance X1. Note: there is no need to normalize the values. [1.5]
b) Identify which instances/data points were misclassified in the first iteration. Justify your answer. [1.5]

Part a) [1.5 marks for the correct updated value, otherwise 0]
Part b) [0.5 marks for the correct answer, 1 mark for the justification]
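The question's table does not survive in this extract, so the numeric answers cannot be reproduced here. For reference, a hedged sketch of the standard AdaBoost weight update with three uniformly weighted points; eps below is a hypothetical weighted error, not the exam's value:

    import math

    eps = 1 / 3                        # HYPOTHETICAL weighted error of the weak learner
    alpha = 0.5 * math.log((1 - eps) / eps)

    w = 1 / 3                          # uniform initial weight on each of 3 points
    print(w * math.exp(-alpha))        # updated (unnormalized) weight if classified correctly
    print(w * math.exp(alpha))         # updated (unnormalized) weight if misclassified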
a) Random Forest is a bagging model that incorporates feature randomness. Clarify the rationale behind introducing feature randomness in Random Forest. [2]
[2 marks for the justification]
Decision trees in a Random Forest may be highly correlated, especially when a few dominating features provide most of the information for splitting. Feature randomness is therefore introduced in Random Forest so that the individual trees are decorrelated.
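As a minimal illustration (not part of the original key), scikit-learn exposes this via the max_features parameter, which caps how many randomly chosen features each split may consider:

    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier

    X, y = make_classification(n_samples=200, n_features=20, random_state=0)
    # max_features="sqrt": each split sees only ~sqrt(20) randomly chosen features,
    # which decorrelates the trees even when a few features dominate.
    rf = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
    rf.fit(X, y)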

Question 6:

Illustrate a situation in which a lack of interpretability in a model could result in ethical issues.
Explain how interpretability could mitigate these concerns.

Solution:
A scenario where lack of model interpretability could raise ethical concerns is in automated
hiring systems. If a machine learning model is used to screen job applicants and is not
interpretable, it might unintentionally favor or discriminate against candidates based on
sensitive attributes like gender, ethnicity, or age without providing any rationale.

Mitigation through Interpretability:

Bias Detection: Interpretability allows stakeholders to examine the model's decision-making process, helping identify if and how certain features (e.g., gender or ethnicity) disproportionately affect the model's decisions.

Accountability: It ensures that decisions made by ML models are transparent and accountable. If a model unfairly discriminates against a group of applicants, interpretability helps trace the decision back to its underlying cause, enabling corrective measures.

Regulatory Compliance: By making models interpretable, organizations can ensure compliance with anti-discrimination laws and regulations, avoiding legal and reputational damage.

a) You have trained an ML model and discovered that it yields unacceptably high error on the test data. You also plotted a learning curve for both the test data and the training data, as shown below. Comment on the performance of this ML model, and discuss strategies to address such cases, including the approaches and measures you would take. [2.5]
High Bias [1 mark if an explanation is also provided, otherwise 0.5 marks]
https://www.dataquest.io/blog/learning-curves-machine-learning/
[1.5 marks for suggesting strategies to avoid high bias, e.g. increasing model complexity, adding or engineering features, or reducing regularization]
b)

Question 7:

a) You are fitting a logistic regression model to predict whether an email is spam (class 1) or not
(class 0) based on the length of the email's subject line (Feature). The model's coefficients are:

Coefficient: 0.03
Intercept: -1.2
Calculate the predicted probability of an email being spam for an instance with a subject line
length of 50 characters and classify the instance, assuming 70% is the threshold. Additionally,
justify the assignment of a specific class to the given instance. [5 marks]

To calculate the predicted probability of an email being spam for an instance with a
subject line length of 50 characters, you can use the logistic regression equation:

logit(p)=β0+β1×Feature

Where:
- logit(p) is the logarithm of the odds of the positive class (spam),
- β0 is the intercept (bias) coefficient,
- β1 is the coefficient for the feature (subject line length).

In this case:
- β0 = -1.2 (Intercept)
- β1 = 0.03 (Coefficient)
- Feature = 50 (Subject line length)
Substitute these values into the equation:

logit(p)=−1.2+0.03×50

logit(p)=−1.2+1.5

logit(p)=0.3

Now, to convert the logit back into a probability p, use the sigmoid (logistic) function:

p = 1 / (1 + e^(-logit(p))) [1 mark for the equation]

Substitute the logit value:

p = 1 / (1 + e^(-0.3)) [1 mark for the calculation]

p = 1 / (1 + 0.740818)

p ≈ 0.57444

So, the predicted probability of an email with a subject line length of 50 characters being classified as spam is approximately 0.57444, or 57.44%.

Since the predicted probability of 57.44% is less than the 70% threshold, the predicted class is "Not Spam". [0.5 marks for the correct classification, 0.5 marks for the justification]
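A short sketch reproducing the calculation above:

    import math

    intercept, coef, length = -1.2, 0.03, 50
    logit = intercept + coef * length          # -1.2 + 1.5 = 0.3
    p = 1 / (1 + math.exp(-logit))             # sigmoid -> ~0.57444
    print(round(p, 5), "Spam" if p >= 0.7 else "Not Spam")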

b) What are the shortcomings of using Entropy (Information Gain) as a heuristic measure while building decision trees? [1.5 marks]
The information gain measure is biased towards attributes with a large number of distinct values, e.g. Product_ID (unique for every tuple): splitting on such an attribute yields a large number of single-tuple partitions with Info_Product_ID(D) = 0, and such partitioning is useless for prediction.

c) Which of the following charts of Residual Sum of Squares (RSS) vs. model complexity represents the training phase for a fixed dataset?

Answer: c) [0.5 marks]
[1 mark for the justification]
