DADS303 - MBA 3 - Machine Learning
Varun Asthana
Roll No. 2114501153
Program – Online MBA
Course – Introduction to Machine Learning
Directorate of Online Education
Note: Answer all questions. Kindly note that answers for 10-mark questions should be approximately 400-450
words. Each question is followed by an evaluation scheme.
A1) ML serves as a solution by extracting meaningful information from huge sets of raw data. If
implemented correctly, it can solve many business complexities and predict complex behavioural
patterns from customer or user data.
1. Customer Lifetime Value Prediction – Lifetime Value or LTV is a parameter that tells how
loyal a customer is to a platform, website or business. The LTV of a customer can be
predicted using ML from purchase patterns, browsing history and behavioural patterns.
2. Predictive Maintenance – manufacturing companies often carry out preventive maintenance
that is expensive and time consuming. With the use of ML, factory data can be used to build
historical datasets, workflow visualization tools, flexible analyses and feedback loops. From
these, many hidden patterns and insights can be found using ML.
3. Eliminates Manual Data Entry – predictive data modelling and ML can help eliminate the
errors caused by manual data entry. As the resulting data is of good quality, it can be
analysed and value can be added to the business.
4. Detecting Spam – ML can help detect spam and potential threats to a platform or website.
ML techniques, including neural networks, detect spam and phishing messages.
5. Product Recommendation – based on the data accumulated for a product over a period of
time, analysis is done to draw insights for product improvement. This supports website
optimization and product optimization, and the product can be made more user friendly.
6. Financial Analysis – using ML predictive modelling, large volumes of historical data can be
analysed to support portfolio management, algorithmic trading and fraud detection.
7. Image Recognition – Image recognition is performed by various companies using data mining
and ML, combining pattern recognition with knowledge discovery from databases. It is used in
different domains such as automobiles, healthcare, etc.
8. Medical Diagnosis – patients' health improvement and healthcare cost reduction are achieved
by using ML's superior diagnostic tools and effective treatment plans.
9. Improving Cyber Security – ML can be used to increase data security in an organisation and
can solve many problems in this area. ML allows organisations to build new technologies
that quickly and effectively detect unknown threats.
10. Increasing Customer Satisfaction – ML can help improve customer satisfaction. This is
achieved by analysing customer feedback; common problem areas are identified and resolved
at the product level. After a problem is identified, the customer can also be assigned to a
suitable executive for resolution.
Q2) What do you mean by Regularization? Briefly discuss various methods to do Regularization in
Regression.
A2) Before understanding the term regularization, let us understand ‘overfitting’ and ‘underfitting’
of the model. To train our model, we give it some data to learn, followed by plotting the data points
and drawing the best fit line to understand the relationship between multiple variables. This is called
data fitting. We can call the model a good fit if all the necessary patterns are captured and no
irrelevant or random data points or patterns are present. These undesired data points are called noise.
When the model is trained on the noise as well, it is called overfitting. Conversely, a scenario where
the ML model can neither learn the relationship between the variables in the training data nor predict
or classify a new data point is called underfitting.
So, Regularization in Machine Learning is a technique used to calibrate ML models so that the
adjusted loss function is minimized and overfitting/underfitting is avoided.
1. Ridge Regularization
It fixes overfitting or underfitting by adding a penalty equivalent to the sum of the
squares of the magnitudes of the coefficients, i.e. it performs regularization by
shrinking the coefficients.
The cost function is: Cost = Loss + λ × Σ‖w‖², where Loss is the sum of squared
residuals, λ is the penalty for errors, and w is the slope (coefficient) of the fitted line.
The higher the value of λ, the more the coefficients are shrunk. Ridge regularization
therefore reduces multicollinearity and reduces model complexity through coefficient shrinkage.
2. Lasso Regularization
It fixes overfitting or underfitting by adding a penalty equivalent to the sum of the
absolute values of the coefficients: Cost = Loss + λ × Σ|w|.
Because it penalizes the absolute magnitude of the coefficients, lasso can shrink some
coefficients all the way to zero, which also performs feature selection by eliminating the
least important variables (see the short R sketch after this list).
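As a minimal sketch (not part of the original answer), both methods can be fitted in R with the glmnet package, assuming it is installed; the alpha argument switches between the ridge and lasso penalties, and lambda controls the penalty strength. The built-in mtcars dataset is used purely for illustration.

# Sketch of ridge and lasso regression, assuming the "glmnet" package
# and the built-in mtcars dataset.
library(glmnet)

x <- as.matrix(mtcars[, c("hp", "wt", "disp")])   # predictor matrix
y <- mtcars$mpg                                   # response

ridge_fit <- glmnet(x, y, alpha = 0)   # alpha = 0: penalty = lambda * sum(w^2)  (ridge)
lasso_fit <- glmnet(x, y, alpha = 1)   # alpha = 1: penalty = lambda * sum(|w|)  (lasso)

# Choose lambda by cross-validation and inspect the lasso coefficients;
# some of them may be shrunk exactly to zero.
cv_lasso <- cv.glmnet(x, y, alpha = 1)
coef(cv_lasso, s = "lambda.min")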
A3) Binary logistic regression is a statistical method used to model the relationship between a binary
dependent variable (also called the response or outcome variable) and one or more independent
variables (also called predictors or explanatory variables). The dependent variable takes on only two
values, typically coded as 0 or 1, representing the absence or presence of an event, respectively.
The goal of binary logistic regression is to estimate the probability of the dependent variable being 1,
given the values of the independent variables. The logistic regression model uses a logistic function
(also known as a sigmoid function) to transform a linear combination of the independent variables
into a probability value between 0 and 1.
p = 1 / (1 + exp(-z))
where p is the probability of the dependent variable being 1, exp is the exponential function, and z is
a linear combination of the independent variables:
z = b0 + b1*x1 + b2*x2 + ... + bk*xk
where b0 is the intercept and b1, ..., bk are the coefficients of the independent variables x1, ..., xk.
To estimate the coefficients in the model, a method called maximum likelihood estimation is used.
The method involves finding the values of the coefficients that maximize the likelihood of observing
the data given the model.
Once the coefficients are estimated, the model can be used to predict the probability of the
dependent variable being 1, given a set of values for the independent variables. A threshold value
can then be chosen to classify the observations into two groups, typically 0 or 1.
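As a small numeric sketch of the sigmoid transformation and the threshold step (the coefficient values and predictor vectors below are invented for illustration, not taken from the text):

x1 <- c(1.0, 2.5, 0.3, 4.2)               # hypothetical values of a first predictor
x2 <- c(0.5, 1.5, 2.0, 0.1)               # hypothetical values of a second predictor
z  <- 0.5 + 1.2 * x1 - 0.8 * x2           # linear combination z = b0 + b1*x1 + b2*x2
p  <- 1 / (1 + exp(-z))                   # sigmoid maps z to a probability between 0 and 1
predicted_class <- ifelse(p > 0.5, 1, 0)  # classify with a 0.5 threshold
p
predicted_class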
Binary logistic regression is commonly used in various fields, such as finance, marketing, medicine,
and social sciences, to analyze and predict binary outcomes.
In the below example, we fit a binary logistic regression model using the "glm" function in R,
specifying the formula vs ~ mpg + hp + wt, where vs is the binary dependent variable and mpg, hp,
and wt are the independent variables. We use the "binomial" family to specify that we are fitting a
binary logistic regression model.
We then display the summary of the model, which shows the estimated coefficients for the
independent variables, their standard errors, the z-values, and the p-values.
Finally, we make predictions on new data by creating a new data frame with values for mpg, hp, and
wt, and using the predict function to obtain the predicted probabilities of vs being 1 for each
observation in the new data frame. The type = "response" argument specifies that we want the
predicted probabilities instead of the linear predictor values.
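The code itself is not reproduced in the original answer; the following is a minimal sketch consistent with the description above, using R's built-in mtcars dataset (in which vs is already coded as 0/1), with illustrative values in the new data frame.

# Binary logistic regression on mtcars, as described above.
model <- glm(vs ~ mpg + hp + wt, data = mtcars, family = "binomial")

summary(model)   # estimated coefficients, standard errors, z-values and p-values

# Predicted probabilities of vs = 1 for new observations (illustrative values)
new_data <- data.frame(mpg = c(21, 30), hp = c(110, 65), wt = c(2.6, 1.8))
predict(model, newdata = new_data, type = "response")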
A4) K-Means Clustering is an unsupervised learning algorithm that is used to solve clustering
problems in machine learning or data science by grouping the unlabeled dataset into various clusters.
K refers to the number of pre-defined clusters that need to be created in the process. So for example
if K=2, there will be two clusters, and for K=3, there will be three clusters, and so on.
It allows us to arrange the data points into clusters of different groups and thus makes it
convenient to discover the categories of groups in the unlabeled dataset on its own, without the
need for any training. It is a centroid-based algorithm, where each cluster is associated with a
centroid, and its main aim is to minimize the sum of distances between each data point and the
centroid of its cluster.
In this algorithm, the unlabeled dataset is given as input; the algorithm divides it into k clusters
and repeats the process until it finds the best clusters. It should be noted that the value of k
must be predetermined. The algorithm:
Determines the best value for the K center points or centroids by an iterative process.
Assigns each data point to its closest k-center; the data points that are near a particular
k-center form a cluster.
Hence each cluster contains data points with some commonalities, which is why they are grouped
together.
Suppose M1 and M2 are two variables whose scatter plot is as shown below:
Let's take number k of clusters, i.e., K=2, to identify the dataset and to put them into different
clusters. It means here we will try to group these datasets into two different clusters.
We need to choose some random k points or centroids to form the clusters. These points can be either
points from the dataset or any other points. So, here we are selecting the below two points as k
points, which are not part of our dataset. Consider the below image:
Now we will assign each data point of the scatter plot to its closest K-point or centroid. We will
compute it by applying some mathematics that we have studied to calculate the distance between
two points. So, we will draw a median between both the centroids. Consider the below image:
From the above image, it is clear that the points on the left side of the line are near the K1 or blue
centroid, and the points on the right of the line are close to the yellow centroid. Let's color them
blue and yellow for clear visualization.
As we need to find the closest cluster, we will repeat the process by choosing new centroids. To
choose the new centroids, we compute the center of gravity (mean) of the data points in each cluster
and move the centroids there, as below:
Next, we will reassign each datapoint to the new centroid. For this, we will repeat the same process
of finding a median line. The median will be like below image:
From the above image, we can see that one yellow point is on the left side of the line, and two blue
points are to the right of the line. So, these three points will be reassigned to the other centroid.
As reassignment has taken place, we again go back to the step of finding new centroids or
K-points.
We will repeat the process by finding the center of gravity of each cluster, so the new centroids will
be as shown in the below image:
With the new centroids, we again draw the median line and reassign the data points, so the image
will be as below:
We can see in the above image that no data points change sides of the line, which means the
assignments have stabilized and our model is formed. Consider the below image:
As our model is ready, we can now remove the assumed centroids, and the two final clusters will
be as shown in the below image:
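The referenced figures are not reproduced here. As a minimal sketch (the data are randomly generated for illustration and the variable names M1 and M2 match the example above), the same procedure can be run with R's built-in kmeans function:

# K-means on two illustrative variables M1 and M2 with K = 2.
set.seed(42)
M1 <- c(rnorm(25, mean = 2), rnorm(25, mean = 8))
M2 <- c(rnorm(25, mean = 2), rnorm(25, mean = 8))
points_df <- data.frame(M1, M2)

fit <- kmeans(points_df, centers = 2)   # iteratively updates centroids and assignments

fit$centers                             # final centroids (center of gravity of each cluster)
fit$cluster                             # cluster assignment of every data point

plot(points_df, col = fit$cluster, pch = 19)       # points colored by cluster
points(fit$centers, col = 1:2, pch = 8, cex = 2)   # final centroids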
Q5) Briefly explain ‘Splitting Criteria’, ‘Merging Criteria’ and ‘Stopping criteria’ in Decision Tree.
SPLITTING CRITERIA
The objective of splitting criteria is to find the optimal way to partition the data into homogeneous
subsets in terms of the target variable.
Information gain: Information gain measures the reduction in entropy (i.e., degree of
disorder) achieved by splitting a node based on a particular variable (a small numeric
sketch follows this list).
Gain ratio: Gain ratio is similar to information gain, but it takes into account the
intrinsic information of a variable, which is the degree to which a variable is capable
of making finer partitions in the data. The gain ratio penalizes variables that have too
many categories or levels.
Chi-square: This criterion is used to determine whether a split based on a particular variable
is statistically significant. The optimal split is the one with the highest chi-square value.
Reduction in variance: This criterion is used in regression trees and measures the
reduction in variance achieved by splitting a node based on a particular variable. The
optimal split is the one that minimizes the weighted sum of the variance of the child
nodes.
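As a small numeric sketch of the information gain criterion (the functions below are written directly from the definitions above rather than taken from a package, and the mtcars split is purely illustrative):

# Entropy and information gain for a single binary split.
entropy <- function(labels) {
  p <- table(labels) / length(labels)   # class proportions at the node
  -sum(p * log2(p))                     # Shannon entropy (degree of disorder)
}

info_gain <- function(labels, split) {
  # 'split' is a logical vector sending each observation to the left/right child node
  n <- length(labels)
  child_entropy <- sum(split) / n * entropy(labels[split]) +
                   sum(!split) / n * entropy(labels[!split])
  entropy(labels) - child_entropy       # reduction in entropy achieved by the split
}

# Example: how much does splitting on wt > 3.2 reduce disorder in the vs label?
info_gain(mtcars$vs, mtcars$wt > 3.2)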
MERGING CRITERIA
Merging criteria, also known as pruning criteria, are used in decision tree algorithms to determine
when to stop growing the tree by merging or pruning some of its nodes. The objective of merging
criteria is to prevent overfitting, which occurs when the tree is too complex and captures noise or
random variation in the data rather than the underlying patterns or relationships.
STOPPING CRITERIA
Stopping criteria in decision trees refer to the rules that determine when to stop splitting a node and
make it a leaf node. The decision tree algorithm continues to split nodes until a certain stopping
criterion is met. Some commonly used stopping criteria in decision trees include the following (a
short R sketch follows this list):
Maximum tree depth: This criterion specifies the maximum depth of the tree. Once
the tree reaches the maximum depth, the algorithm stops splitting and creates a leaf
node.
Minimum number of samples: This criterion specifies the minimum number of
samples required to split a node. If the number of samples at a node is less than the
specified minimum, the node becomes a leaf node.
Maximum number of leaf nodes: This criterion specifies the maximum number of
leaf nodes allowed in the tree. Once the maximum number is reached, the algorithm
stops splitting and creates leaf nodes.
Minimum impurity decrease: This criterion specifies the minimum amount of
reduction in impurity that must be achieved by splitting a node. If the impurity
decrease is less than the specified minimum, the node becomes a leaf node.
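A minimal sketch of how such stopping criteria are typically set in R, assuming the rpart package is installed (the parameter values and the iris dataset are illustrative choices, not from the original answer):

# Decision tree with explicit stopping criteria, using the rpart package.
library(rpart)

fit <- rpart(
  Species ~ .,              # classify iris species from all other variables
  data    = iris,
  method  = "class",
  control = rpart.control(
    maxdepth = 3,           # maximum tree depth
    minsplit = 20,          # minimum number of samples required to attempt a split
    cp       = 0.01         # minimum improvement (complexity parameter) required for a split
  )
)

printcp(fit)                     # complexity table used for pruning (merging) decisions
pruned <- prune(fit, cp = 0.02)  # prune (merge) back the weaker splits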
Q6) What is Support Vector Machine? What are the various steps in using Support Vector
Machine?
A6) Support Vector Machine (SVM) is a fast and dependable classification algorithm that performs
very well with a limited amount of data to analyse. It is a supervised machine learning model that
uses classification algorithms for two-group classification problems. In the scatter plot below, the
support vectors are the data points that lie nearest to the decision boundary.
You can also see that a blue line separates the two categories. This line is called the hyperplane. The
objective of the SVM algorithm is to find a hyperplane in an N-dimensional space that distinctly
classifies the data points.
The dimension of the hyperplane depends upon the number of features. If the number of input
features is two, the hyperplane is just a line. If the number of input features is three, the
hyperplane becomes a 2-D plane. It becomes difficult to visualize when the number of features
exceeds three. The distance between the support vectors and the separating hyperplane is called the
margin. The hyperplane is best when the margin is maximum. Now that we have understood what
SVM is, let us understand how it is used in R (the steps below are followed by an illustrative code sketch):
We use 70% of the data for training and the remaining 30% for testing.
We then fit a support vector machine model using the svm function from the
"e1071" package. We specify the formula Species ~ . to indicate that we want to
predict the "Species" variable based on all the other variables in the dataset.
We display the model summary using the summary function, which shows the
number of support vectors, the kernel function used, and other model parameters.
We then make predictions on the testing set using the predict function, and create a
confusion matrix using the table function to see how well the model performed.
Finally, we calculate the model accuracy by dividing the number of correct
predictions by the total number of observations in the testing set.
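The corresponding code is not reproduced in the original answer; a minimal sketch consistent with the steps above, assuming the e1071 package and the built-in iris dataset, is:

# SVM workflow described above: 70/30 split, fit, predict, confusion matrix, accuracy.
library(e1071)

set.seed(123)
train_idx <- sample(seq_len(nrow(iris)), size = 0.7 * nrow(iris))  # 70% for training
train <- iris[train_idx, ]
test  <- iris[-train_idx, ]                                        # remaining 30% for testing

model <- svm(Species ~ ., data = train)   # default radial-basis kernel
summary(model)                            # kernel, cost and number of support vectors

pred <- predict(model, newdata = test)
confusion <- table(Predicted = pred, Actual = test$Species)
confusion

accuracy <- sum(diag(confusion)) / nrow(test)  # correct predictions / total test observations
accuracy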