
Machine Learning

Assignment Unit 3

Name - Yusuf Nathdwarawala


Roll no. - 21/CDOE/BCA/002

Q-1 a) Define dimensionality reduction and subset selection in the context of machine learning. What are the primary goals of these techniques, and how do they impact model performance and computational efficiency?

b) Discuss the main methods used for dimensionality reduction, such as Principal Component Analysis (PCA) and Feature Selection techniques. Provide an example of a situation where dimensionality reduction is beneficial.

Ans. Q-1 a) Dimensionality Reduction and Subset Selection in Machine Learning

Dimensionality Reduction:
Dimensionality reduction refers to the process of reducing the number of input variables
or features in a dataset while retaining as much information as possible. This is
important when dealing with high-dimensional data, where too many features can lead
to overfitting, increased complexity, and computational inefficiency.
● Goal: Reduce the number of features without sacrificing performance by
capturing the essential patterns in the data.
● Impact:
● Improves model performance by eliminating irrelevant or redundant
features.
● Enhances computational efficiency by reducing the amount of data the
model has to process.
● Mitigates the curse of dimensionality, where having too many features
relative to the number of observations makes the model harder to
generalize.

Subset Selection:
Subset selection is a specific type of feature selection, where a subset of the original
features is chosen to build the model. The goal is to select the most important or
relevant features for the task, often based on some criteria like statistical significance or
feature importance.
● Goal: Improve model interpretability and performance by selecting only the most
informative features.
● Impact:
● Can lead to better model accuracy by removing irrelevant or noisy data.
● Increases computational efficiency by reducing the size of the input data.

Q-1 b) Main Methods for Dimensionality Reduction

1. Principal Component Analysis (PCA):


PCA is a popular method used to reduce dimensionality by transforming the original
features into a set of linearly uncorrelated variables called principal components. These
components are ordered by how much variance they capture in the data, and only the
top components are retained.
● How it Works:
● PCA identifies the directions (principal components) that maximize the
variance in the data.
● It projects the data onto these components, reducing the number of
dimensions while retaining the majority of the information.
● Example: In an image recognition problem with thousands of pixels as features,
PCA can reduce the dimensionality by projecting the images into fewer
dimensions that still capture the main differences between images.
● Impact on Model: PCA improves computational efficiency and reduces overfitting
but may make the model harder to interpret because the original features are
transformed.
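
To make the PCA description above concrete, here is a minimal sketch using scikit-learn; the random data and the choice of two components are illustrative assumptions, not values from any particular dataset.

    # Minimal PCA sketch (illustrative data; two components chosen arbitrarily).
    import numpy as np
    from sklearn.decomposition import PCA
    from sklearn.preprocessing import StandardScaler

    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 50))                 # 200 samples, 50 possibly redundant features

    X_scaled = StandardScaler().fit_transform(X)   # PCA is sensitive to feature scale
    pca = PCA(n_components=2)                      # keep only the top 2 components
    X_reduced = pca.fit_transform(X_scaled)

    print(X_reduced.shape)                         # (200, 2)
    print(pca.explained_variance_ratio_)           # variance captured by each component
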
2. Feature Selection Techniques:
Feature selection directly selects a subset of the original features based on their
importance or relevance to the target variable.
● Methods:
● Filter Methods: Use statistical techniques to evaluate the importance of
each feature independently of the model (e.g., correlation, mutual
information).
● Wrapper Methods: Evaluate different subsets of features by training and
testing a model on them, such as with forward selection, backward
elimination, or recursive feature elimination (RFE).
● Embedded Methods: Feature selection is performed during the model
training process itself (e.g., LASSO or decision tree-based models).
● Example: In a medical diagnosis problem, some patient features like age,
gender, and medical history may be irrelevant to the disease, and feature
selection helps to focus on the most important ones, such as specific biomarkers.
● Impact on Model: Feature selection improves interpretability, reduces training
time, and prevents overfitting by eliminating irrelevant data.
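
As a rough illustration of a filter-style method, the sketch below scores features with mutual information and keeps the top ten; the synthetic dataset and the choice of k=10 are assumptions made only for demonstration.

    # Filter-method sketch: keep the 10 features with the highest mutual information.
    from sklearn.datasets import make_classification
    from sklearn.feature_selection import SelectKBest, mutual_info_classif

    X, y = make_classification(n_samples=300, n_features=40, n_informative=5, random_state=0)

    selector = SelectKBest(score_func=mutual_info_classif, k=10)
    X_selected = selector.fit_transform(X, y)

    print(X_selected.shape)                        # (300, 10)
    print(selector.get_support(indices=True))      # indices of the retained features
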

Example of Dimensionality Reduction Use Case:


In text classification tasks, you might have thousands or even millions of unique words
(features). Many of these words will be irrelevant, redundant, or rarely used.
Dimensionality reduction techniques like PCA or feature selection (e.g., removing
low-frequency words) can help reduce the number of words used in the model while
retaining the meaningful ones, improving both the accuracy and efficiency of the model.
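
A hedged sketch of that idea: the tiny corpus, the min_df=2 cutoff, and the two-dimensional projection below are all invented purely for illustration.

    # Drop rare words with min_df, then project the sparse term matrix onto a few dimensions.
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.decomposition import TruncatedSVD

    docs = ["cheap meds online", "meeting at noon", "cheap offer today", "lunch at noon"]

    vectorizer = TfidfVectorizer(min_df=2)               # keep only words appearing in >= 2 documents
    X_tfidf = vectorizer.fit_transform(docs)

    svd = TruncatedSVD(n_components=2, random_state=0)   # PCA-like reduction for sparse text data
    X_reduced = svd.fit_transform(X_tfidf)

    print(vectorizer.get_feature_names_out())            # the surviving vocabulary
    print(X_reduced.shape)                               # (4, 2)
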

Summary:
● Dimensionality reduction helps simplify high-dimensional data, reducing
overfitting and improving computational efficiency.
● Subset selection chooses the most relevant features, improving model
performance and interpretability.
● PCA and feature selection are key methods, with different approaches depending
on the data and task.
● Dimensionality reduction is especially useful in cases like image processing or
text analysis, where the data has many features but only a subset is informative.

Q-2 a) Explain shrinkage methods in the context of linear regression. What are the primary types of shrinkage techniques, and how do they modify the regression coefficients?

b) Compare and contrast Lasso (L1 regularization) and Ridge (L2 regularization) as shrinkage methods. Discuss their advantages, limitations, and typical use cases in regression problems.

Ans. Q-2 a) Shrinkage Methods in Linear Regression

Shrinkage in Linear Regression:


Shrinkage methods in linear regression are techniques that apply a penalty to the size
of the regression coefficients. This is done to shrink or reduce the magnitude of the
coefficients, which helps prevent overfitting and improves the model’s ability to
generalize to new data.

In standard linear regression, the goal is to minimize the sum of squared errors between
the predicted and actual values. Shrinkage methods modify this by adding a penalty
term that controls the size of the coefficients, helping to keep the model simpler and
less sensitive to fluctuations in the training data.

Primary Types of Shrinkage Techniques:

1. Ridge Regression (L2 Regularization):


● Adds a penalty equal to the sum of the squared values of the coefficients.
● This forces the regression coefficients to become smaller (closer to zero)
but not exactly zero.
2. Lasso Regression (L1 Regularization):
● Adds a penalty equal to the sum of the absolute values of the coefficients.
● This can shrink some coefficients exactly to zero, effectively performing
feature selection.
● The loss function otherwise has the same form as in Ridge regression; only the penalty changes from squared values to absolute values of the coefficients.
How Shrinkage Modifies Regression Coefficients:

● Shrinkage techniques modify the regression coefficients by adding a penalty term to the loss function, which discourages large values for the coefficients.
● The higher the regularization parameter λ, the stronger the penalty, and the
smaller the coefficients become.
● Lasso (L1) may shrink some coefficients exactly to zero, making it useful for
feature selection.
● Ridge (L2) shrinks all coefficients towards zero but does not eliminate any of
them entirely.
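
The effect of the penalty can be seen directly on the fitted coefficients. The sketch below is only illustrative: the synthetic data and the alpha values (scikit-learn's name for λ) are arbitrary choices.

    # How increasing the penalty strength shrinks Ridge coefficients and zeroes Lasso coefficients.
    import numpy as np
    from sklearn.datasets import make_regression
    from sklearn.linear_model import Ridge, Lasso

    X, y = make_regression(n_samples=100, n_features=10, n_informative=3, noise=10.0, random_state=0)

    for alpha in [0.1, 1.0, 10.0]:                 # alpha plays the role of λ
        ridge = Ridge(alpha=alpha).fit(X, y)
        lasso = Lasso(alpha=alpha).fit(X, y)
        print(f"alpha={alpha:5}: ridge |coef| sum = {np.abs(ridge.coef_).sum():.1f}, "
              f"lasso zero coefficients = {(lasso.coef_ == 0).sum()}")
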

Q-2 b) Comparison of Lasso (L1) and Ridge (L2) Regularization

1. Lasso (L1 Regularization):

● Penalty: Adds the sum of the absolute values of the coefficients.


● Effect on Coefficients: Can shrink some coefficients exactly to zero, making it
useful for automatic feature selection.
● Advantages:
● Performs feature selection, which helps reduce the number of variables in
the model.
● Useful when you expect that many of the features are irrelevant.
● Limitations:
● In cases where the number of predictors is larger than the number of
observations, Lasso tends to pick one variable out of a group of highly
correlated variables and ignore the others.
● Typical Use Cases:
● Lasso is ideal when you have many features and expect that some are not
relevant to the target variable.
● It's often used in high-dimensional datasets, such as text classification or
genomic data.

2. Ridge (L2 Regularization):

● Penalty: Adds the sum of the squared values of the coefficients.


● Effect on Coefficients: Shrinks the coefficients towards zero, but none of them will
be exactly zero.
● Advantages:
● Useful when there are many small or moderately large predictors.
● Helps with multicollinearity (when predictor variables are highly
correlated), as it forces them to share the weight.
● Limitations:
● Does not perform feature selection, meaning all variables are retained in
the model.
● Typical Use Cases:
● Ridge is used when you have many predictors and you want to shrink their
effect, especially when predictors are correlated.
● Commonly used in scenarios where you know that all features have some
relevance to the outcome.

Key Differences Between Lasso and Ridge

Feature                    | Lasso (L1)                                | Ridge (L2)
Penalty                    | Sum of absolute values of coefficients    | Sum of squared values of coefficients
Feature Selection          | Yes, shrinks some coefficients to zero    | No, retains all features
Handling Multicollinearity | Selects one predictor from a group        | Shrinks coefficients for all correlated predictors
When to Use                | When you expect some irrelevant features  | When all features are important
Effect on Coefficients     | Some coefficients exactly zero            | Shrinks all coefficients, none exactly zero

Example of When Shrinkage Methods Help:

In high-dimensional datasets, like in genetics or finance, Lasso is useful for
automatically selecting relevant features while reducing the impact of irrelevant ones.
Ridge works well when you want to shrink the effect of all features, especially when they
are correlated, to prevent overfitting.
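
In practice the penalty strength is usually tuned by cross-validation. A minimal sketch, assuming only synthetic high-dimensional data (more features than observations), might look like this:

    # Tuning the penalty by cross-validation on p > n synthetic data.
    from sklearn.datasets import make_regression
    from sklearn.linear_model import LassoCV, RidgeCV

    X, y = make_regression(n_samples=80, n_features=200, n_informative=10, noise=5.0, random_state=1)

    lasso = LassoCV(cv=5).fit(X, y)                    # picks alpha by 5-fold CV, zeroes irrelevant features
    ridge = RidgeCV(alphas=[0.1, 1.0, 10.0]).fit(X, y)

    print("Lasso kept", int((lasso.coef_ != 0).sum()), "of", X.shape[1], "features")
    print("Ridge chose alpha =", ridge.alpha_)
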

Summary:
● Shrinkage methods (Lasso and Ridge) prevent overfitting by penalizing the size
of the regression coefficients.
● Lasso (L1) is best for feature selection, while Ridge (L2) is used when you want
to reduce the impact of all features without eliminating any.
● Both methods are useful for improving model generalization and reducing the
effect of irrelevant or redundant features.

Q-3 a) Describe Principal Components Regression (PCR) and its application in linear classification. How does PCR utilize Principal Component Analysis (PCA) to address issues of multicollinearity in regression models?

b) Provide an example where Principal Components Regression is used. Explain the steps involved in applying PCR, including how to choose the number of principal components and interpret the results.

Ans. Q-3 a) Principal Components Regression (PCR)


What is Principal Components Regression (PCR)?
Principal Components Regression (PCR) is a regression technique that combines
Principal Component Analysis (PCA) and Linear Regression. It is primarily used to
handle situations where the predictor variables (features) are highly correlated (a
problem called multicollinearity). PCR solves this issue by transforming the original
correlated predictors into a new set of uncorrelated components, called principal
components, and then using these components in a linear regression model.

How PCR Uses PCA:

1. Principal Component Analysis (PCA) transforms the original features into new
variables (principal components), which are linear combinations of the original
variables. These principal components are uncorrelated and capture the
maximum variance in the data.
2. Instead of using the original features in the regression model, PCR fits the model
on the top principal components, which reduces multicollinearity and the model's
complexity.

Addressing Multicollinearity:
Multicollinearity occurs when two or more predictor variables are highly correlated,
leading to instability in estimating regression coefficients. In PCR:
● PCA identifies the directions in the data with the most variance and transforms
the original correlated variables into uncorrelated principal components.
● By using only the first few principal components (the ones that explain the most
variance), PCR reduces the dimensionality of the data and eliminates
multicollinearity, leading to more stable regression estimates.

Q-3 b) Example of Principal Components Regression (PCR)


Let’s consider an example where PCR is used in a wine quality prediction dataset,
where several chemical properties (predictor variables) of the wine are measured to
predict the wine quality (target variable). Many of the chemical properties are highly
correlated, leading to multicollinearity, so PCR is applied to address this.

Steps Involved in Applying PCR:

1. Standardize the Data:


● Before applying PCA, it’s important to standardize or normalize the
dataset so that each feature has a mean of 0 and a standard deviation of
1. This ensures that features with larger scales don’t dominate the PCA.
● Example:

    from sklearn.preprocessing import StandardScaler

    scaler = StandardScaler()
    X_scaled = scaler.fit_transform(X)

2. Apply PCA to the Predictor Variables:


● PCA is applied to the standardized data to transform the original
correlated variables into principal components.
● The explained variance ratio of the principal components is analyzed to
determine how much variance each component captures.
● Example:

    from sklearn.decomposition import PCA

    pca = PCA()
    X_pca = pca.fit_transform(X_scaled)

3. Choose the Number of Principal Components:


● The number of principal components to retain is based on how much
variance they explain. Usually, we choose the components that explain
80-90% of the variance to capture most of the data’s information.
● A common approach is to look at a scree plot or cumulative explained
variance plot to decide how many components to keep.
● Example:

    import numpy as np
    import matplotlib.pyplot as plt

    plt.plot(np.cumsum(pca.explained_variance_ratio_))
    plt.xlabel('Number of Components')
    plt.ylabel('Variance Explained')
    plt.show()

4. Fit the Regression Model on the Principal Components:


● Once the top principal components are selected, the regression model is
fitted using these components instead of the original features.
● Example:

    from sklearn.linear_model import LinearRegression

    # Choose the top 'n' principal components
    n_components = 5
    X_pca_selected = X_pca[:, :n_components]
    model = LinearRegression()
    model.fit(X_pca_selected, y)

5. Interpret the Results:


● The model is now trained on the principal components rather than the
original features. The regression coefficients represent the contribution of
each principal component to the prediction of the target variable.
● Interpreting coefficients in PCR is more abstract, as they reflect the
influence of a principal component, which is a combination of the original
features. However, you can transform the coefficients back to the original
feature space to understand the impact of each feature.
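
A hedged sketch of that back-transformation, reusing the pca, model, and n_components objects from the snippets above and assuming a list feature_names holding the original column names:

    # Map coefficients on the principal components back to the (standardized) original features.
    beta_pc = model.coef_                                        # one coefficient per retained component
    beta_original = pca.components_[:n_components].T @ beta_pc   # back-project onto the scaled features

    for name, b in zip(feature_names, beta_original):            # feature_names is an assumed list of column names
        print(f"{name:20s} {b:+.3f}")
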
Choosing the Number of Principal Components:

● The number of principal components to include in the model is typically chosen by examining the explained variance of each component. Ideally, we want to
retain the minimum number of components that explain most of the variance.
● Rule of Thumb: If the first few components explain about 80-90% of the variance,
they are usually enough for an effective regression model.

Example Walkthrough:
Consider a dataset of wine quality prediction with 12 chemical properties as predictors.
Since many chemical properties are correlated (like sugar content and alcohol level),
multicollinearity can cause instability in a linear regression model.
● After applying PCA, we find that the first 4 principal components explain 85% of
the variance in the data.
● We then use these 4 principal components in the regression model to predict
wine quality.
● By doing so, we reduce multicollinearity, improve model stability, and make the
model more computationally efficient.
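
Assuming the 12 chemical properties are loaded in X and the quality scores in y, the whole walkthrough can be written as one pipeline; the 4 components below follow the 85%-variance figure quoted above.

    # End-to-end PCR sketch for the wine-quality walkthrough (X and y assumed to be loaded).
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler
    from sklearn.decomposition import PCA
    from sklearn.linear_model import LinearRegression
    from sklearn.model_selection import cross_val_score

    pcr = make_pipeline(StandardScaler(), PCA(n_components=4), LinearRegression())
    scores = cross_val_score(pcr, X, y, cv=5, scoring="r2")
    print("Mean cross-validated R^2:", scores.mean())
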

Summary:
● Principal Components Regression (PCR) addresses multicollinearity by
transforming correlated variables into uncorrelated principal components and
then applying linear regression.
● PCR combines PCA with regression, which makes it ideal for datasets with many
correlated predictors.
● The number of principal components used is based on how much variance they
explain, usually around 80-90% of the total variance.
● PCR is commonly used in high-dimensional datasets where multicollinearity is a
concern.
Q-4 a) Discuss Logistic Regression and its role in
classification tasks. How does logistic regression model the
probability of a binary outcome, and what is the
interpretation of its coefficients?

b) Provide an example demonstrating how to use logistic regression for a classification problem. Include the steps for model fitting, evaluating performance, and interpreting the results.

Q-4 a) Logistic Regression in Classification Tasks

What is Logistic Regression?


Logistic Regression is a method used to predict binary outcomes, where there are only
two possible results, such as yes/no, pass/fail, or spam/not spam.

Even though it's called "regression," it is used for classification problems, where you
need to assign data into categories.

How Logistic Regression Works:

● Logistic regression calculates the probability that a given input belongs to a particular class.
● It uses a special mathematical function called the logistic function (or sigmoid
function) that outputs values between 0 and 1.
● This probability is then used to classify the data. For example:
● If the probability is greater than 0.5, the model predicts class 1 (e.g., pass
or yes).
● If the probability is less than 0.5, the model predicts class 0 (e.g., fail or
no).

Interpretation of Coefficients:

● The coefficients in logistic regression show how much a predictor (like hours
studied) affects the probability of the outcome (e.g., passing an exam).
● If a coefficient is positive, it means that an increase in the predictor will increase
the probability of the positive outcome.
● If a coefficient is negative, it decreases the probability of the positive outcome.
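
Although the worked example in part (b) below is deliberately described without code, a minimal scikit-learn sketch of these ideas might look like the following; the hours-studied data points are invented for illustration.

    # Tiny pass/fail sketch: one predictor (hours studied), invented labels.
    import numpy as np
    from sklearn.linear_model import LogisticRegression

    hours = np.array([[1], [2], [3], [4], [5], [6], [7], [8]])
    passed = np.array([0, 0, 0, 0, 1, 1, 1, 1])

    clf = LogisticRegression().fit(hours, passed)

    print("Coefficient (change in log-odds per extra hour):", clf.coef_[0][0])
    print("P(pass | 6 hours studied):", clf.predict_proba([[6]])[0, 1])
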

Q-4 b) Example of Logistic Regression for Classification (No Coding)

Let’s imagine an example where you are trying to predict if a student will pass or fail an
exam based on the number of hours studied.

Steps to Apply Logistic Regression:

1. Collect Data:
● You have data showing how many hours students studied and whether
they passed or failed.
● For example:
● A student who studied 2 hours failed.
● A student who studied 8 hours passed.
2. Train the Model:
● Logistic regression is used to create a model that learns from this data. It
identifies the relationship between the number of hours studied and the
likelihood of passing.
● The model calculates probabilities, such as:
● A student who studied 5 hours has a 70% probability of passing.
● A student who studied only 2 hours has a 20% probability of
passing.
3. Make Predictions:
● Once trained, the model can be used to predict outcomes for new
students based on how many hours they studied.
● For example, if a student studies 6 hours, the model might predict a pass
because the probability is greater than 0.5.
4. Evaluate the Model:
● You can assess how well the logistic regression model works by
comparing its predictions to actual results. For example, if the model
predicts that a student will pass but they fail, it indicates that the model
might need improvement.

Interpreting the Model:


● If the model shows that hours studied has a positive coefficient, it means that
studying more increases the chances of passing.
● The model creates a decision boundary (usually set at a 50% probability). If the
probability is above 50%, the model predicts a pass, otherwise a fail.

Example in Real Life:


Imagine you're an admissions officer and need to decide if students will succeed in a
particular program. Using logistic regression, you can predict success (pass) or failure
based on factors like high school grades, test scores, and study habits.

Summary:
● Logistic regression is a tool used for binary classification.
● It predicts the probability of an outcome (e.g., pass or fail).
● The coefficients in the model tell you how each factor (e.g., hours studied)
impacts the likelihood of that outcome.
