Unit 4 Dimensionality Reduction
Dimensionality Reduction
Syllabus
• Dimensionality Reduction:
Singular Value Decomposition
Principal Component Analysis
Linear Discriminant Analysis
• Dimensionality reduction is a process or technique for reducing the number of
dimensions -- or features -- in a data set.
• The goal of dimensionality reduction is to decrease the
data set's complexity by reducing the number of
features while keeping the most important properties of
the original data.
What is Dimensionality Reduction?
• The number of input features, variables, or columns present
in a given dataset is known as dimensionality, and the process
to reduce these features is called dimensionality reduction.
• 1. Filter Method
• 2. Wrapper Method
• Forward Selection
• Backward Selection
• Bi-directional Elimination
• 3. Embedded Method
• LASSO
• Elastic Net
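As a concrete illustration of the wrapper and embedded ideas above, here is a minimal scikit-learn sketch on a synthetic regression problem (the dataset, the alpha value, and the number of selected features are illustrative assumptions, not part of the slides):

```python
# Minimal sketch: embedded (LASSO) and wrapper (RFE) feature selection
# with scikit-learn on a synthetic regression problem.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.feature_selection import SelectFromModel, RFE
from sklearn.linear_model import Lasso, LinearRegression

X, y = make_regression(n_samples=200, n_features=10, n_informative=3, random_state=0)

# Embedded method: LASSO drives uninformative coefficients to zero, and
# SelectFromModel keeps only the features with non-zero weights.
lasso_selector = SelectFromModel(Lasso(alpha=1.0)).fit(X, y)
print("LASSO keeps features:", np.where(lasso_selector.get_support())[0])

# Wrapper method: recursive feature elimination repeatedly refits the model
# and drops the weakest feature until the requested number remains.
rfe = RFE(LinearRegression(), n_features_to_select=3).fit(X, y)
print("RFE keeps features:  ", np.where(rfe.support_)[0])
```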
Feature Extraction
• Feature extraction is the process of transforming the
space containing many dimensions into space with
fewer dimensions.
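A minimal NumPy sketch of feature extraction via singular value decomposition (the SVD route listed in the syllabus); the data here is random and 5-dimensional purely for illustration:

```python
# Minimal sketch of feature extraction: project 5-dimensional data onto the
# top-2 right singular vectors obtained from the SVD of the centered data.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))          # 100 samples, 5 original features
Xc = X - X.mean(axis=0)                # center each feature

U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
X_reduced = Xc @ Vt[:2].T              # new 2-dimensional feature space

print(X.shape, "->", X_reduced.shape)  # (100, 5) -> (100, 2)
```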
• 0.5674 x1 = -0.6154 y1
• Divide both sides by 0.5674.
• You will get: x1 = -1.0845 y1
• Setting y1 = 1, the pair (x1, y1) becomes (-1.0845, 1). This is the initial eigenvector; it
still needs to be normalized to get the final value.
• To normalize, take the square root of the sum of the squares of the eigenvector's
components, and call this value 'x'.
• Finally, divide each component of the eigenvector by 'x' to get the final (unit-length) eigenvector.
The eigenvector above is generated for the eigenvalue 0.490.
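The normalization step can be checked numerically. The sketch below uses only the value (-1.0845, 1) quoted above; the covariance matrix that the coefficients 0.5674 and -0.6154 come from is not reproduced on this slide:

```python
# Normalizing the initial eigenvector (x1, y1) = (-1.0845, 1) from the slide.
import numpy as np

v = np.array([-1.0845, 1.0])     # initial (unnormalized) eigenvector
x = np.sqrt(np.sum(v ** 2))      # 'x' in the slide: sqrt of the sum of squares
v_final = v / x                  # final, unit-length eigenvector

print(x)                         # approx 1.4752
print(v_final)                   # approx [-0.7352  0.6779]
```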
Describe the algorithm with an example:
• Consider a 2-D dataset with two classes:
• C1 = X1 = (x1, x2) = {(4,1), (2,4), (2,3), (3,6), (4,4)}
• C2 = X2 = (x1, x2) = {(9,10), (6,8), (9,5), (8,7), (10,8)}
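A small sketch that loads this two-class dataset and computes the class means, which is the starting point of the LDA calculation (the names C1 and C2 follow the slide):

```python
# The two-class dataset from the slide, and the class means that LDA starts from.
import numpy as np

C1 = np.array([(4, 1), (2, 4), (2, 3), (3, 6), (4, 4)], dtype=float)
C2 = np.array([(9, 10), (6, 8), (9, 5), (8, 7), (10, 8)], dtype=float)

m1 = C1.mean(axis=0)   # [3.0, 3.6]
m2 = C2.mean(axis=0)   # [8.4, 7.6]
print(m1, m2)
```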
PCA
Theory – Algorithms – steps explained
Steps / Functions to perform PCA
• Subtract mean.
• Calculate the covariance matrix.
• Calculate eigenvectors and eigenvalues.
• Select principal components.
• Reduce the data dimension.
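A minimal NumPy sketch of the five steps above on a small random dataset (the data, the choice of k = 2, and the use of eigh are illustrative assumptions):

```python
# Minimal NumPy sketch of the five PCA steps listed above.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 4))                     # 50 samples, 4 features

# 1. Subtract the mean of each feature.
Xc = X - X.mean(axis=0)

# 2. Calculate the covariance matrix (columns treated as variables).
cov = np.cov(Xc, rowvar=False)

# 3. Calculate eigenvalues and eigenvectors of the covariance matrix.
eigvals, eigvecs = np.linalg.eigh(cov)           # eigh: the covariance matrix is symmetric

# 4. Select principal components: sort by eigenvalue, keep the top k.
order = np.argsort(eigvals)[::-1]
k = 2
components = eigvecs[:, order[:k]]

# 5. Reduce the data dimension by projecting onto the selected components.
X_reduced = Xc @ components
print(X_reduced.shape)                           # (50, 2)
```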
• Principal component analysis is a form of multivariate statistical analysis and is one method of
studying the correlation or covariance structure in a set of measurements on m variables
for n observations.
• Reducing the number of variables of a data set naturally comes at the expense of accuracy,
but the trick in dimensionality reduction is to trade a little accuracy for simplicity: smaller
data sets are easier to explore and visualize, and machine learning algorithms can analyze
the data much faster without extraneous variables to process.
• So, to sum up, the idea of PCA is simple: reduce the number of variables of a data set
while preserving as much information as possible.
• What do the covariances that we have as entries of the matrix tell us
about the correlations between the variables?
• It is actually the sign of the covariance that matters: if it is positive, the two variables
increase or decrease together (they are correlated); if it is negative, one increases when
the other decreases (they are inversely correlated).
• Now that we know that the covariance matrix is no more than a table
that summarizes the correlations between all the possible pairs of
variables, let's move to the next step.
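A tiny demonstration of the sign argument: two synthetic variables that move with and against a third produce positive and negative covariance entries respectively (the data is made up for illustration):

```python
# The sign of a covariance entry tells us how two variables move together.
import numpy as np

rng = np.random.default_rng(0)
a = rng.normal(size=500)
b = 2 * a + rng.normal(scale=0.1, size=500)    # increases with a  -> positive covariance
c = -2 * a + rng.normal(scale=0.1, size=500)   # decreases as a grows -> negative covariance

cov = np.cov(np.vstack([a, b, c]))             # rows are treated as variables
print(np.round(cov, 2))                        # cov[0, 1] > 0, cov[0, 2] < 0
```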
Eigenvectors and eigenvalues are the linear algebra concepts that we need to compute from
the covariance matrix in order to determine the principal components of the data.
Principal components are new variables that are constructed as linear combinations or
mixtures of the initial variables.
These combinations are done in such a way that the new variables (i.e., principal components)
are uncorrelated and most of the information within the initial variables is squeezed or
compressed into the first components.
So, the idea is that 10-dimensional data gives you 10 principal components, but PCA tries to put
the maximum possible information in the first component, then the maximum remaining information
in the second, and so on, until most of the information is concentrated in the first few components.
• As there are as many principal components as there are variables in the data, principal components are
constructed in such a manner that the first principal component accounts for the largest possible
variance in the data set.
• Organizing information in principal components this way allows you to reduce dimensionality without
losing much information, by discarding the components with low information and treating the
remaining components as your new variables.
• An important thing to realize here is that the principal components are less interpretable and don't have
any real meaning, since they are constructed as linear combinations of the initial variables.
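A short sketch of how the variance is distributed across components in practice, using scikit-learn's PCA on the Iris data (the dataset and the 95% threshold are illustrative choices, not from the slides):

```python
# How much variance each principal component carries, and how many components
# are enough to keep (for example) 95% of it, using scikit-learn's PCA.
import numpy as np
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X = load_iris().data                        # 150 samples, 4 features
pca = PCA().fit(X)

print(pca.explained_variance_ratio_)        # decreasing: PC1 carries the most variance
cum = np.cumsum(pca.explained_variance_ratio_)
print("components for 95% variance:", np.searchsorted(cum, 0.95) + 1)
```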
Characteristic Polynomial and Characteristic Equation
Eigenvalues and Eigenvectors
2 x 2 Example: Compute Eigenvalues

A = [ 1  -2 ]    so    A - λI = [ 1-λ   -2  ]
    [ 3  -4 ]                   [  3   -4-λ ]

det(A - λI) = (1 - λ)(-4 - λ) + 6 = λ² + 3λ + 2
Set λ² + 3λ + 2 to 0, giving the eigenvalues λ = -1 and λ = -2.
Example: Find the eigenvalues of

A = [ 1  2  3 ]
    [ 0  4  2 ]
    [ 0  0  7 ]

A - λIn = [ 1-λ   2     3  ]
          [  0   4-λ    2  ]
          [  0    0    7-λ ]

det(A - λIn) = 0  =>  (1 - λ)(4 - λ)(7 - λ) = 0
λ = 1, 4, 7
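These eigenvalues can be verified numerically; a quick NumPy check of both worked examples:

```python
# Checking the two worked examples: the 2x2 matrix has eigenvalues -1 and -2,
# and the upper-triangular 3x3 matrix has its diagonal entries 1, 4, 7 as eigenvalues.
import numpy as np

A2 = np.array([[1, -2],
               [3, -4]], dtype=float)
A3 = np.array([[1, 2, 3],
               [0, 4, 2],
               [0, 0, 7]], dtype=float)

print(np.sort(np.linalg.eigvals(A2)))   # [-2. -1.]
print(np.sort(np.linalg.eigvals(A3)))   # [1. 4. 7.]
```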
Example 3: Eigenvalues and Eigenvectors
Find the eigenvalues and eigenvectors of the matrix
A = [ 5  4  2 ]
    [ 4  5  2 ]
    [ 2  2  2 ]
Solution: The matrix A - λI3 is obtained by subtracting λ from the diagonal elements of A. Thus

A - λI3 = [ 5-λ   4     2  ]
          [  4   5-λ    2  ]
          [  2    2    2-λ ]
The characteristic polynomial of A is |A - λI3|. Using row and column operations to simplify
determinants, we get |A - λI3| = (1 - λ)²(10 - λ), so the eigenvalues of A are λ1 = 10 and λ2 = 1.
Alternate Solution
Solve any two of the equations.
• λ2 = 1
Let λ = 1 in (A - λI3)x = 0. We get
(A - 1·I3)x = 0:
[ 4  4  2 ] [ x1 ]   [ 0 ]
[ 4  4  2 ] [ x2 ] = [ 0 ]
[ 2  2  1 ] [ x3 ]   [ 0 ]
The solution to this system of equations can be shown to be x1 = -s - t, x2 = s, and x3 = 2t, where s and
t are scalars. Thus the eigenspace of λ2 = 1 is the space of vectors of the form
[ -s - t ]
[    s   ]
[   2t   ]
Separating the parameters s and t, we can write
[ -s - t ]       [ -1 ]       [ -1 ]
[    s   ]  =  s [  1 ]  +  t [  0 ]
[   2t   ]       [  0 ]       [  2 ]
Thus the eigenspace of λ = 1 is a two-dimensional subspace of R3 with basis
[ -1 ]   [ -1 ]
[  1 ] , [  0 ]
[  0 ]   [  2 ]
If an eigenvalue occurs as a k-times repeated root of the characteristic equation, we say that it is of
multiplicity k. Thus λ = 10 has multiplicity 1, while λ = 1 has multiplicity 2 in this example.
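A quick NumPy check of Example 3, confirming the eigenvalues and their multiplicities:

```python
# Checking Example 3: the symmetric matrix A has eigenvalue 10 with multiplicity 1
# and eigenvalue 1 with multiplicity 2 (a 2-dimensional eigenspace).
import numpy as np

A = np.array([[5, 4, 2],
              [4, 5, 2],
              [2, 2, 2]], dtype=float)

eigvals, eigvecs = np.linalg.eigh(A)        # A is symmetric, so eigh applies
print(np.round(eigvals, 6))                 # [ 1.  1. 10.]

# The two eigenvector columns paired with eigenvalue 1 span the same plane as the
# basis vectors (-1, 1, 0) and (-1, 0, 2) found above.
```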
Linear Discriminant Analysis (LDA)
Data representation vs. Data Classification
Difference between PCA and LDA
• Sw = S1 + S2 (the within-class scatter matrix is the sum of the per-class scatter matrices; see the sketch below)
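A minimal NumPy sketch of this computation on the two-class dataset given earlier, forming S1, S2, and Sw = S1 + S2, and then the discriminant direction w proportional to Sw^-1 (m1 - m2). The formula for w is the standard two-class Fisher criterion, stated here as an assumption since the slide only shows Sw:

```python
# Within-class scatter and LDA direction for the two-class dataset from the slides.
import numpy as np

C1 = np.array([(4, 1), (2, 4), (2, 3), (3, 6), (4, 4)], dtype=float)
C2 = np.array([(9, 10), (6, 8), (9, 5), (8, 7), (10, 8)], dtype=float)

m1, m2 = C1.mean(axis=0), C2.mean(axis=0)          # class means

S1 = (C1 - m1).T @ (C1 - m1)                       # scatter matrix of class 1
S2 = (C2 - m2).T @ (C2 - m2)                       # scatter matrix of class 2
Sw = S1 + S2                                       # within-class scatter

w = np.linalg.solve(Sw, m1 - m2)                   # LDA projection direction
print(Sw)
print(w / np.linalg.norm(w))                       # unit-length direction
```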