5 - Feature Generation

Optimal Feature Generation

● In general, feature generation is a problem-dependent task.


However, there are a few general directions common in a
number of applications. We focus on three such alternatives.
➢ Optimized features based on Scatter matrices (Fisher’s linear
discrimination).
• The goal: Given an original set of m measurements x ∈ ℜ^m, compute y ∈ ℜ^ℓ by the linear transformation

      y = A^T x

  so that the scattering matrix criterion J3, involving S_w and S_b, is maximized. A^T is an ℓ×m matrix.
• The basic steps in the proof:
  – J3 = trace{S_w^{-1} S_m}
  – S_yw = A^T S_xw A ,  S_yb = A^T S_xb A
  – J3(A) = trace{(A^T S_xw A)^{-1} (A^T S_xb A)}
  – Compute A so that J3(A) is maximum.

• The solution:

  – Let B be the matrix that diagonalizes simultaneously the matrices S_yw, S_yb, i.e.:

      B^T S_yw B = I ,  B^T S_yb B = D

    where B is an ℓ×ℓ matrix and D an ℓ×ℓ diagonal matrix.
  – Let C = AB, an m×ℓ matrix. If A maximizes J3(A), then

      (S_xw^{-1} S_xb) C = C D

    The above is an eigenvalue-eigenvector problem. For an M-class problem, S_xw^{-1} S_xb is of rank M−1.

● If ℓ = M−1, choose C to consist of the M−1 eigenvectors corresponding to the non-zero eigenvalues:

      y = C^T x

  The above guarantees maximum J3 value. In this case: J3,x = J3,y.
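
As an illustration, the eigenvalue-eigenvector problem above can be solved numerically as in the following NumPy sketch (the function name and interface are illustrative, and S_xw, S_xb are assumed to be already estimated from labelled data):

import numpy as np

def scatter_matrix_features(Sxw, Sxb, n_features):
    # Solve (S_xw^{-1} S_xb) c = lambda c; the columns of C are the eigenvectors
    # corresponding to the largest (non-zero) eigenvalues.
    eigvals, eigvecs = np.linalg.eig(np.linalg.solve(Sxw, Sxb))
    order = np.argsort(-eigvals.real)[:n_features]
    C = eigvecs[:, order].real
    return C          # new features: y = C.T @ x

Since S_xw^{-1} S_xb has rank M−1 for an M-class problem, n_features should not exceed M−1.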
● For a two-class problem, the above solution results in the well-known Fisher's linear discriminant

      y = (μ1 − μ2)^T S_xw^{-1} x

  For Gaussian classes, this is the optimal Bayesian classifier, differing only by a threshold value.
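
A minimal sketch of the two-class case, assuming two sample matrices X1, X2 with one sample per row; here the within-class scatter is formed from the sample covariances, which matches S_xw up to a scaling that does not affect the direction:

import numpy as np

def fisher_discriminant(X1, X2):
    # X1, X2: (n_i, m) arrays with the samples of the two classes.
    mu1, mu2 = X1.mean(axis=0), X2.mean(axis=0)
    Sxw = np.cov(X1, rowvar=False) + np.cov(X2, rowvar=False)
    w = np.linalg.solve(Sxw, mu1 - mu2)   # w = S_xw^{-1} (mu_1 - mu_2)
    return w                              # project with y = w @ x and compare to a threshold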
● If ℓ < M−1, choose the ℓ eigenvectors corresponding to the ℓ largest eigenvalues.
● In this case, J3,y < J3,x, that is, there is a loss of information.

  – Geometric interpretation: the vector y is the projection of x onto the subspace spanned by the eigenvectors of S_xw^{-1} S_xb.

● Principal Components Analysis
(The Karhunen – Loève transform):
➢ The goal: Given an original set of m measurements x ∈ ℜ^m, compute y ∈ ℜ^ℓ,

      y = A^T x

  for an orthogonal A, so that the elements of y are optimally mutually uncorrelated, that is:

      E[y(i) y(j)] = 0 ,  i ≠ j

➢ Sketch of the proof:

      R_y = E[y y^T] = E[A^T x x^T A] = A^T R_x A
• If A is chosen so that its columns a_i are the orthogonal eigenvectors of R_x, then

      R_y = A^T R_x A = Λ

  where Λ is diagonal with elements the respective eigenvalues λ_i.
• Observe that this is a sufficient condition but not
necessary. It imposes a specific orthogonal structure on
A.
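
This choice of A can be checked numerically. The sketch below (illustrative names, zero-mean data assumed) estimates R_x, uses its eigenvectors as the columns of A and returns the transformed samples; A^T R_x A then comes out (approximately) diagonal with the λ_i on its diagonal:

import numpy as np

def kl_transform(X, n_components):
    # X: (n_samples, m) zero-mean data matrix.
    Rx = (X.T @ X) / X.shape[0]              # estimate of the autocorrelation matrix R_x
    eigvals, A = np.linalg.eigh(Rx)          # orthogonal eigenvectors of R_x
    order = np.argsort(-eigvals)[:n_components]
    A = A[:, order]                          # keep the principal eigenvectors
    Y = X @ A                                # y = A^T x for every sample (one row each)
    return Y, A, eigvals[order]

# Check: A.T @ Rx @ A is diagonal, with the eigenvalues lambda_i on its diagonal.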

➢ Properties of the solution


• Mean Square Error approximation.
Due to the orthogonality of A:

      x = Σ_{i=0}^{m−1} y(i) a_i ,   y(i) = a_i^T x
  - Define

      x̂ = Σ_{i=0}^{ℓ−1} y(i) a_i

  - The Karhunen–Loève transform minimizes the square error:

      E[‖x − x̂‖²] = E[‖ Σ_{i=ℓ}^{m−1} y(i) a_i ‖²]

  - The error is:

      E[‖x − x̂‖²] = Σ_{i=ℓ}^{m−1} λ_i

    It can also be shown that this is the minimum mean square error compared to any other representation of x by an ℓ-dimensional vector.
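
A small numerical check of this property on synthetic zero-mean data (all names illustrative): the mean square reconstruction error from the ℓ principal eigenvectors coincides with the sum of the discarded eigenvalues:

import numpy as np

rng = np.random.default_rng(0)
m, ell, n = 5, 2, 200_000
X = rng.standard_normal((n, m)) @ rng.standard_normal((m, m))   # zero-mean data

Rx = (X.T @ X) / n
eigvals, A = np.linalg.eigh(Rx)
order = np.argsort(-eigvals)
A, eigvals = A[:, order], eigvals[order]

Y = X @ A[:, :ell]                    # keep the ell principal components
X_hat = Y @ A[:, :ell].T              # reconstruction x_hat in the original space
mse = np.mean(np.sum((X - X_hat) ** 2, axis=1))
print(mse, eigvals[ell:].sum())       # the two values coincide (up to floating-point error)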

  - In other words, x̂ is the projection of x into the subspace spanned by the ℓ principal eigenvectors.
    However, for Pattern Recognition this is not always the best solution.

• Total variance: It is easily seen that

      σ²_y(i) = E[y²(i)] = λ_i

  Thus the Karhunen–Loève transform makes the total variance maximum.

• Assuming y to be a zero-mean multivariate Gaussian, the Karhunen–Loève transform maximizes the entropy

      H_y = −E[ln P_y(y)]

  of the resulting process.

➢ Subspace Classification. Following the idea of projecting into a subspace, subspace classification assigns an unknown x to the class whose subspace is closer to x.
  The following steps are in order:

  • For each class, estimate the autocorrelation matrix R_i and compute the m largest eigenvalues. Form A_i by using the respective eigenvectors as columns.

  • Classify x to the class ω_i for which the norm of the subspace projection is maximum:

      ‖A_i^T x‖ > ‖A_j^T x‖ ,  ∀ j ≠ i

  According to the Pythagorean theorem, this corresponds to the subspace to which x is closer.
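
A sketch of the resulting classifier in NumPy, under the assumptions above; class_data, fit_subspaces and classify are illustrative names, and the per-class autocorrelation matrices are estimated directly from the class samples:

import numpy as np

def fit_subspaces(class_data, n_dims):
    # class_data: list of (n_i, m) sample arrays, one per class.
    bases = []
    for Xi in class_data:
        Ri = (Xi.T @ Xi) / Xi.shape[0]                       # autocorrelation matrix R_i
        eigvals, V = np.linalg.eigh(Ri)
        bases.append(V[:, np.argsort(-eigvals)[:n_dims]])    # A_i: principal eigenvectors
    return bases

def classify(x, bases):
    # Assign x to the class whose subspace projection has the largest norm ||A_i^T x||.
    return int(np.argmax([np.linalg.norm(A.T @ x) for A in bases]))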

● Independent Component Analysis (ICA)
In contrast to PCA, where the goal was to produce uncorrelated
features, the goal in ICA is to produce statistically
independent features. This is a much stronger requirement, involving statistics of order higher than two. In this way, one may overcome the problems of PCA, as discussed before.
➢ The goal: Given x, compute y ∈ ℜ^ℓ,

      y = W x

  so that the components of y are statistically independent. In order for the problem to have a solution, the following assumptions must be valid:
  • Assume that x is indeed generated by a linear combination of independent components:

      x = Φ y
Φ is known as the mixing matrix and W as the demixing
matrix.
• Φ must be invertible or of full column rank.
• Identifiability condition: All independent components, y(i),
must be non-Gaussian. Thus, in contrast to PCA, which can always be performed, ICA is meaningful only for non-Gaussian variables.
• Under the above assumptions, the y(i)'s can be uniquely estimated, up to a scalar factor.

➢ Comon's method: Given x, and under the previously stated assumptions, the following steps are adopted:
  • Step 1: Perform PCA on x:

      ŷ = A^T x

  • Step 2: Compute a unitary matrix Â, so that the fourth-order cross-cumulants of the transformed vector

      y = Â^T ŷ

    are zero. This is equivalent to searching for an Â that makes the squares of the auto-cumulants maximum:

      max_{Â Â^T = I} Ψ(Â) = Σ_i (k_4(y(i)))²

    where k_4(·) is the 4th-order auto-cumulant.
  • Step 3:

      W = (A Â)^T
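
The steps can be sketched for two-dimensional, zero-mean data as below (illustrative names). The sketch also normalizes the variances after PCA (whitening), a common practical variant, and returns the estimated components directly instead of forming W as in Step 3; the unitary matrices in 2-D are parameterized as rotations and the best one is found by a simple search:

import numpy as np

def comon_ica_2d(X, n_angles=180):
    # Step 1: PCA, with the components scaled to unit variance (whitening).
    Rx = (X.T @ X) / X.shape[0]
    eigvals, A = np.linalg.eigh(Rx)
    Y_hat = (X @ A) / np.sqrt(eigvals)

    def k4(v):                                    # 4th-order auto-cumulant of a zero-mean signal
        return np.mean(v ** 4) - 3 * np.mean(v ** 2) ** 2

    # Step 2: find the rotation A_hat maximizing the sum of squared auto-cumulants.
    best_score, best_Y = -np.inf, None
    for theta in np.linspace(0.0, np.pi / 2, n_angles):
        c, s = np.cos(theta), np.sin(theta)
        A_hat = np.array([[c, -s], [s, c]])
        Y = Y_hat @ A_hat                         # y = A_hat^T y_hat (one sample per row)
        score = sum(k4(Y[:, i]) ** 2 for i in range(2))
        if score > best_score:
            best_score, best_Y = score, Y
    return best_Y                                 # estimated independent components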

➢ A hierarchy of components: which ℓ to use? In PCA one chooses the principal ones. In ICA one can choose the ones with the least resemblance to the Gaussian pdf.

➢ Example:

  The principal component is α_1; thus, according to PCA, one chooses as y the projection of x onto α_1. According to ICA, one chooses as y the projection onto α_2, which is the least Gaussian. Indeed:

      k_4(y_1) = −1.7
      k_4(y_2) = 0.1

  Observe that across α_2 the statistics are bimodal, that is, with no resemblance to a Gaussian.
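
The non-Gaussianity argument can be reproduced with a quick check (the numbers below are illustrative, not those of the example): a bimodal projection gives a clearly negative fourth-order cumulant, while a near-Gaussian one stays close to zero:

import numpy as np

rng = np.random.default_rng(1)
n = 100_000
y_bimodal = rng.choice([-1.0, 1.0], size=n) + 0.2 * rng.standard_normal(n)
y_gauss = rng.standard_normal(n)

def k4(v):
    v = v - v.mean()                   # 4th-order cumulant of a zero-mean signal
    return np.mean(v ** 4) - 3 * np.mean(v ** 2) ** 2

print(k4(y_bimodal))    # strongly negative: bimodal, far from Gaussian
print(k4(y_gauss))      # close to zero: consistent with a Gaussian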
