1 More Kernels and Their Properties
Any learning algorithm that depends only on inner products of examples, and can therefore be run with kernels, is called a kernel method.
Nearest Neighbors: We next show that k-nearest neighbor (kNN) is also a kernel method. kNN classifies a new example by finding the k closest examples in the sample and taking a majority vote on their labels. So all we need is to compute distances between examples. We have
\begin{align*}
\|\phi(x) - \phi(y)\|^2 &= \sum_i \big(\phi_i(x) - \phi_i(y)\big)^2 \\
&= \sum_i \phi_i(x)^2 + \sum_i \phi_i(y)^2 - 2\sum_i \phi_i(x)\,\phi_i(y) \\
&= k(x, x) + k(y, y) - 2\,k(x, y).
\end{align*}
So indeed the distance can be calculated using 3 calls to the kernel function.
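To make this concrete, here is a minimal Python sketch (not part of the original notes) of a kernelized kNN classifier that uses only the three kernel calls above; the choice of a Gaussian kernel and the tiny dataset are assumptions made purely for illustration.

```python
import numpy as np
from collections import Counter

def rbf_kernel(x, y, sigma=1.0):
    # A Gaussian kernel, used here only as a stand-in for any valid kernel k(x, y).
    return np.exp(-np.sum((x - y) ** 2) / sigma)

def kernel_distance_sq(k, x, y):
    # Squared feature-space distance via three kernel calls:
    # ||phi(x) - phi(y)||^2 = k(x, x) + k(y, y) - 2 k(x, y)
    return k(x, x) + k(y, y) - 2 * k(x, y)

def kernel_knn_predict(k, X_train, y_train, x_new, num_neighbors=3):
    # Rank training examples by kernel-induced distance and take a majority vote.
    dists = [kernel_distance_sq(k, x_i, x_new) for x_i in X_train]
    nearest = np.argsort(dists)[:num_neighbors]
    return Counter(y_train[i] for i in nearest).most_common(1)[0][0]

# Tiny synthetic example (assumed data).
X_train = np.array([[0.0, 0.0], [0.1, 0.2], [2.0, 2.1], [2.2, 1.9]])
y_train = np.array([0, 0, 1, 1])
print(kernel_knn_predict(rbf_kernel, X_train, y_train, np.array([1.9, 2.0])))  # expected: 1
```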
The polynomial kernel: Let $x, y \in \mathbb{R}^n$. Define $k(x, y) = (\langle x, y \rangle + c)^d$ for $c, d \in \mathbb{R}$. This corresponds to a feature map $\phi$ whose features are polynomials in the original variables.
For example, if $d = 2$, then:
\begin{align*}
k(\vec{x}, \vec{y}) &= \Big(\sum_i x_i y_i + c\Big)^2 \\
&= \Big(\sum_i x_i y_i + c\Big)\Big(\sum_j x_j y_j + c\Big) \\
&= \sum_i \sum_j x_i y_i \cdot x_j y_j + 2c \sum_i x_i y_i + c^2 \\
&= \sum_i \sum_j (x_i x_j)(y_i y_j) + \sum_i (\sqrt{2c}\, x_i)(\sqrt{2c}\, y_i) + c^2.
\end{align*}
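As a sanity check, the following sketch (an illustration added here, with hypothetical helper names) compares the degree-2 polynomial kernel against an explicit feature map read off from the expansion above; the two inner products should coincide.

```python
import numpy as np
from itertools import product

def poly_kernel(x, y, c=1.0):
    # Degree-2 polynomial kernel k(x, y) = (<x, y> + c)^2.
    return (np.dot(x, y) + c) ** 2

def poly_features(x, c=1.0):
    # Explicit feature map matching the expansion above:
    # all products x_i * x_j, the coordinates scaled by sqrt(2c), and a constant feature c.
    pairs = [x[i] * x[j] for i, j in product(range(len(x)), repeat=2)]
    return np.array(pairs + list(np.sqrt(2 * c) * x) + [c])

x = np.array([1.0, 2.0, -1.0])
y = np.array([0.5, -1.0, 3.0])
print(np.isclose(poly_kernel(x, y), np.dot(poly_features(x), poly_features(y))))  # True
```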
The Gaussian kernel: is defined as $k(x, y) = e^{-\|\vec{x} - \vec{y}\|^2/\sigma}$. By using Taylor's expansion $e^a = 1 + a + \cdots + \frac{1}{k!}a^k + \cdots$ one can see that $e^{\vec{x} \cdot \vec{y}}$ is a kernel with (an infinite set of) features corresponding to polynomial terms. Then, since $e^{-\|\vec{x}-\vec{y}\|^2/\sigma} = e^{2\,\vec{x}\cdot\vec{y}/\sigma}\, e^{-\|\vec{x}\|^2/\sigma}\, e^{-\|\vec{y}\|^2/\sigma}$, we can rescale the exponent by $\sigma$ and divide the corresponding features by $e^{\|\vec{x}\|^2/\sigma}$ and $e^{\|\vec{y}\|^2/\sigma}$ to get the Gaussian kernel.
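A quick numerical check of this decomposition (a sketch with an assumed choice of x, y, and sigma):

```python
import numpy as np

def gaussian_kernel(x, y, sigma=2.0):
    # k(x, y) = exp(-||x - y||^2 / sigma), as defined above.
    return np.exp(-np.sum((x - y) ** 2) / sigma)

# Check: exp(-||x - y||^2 / sigma)
#      = exp(2 <x, y> / sigma) * exp(-||x||^2 / sigma) * exp(-||y||^2 / sigma)
x = np.array([0.3, -1.2, 0.7])
y = np.array([1.0, 0.4, -0.5])
sigma = 2.0
lhs = gaussian_kernel(x, y, sigma)
rhs = np.exp(2 * np.dot(x, y) / sigma) * np.exp(-np.dot(x, x) / sigma) * np.exp(-np.dot(y, y) / sigma)
print(np.isclose(lhs, rhs))  # True
```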
2 Linear Algebra
A quick review was given using slides; see the slide copies. The main result we need is as follows:
Any symmetric matrix $K$ with real-valued entries can be written in the form $K = PDP^T$, where $P = (\vec{V}_1, \vec{V}_2, \ldots, \vec{V}_m)$, the $\vec{V}_i$ are eigenvectors of $K$ that form an orthonormal basis (so we also have $P^T = P^{-1}$), and $D$ is a diagonal matrix with $D_{i,i} = \lambda_i$ being the corresponding eigenvalues.
A square matrix $A$ is positive semi-definite (PSD) iff for all vectors $c$ we have $c^T A c = \sum_i \sum_j c_i c_j A_{i,j} \ge 0$. It is well known that a symmetric matrix is positive semi-definite iff all of its eigenvalues are non-negative.
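As an illustration (not from the slides), NumPy's eigendecomposition for symmetric matrices can be used to verify the factorization and the eigenvalue test on a small example matrix:

```python
import numpy as np

# Build an arbitrary symmetric matrix and decompose it as K = P D P^T.
rng = np.random.default_rng(0)
A = rng.standard_normal((4, 4))
K = A + A.T                               # symmetric, but not necessarily PSD

eigenvalues, P = np.linalg.eigh(K)        # columns of P are orthonormal eigenvectors
D = np.diag(eigenvalues)

print(np.allclose(K, P @ D @ P.T))        # True: K = P D P^T
print(np.allclose(P.T @ P, np.eye(4)))    # True: P^T = P^{-1}
print(np.all(eigenvalues >= -1e-12))      # PSD iff True (may be False for this arbitrary K)
```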
3 Mercer’s Theorem
The sample $S = x_1, x_2, \ldots, x_m$ includes $m$ examples. The kernel (Gram) matrix $K$ is an $m \times m$ matrix containing the inner products between all pairs of examples, i.e., $K_{i,j} = k(x_i, x_j)$. $K$ is symmetric since $k(x, y) = k(y, x) = \phi(x) \cdot \phi(y)$.
Mercer's Theorem: A symmetric function $k(\cdot, \cdot)$ is a kernel iff for any finite sample $S$ the kernel matrix for $S$ is positive semi-definite.
One direction of the theorem is easy: if $k(\cdot, \cdot)$ is a kernel and $K$ is the kernel matrix with $K_{i,j} = k(x_i, x_j)$, then
\begin{align*}
c^T K c = \sum_i \sum_j c_i c_j K_{i,j} = \sum_i \sum_j c_i c_j\, \phi(x_i) \cdot \phi(x_j) = \Big(\sum_i c_i \phi(x_i)\Big) \cdot \Big(\sum_j c_j \phi(x_j)\Big) = \Big\|\sum_j c_j \phi(x_j)\Big\|^2 \ge 0.
\end{align*}
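The following sketch (illustrative, with an assumed random feature matrix) checks this chain of identities numerically: the quadratic form $c^T K c$ matches the squared norm of $\sum_j c_j \phi(x_j)$, and the Gram matrix of explicit feature vectors is PSD.

```python
import numpy as np

rng = np.random.default_rng(1)
Phi = rng.standard_normal((5, 3))      # row i is phi(x_i): 5 examples in a 3-dim feature space
K = Phi @ Phi.T                        # K_{i,j} = phi(x_i) . phi(x_j)

c = rng.standard_normal(5)
quad_form = c @ K @ c
norm_sq = np.sum((Phi.T @ c) ** 2)     # || sum_j c_j phi(x_j) ||^2

print(np.isclose(quad_form, norm_sq))             # True
print(np.all(np.linalg.eigvalsh(K) >= -1e-12))    # True: K is PSD
```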
For the other direction we will prove a weaker result.
Theorem: Consider a finite input space $X = \{x^1, x^2, \ldots, x^m\}$ and the kernel matrix $K$ over the entire space. If $K$ is positive semi-definite then $k(\cdot, \cdot)$ is a kernel function.
Proof: By the linear algebra facts above we can write $K = PDP^T$. Define a feature mapping into an $m$-dimensional space where the $l$th coordinate of the feature expansion for example $x^i$ is $\phi_l(x^i) = \sqrt{\lambda_l}\,(\vec{V}_l)_i$. (This is well defined since $K$ is PSD, so every $\lambda_l \ge 0$.)
The inner product is
\begin{align*}
\vec{\phi}(x^i) \cdot \vec{\phi}(x^j) = \sum_{l=1}^m \phi_l(x^i)\, \phi_l(x^j) = \sum_{l=1}^m \lambda_l (\vec{V}_l)_i (\vec{V}_l)_j.
\end{align*}
Now consider the $(i,j)$th entry of the matrix, $K_{i,j} = k(x^i, x^j)$. We have the following identities, where the last one proves the result:
\begin{align*}
K_{i,j} = [PDP^T]_{i,j} = \big[[PD]P^T\big]_{i,j} = \sum_{l=1}^m [PD]_{i,l}\,[P^T]_{l,j} = \sum_{l=1}^m \lambda_l P_{i,l} P_{j,l} = \sum_{l=1}^m \lambda_l (\vec{V}_l)_i (\vec{V}_l)_j = \vec{\phi}(x^i) \cdot \vec{\phi}(x^j).
\end{align*}
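Here is a short sketch (added for illustration, starting from an arbitrary PSD matrix built for the example) that constructs the features $\phi_l(x^i) = \sqrt{\lambda_l}\,(\vec{V}_l)_i$ from the eigendecomposition and confirms they reproduce $K$:

```python
import numpy as np

rng = np.random.default_rng(2)
B = rng.standard_normal((4, 4))
K = B @ B.T                              # a PSD kernel matrix over a 4-point input space

eigenvalues, P = np.linalg.eigh(K)       # K = P D P^T, with eigenvalues >= 0 since K is PSD
Phi = P * np.sqrt(np.clip(eigenvalues, 0, None))   # row i holds phi(x^i); column l is sqrt(lambda_l) * V_l

print(np.allclose(Phi @ Phi.T, K))       # True: the constructed feature map reproduces K
```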
Note that Mercer’s theorem allows us to work with a kernel function without knowing which
feature map it corresponds to or its relevance to the learning problem. This has often been used
in practical applications.
Proof of (1): Write $K = K_1 + K_2$, where we add the matrices component-wise. Then for every vector $\vec{x}$,
\begin{align*}
\vec{x}^T K \vec{x} = \vec{x}^T K_1 \vec{x} + \vec{x}^T K_2 \vec{x} \ge 0,
\end{align*}
since $K_1$ and $K_2$ are both PSD; hence $K$ is PSD and the claim follows from Mercer's theorem.
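A brief numerical illustration (with assumed data and kernel choices) that adding two Gram matrices preserves positive semi-definiteness:

```python
import numpy as np

rng = np.random.default_rng(3)
X = rng.standard_normal((6, 2))

K1 = X @ X.T                                               # linear-kernel Gram matrix
sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
K2 = np.exp(-sq_dists / 2.0)                               # Gaussian-kernel Gram matrix

K = K1 + K2
print(np.all(np.linalg.eigvalsh(K) >= -1e-9))              # True: the sum is still PSD
```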