Visualization 9: Dimensionality Reduction
Data Visualization
Dimensionality Reduction
Y. Raymond Fu
Professor
Electrical and Computer Engineering (ECE), COE
Khoury College of Computer Sciences (KCCS)
Northeastern University
Attribute Dimensions and Orders
• Dimensions
– 1D: scalar
– 2D: two-dimensional vector
– 3D: three-dimensional vector
– >3D: multi-dimensional vector
• Orders
– scalars
– vectors (1st order)
– matrices (2nd order)
– tensors (higher order)
Data Table
www.many-eyes.com
Courtesy of Prof. Hanspeter Pfister, Harvard University.
Univariate Data Representations
MATLAB box plot
https://www.youtube.com/watch?v=wvsE8jm1GzE
What if the dimension of the data is 4, 5, 6, or even more?
High Dimensional Data
• Multimedia
– High-resolution images
– High-resolution videos
– Data from multiple sensors
• Bioinformatics
– Expressions of genes
– Neurons
• Social networks
– Tweets/likes/friendships
– Other interactions
• Weather and climate
– Multiple measurements (e.g., temperature)
– Time series data
• Finance
– Stock markets
– Time series data
Motivation and Goal of DR
• Reduce the degree of freedom in measurements
Replace a large set of measured variables with a small set of more
“condensed” variables
Simpler models are more robust on small datasets
• Reduce the computational load
By reducing the dimensionality of data, the computational burden
(time and space) could be greatly decreased.
• Visualization
“Looking at the data”—more interpretable; simpler explanations
Make sense of the data before processing
Goal
• Extract information hidden in the data
Detect variables relevant for a specific task and how variables interact with each other. Reformulate the data with fewer variables.
Samuel Kaski, Jaakko Peltonen: Dimensionality Reduction for Data Visualization [Applications Corner]. IEEE Signal Process. Mag. 28(2): 100-104 (2011)
Motivation and Goal of DR
• Feature Extraction
Transform the original data set X from the d-dimensional space to a k-dimensional space (k < d).
A general problem: Y = PᵀX, where X ∈ ℝᵈ and Y ∈ ℝᵏ (see the sketch below).
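A minimal NumPy sketch of this projection, assuming the data are stored column-wise and using a random orthonormal P purely as a placeholder for whatever basis a DR method would learn:

```python
# Feature extraction as a linear projection Y = P^T X.
# X holds n samples as columns (d x n); P (d x k) is any learned basis.
import numpy as np

rng = np.random.default_rng(0)
d, k, n = 64, 2, 500            # original dim, reduced dim, number of samples
X = rng.normal(size=(d, n))     # placeholder data; replace with real measurements

# P is a random orthonormal basis purely for illustration;
# PCA, LDA, etc. differ only in how P is chosen.
P, _ = np.linalg.qr(rng.normal(size=(d, k)))

Y = P.T @ X                     # k x n: each column is a reduced representation
print(Y.shape)                  # (2, 500)
```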
Statistics & Linear Algebra Background
• Given a set of n data points {xₖ} in ℝᵈ
– The mean is E{x} ≈ (1/n) Σₖ xₖ
Parametric vs. Nonparametric Learning
• Parametric Model
– Use a parameterized family of probability distributions to describe the nature
of a set of data (Moghaddam & Pentland, 1997).
– The data distribution is empirically assumed or estimated.
– Learning is conducted by measuring a set of fixed parameters, such as mean
and variance.
– Effective for large samples, but degrades for complicated data distributions.
• Nonparametric Model
– Distribution free.
– Learning is conducted by measuring the pair-wise data relationship in both
global and local manners.
– Effective and robust due to the reliance on fewer assumptions and parameters.
– Works for cases with small samples, high dimensionality, and complicated data
distributions.
Parametric Model
• Principal Component Analysis (PCA) and Linear Discriminant
Analysis (LDA)
• PCA captures the "principal" variations in the data
• It is computed by finding the eigenvectors of the covariance matrix of the data
• Geometrically, PCA finds the directions of largest variation in the underlying data
• Can be applied in data compression, pattern recognition, etc.
• Find a line going through the data mean and along the direction of maximum variation of the data.
• Assuming zero-mean data, the line is represented as y = wᵀx, where w is the basis vector with wᵀw = 1 (see the sketch below).
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
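A hedged NumPy sketch of this computation: center the data, form the covariance matrix, and keep the top-k eigenvectors as the maximum-variance directions (the function name and interface are illustrative, not from the slides):

```python
import numpy as np

def pca(X, k):
    """X: n x d array (rows are samples). Returns (mean, W) with W of shape d x k."""
    mu = X.mean(axis=0)
    Xc = X - mu                          # center: PCA assumes zero-mean data
    C = np.cov(Xc, rowvar=False)         # d x d covariance matrix
    vals, vecs = np.linalg.eigh(C)       # ascending eigenvalues for symmetric C
    order = np.argsort(vals)[::-1][:k]   # keep the k largest-variance directions
    return mu, vecs[:, order]

# Usage: project onto the first principal component, y = w^T (x - mean)
X = np.random.default_rng(1).normal(size=(200, 5))
mu, W = pca(X, k=1)
y = (X - mu) @ W
```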
Principal Component Analysis
• OptDigits Dataset
The data set contains 5,620 instances of digitized handwritten digits in the range 0-9.
Each digit is a vector in ℝ⁶⁴: 8 × 8 = 64 pixels.
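An illustrative way to reproduce this kind of visualization, assuming scikit-learn is available; its bundled digits set (1,797 samples) stands in here for the full 5,620-instance OptDigits data:

```python
# Project the 8x8 digit images to 2D with PCA and scatter-plot them by class.
import matplotlib.pyplot as plt
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

digits = load_digits()                      # X: (1797, 64), y: digit labels 0-9
Y = PCA(n_components=2).fit_transform(digits.data)

plt.scatter(Y[:, 0], Y[:, 1], c=digits.target, cmap="tab10", s=8)
plt.colorbar(label="digit")
plt.title("Handwritten digits projected to 2D by PCA")
plt.show()
```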
Principal Component Analysis
Eigenvectors visualized as eigenfaces
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
Linear Discriminant Analysis
• Instead of PCA, LDA finds a discriminant subspace by including class label information in the subspace modeling (supervised learning).
– Compute the within-class scatter
– Compute the between-class scatter
– Maximize the between-class scatter while minimizing the within-class scatter (see the sketch below)
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
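A sketch of the scatter-based computation outlined above, using NumPy and a pseudo-inverse for numerical stability (the helper name and interface are assumptions, not from the slides):

```python
# Within-class scatter Sw, between-class scatter Sb, and a projection W that
# maximizes between-class relative to within-class scatter (Sw^-1 Sb eigenvectors).
import numpy as np

def lda(X, y, k):
    """X: n x d (rows are samples), y: class labels. Returns a d x k projection."""
    d = X.shape[1]
    mu = X.mean(axis=0)
    Sw = np.zeros((d, d))
    Sb = np.zeros((d, d))
    for c in np.unique(y):
        Xc = X[y == c]
        mc = Xc.mean(axis=0)
        Sw += (Xc - mc).T @ (Xc - mc)            # within-class scatter
        diff = (mc - mu).reshape(-1, 1)
        Sb += len(Xc) * (diff @ diff.T)          # between-class scatter
    vals, vecs = np.linalg.eig(np.linalg.pinv(Sw) @ Sb)
    order = np.argsort(vals.real)[::-1][:k]      # leading discriminant directions
    return vecs[:, order].real
```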
LDA Definition
PCA vs. LDA
PCA LDA
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
PCA vs. LDA
PCA LDA
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
PCA vs. LDA
• PCA performs worse under this condition.
• LDA (FLD, Fisher Linear Discriminant) provides a better low-dimensional representation.
Manifold
http://en.wikipedia.org/wiki/Manifold
Manifold Learning
Swiss Roll: dimensionality reduction unrolls the 3D manifold into a 2D embedding (see the sketch below).
http://www.cs.toronto.edu/~roweis/lle/
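A hedged sketch of unrolling the Swiss roll with locally linear embedding, using scikit-learn's implementation rather than the original LLE code linked above; the neighbor count of 12 is an arbitrary illustrative choice:

```python
# Embed the 3D Swiss roll into 2D with LLE and color points by their roll position.
import matplotlib.pyplot as plt
from sklearn.datasets import make_swiss_roll
from sklearn.manifold import LocallyLinearEmbedding

X, color = make_swiss_roll(n_samples=1500, random_state=0)   # 3D manifold data
Y = LocallyLinearEmbedding(n_neighbors=12, n_components=2).fit_transform(X)

plt.scatter(Y[:, 0], Y[:, 1], c=color, cmap="Spectral", s=8)
plt.title("Swiss roll unrolled to 2D by LLE")
plt.show()
```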
LEA for Pose Manifold
Linear embedding and subspace projection of 400 rotating teapot images. The number of nearest neighbors is k = 6.
Yun Fu, et al. "Locally Adaptive Subspace and Similarity Metric Learning for Visual Clustering and Retrieval", CVIU, Vol. 110, No. 3, pp. 390-402, 2008.
LEA for Expression Manifold
Manifold visualization of 1,965 Frey’s face images by LEA using k = 6 nearest neighbors.
Yun Fu, et al. "Locally Adaptive Subspace and Similarity Metric Learning for Visual Clustering and Retrieval", CVIU, Vol. 110, No. 3, pp. 390-402, 2008.
LEA for Emotion State Manifold
Manifold visualization for 11,627 AAI sequence images of a male subject using the LLE algorithm. (a) A video frame snapshot and the 3D face tracking result; the yellow mesh visualizes the geometric motion of the face. (b) Manifold visualization with k = 5 nearest neighbors. (c) k = 8 nearest neighbors. (d) k = 15 nearest neighbors and labeling results.
Yun Fu, et al. "Locally Adaptive Subspace and Similarity Metric Learning for Visual Clustering and Retrieval", CVIU, Vol. 110, No. 3, pp. 390-402, 2008.
LEA for Head Pose Manifold
Fisher Graph
• Graph Embedding (S. Yan, IEEE TPAMI, 2007)
– G = {X, W} is an undirected weighted graph.
– W measures the similarity between each pair of vertices.
– Laplacian matrix: L = D − W, where D is the diagonal degree matrix with Dᵢᵢ = Σⱼ Wᵢⱼ (see the sketch below).
Y. Fu, et al., IEEE Transactions on Information Forensics and Security, 2008.
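A minimal sketch of the graph ingredients above: a symmetrized k-NN similarity matrix W (Gaussian weights are an assumed choice here, not specified on the slide) and the Laplacian L = D − W:

```python
import numpy as np
from scipy.spatial.distance import cdist

def knn_graph_laplacian(X, k=6, sigma=1.0):
    """X: n x d data matrix. Returns (W, L) for a symmetrized k-NN graph."""
    D2 = cdist(X, X, "sqeuclidean")
    W = np.zeros_like(D2)
    for i in range(len(X)):
        nbrs = np.argsort(D2[i])[1:k + 1]            # skip the point itself
        W[i, nbrs] = np.exp(-D2[i, nbrs] / (2 * sigma**2))
    W = np.maximum(W, W.T)                           # symmetrize: undirected graph
    L = np.diag(W.sum(axis=1)) - W                   # Laplacian L = D - W
    return W, L
```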
Similarity Metric
• Single-Sample Metric
– Euclidean Distance and Pearson Correlation Coefficient.
• Multi-Sample Metric
– k-Nearest-Neighbor Simplex
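A quick sketch of the two single-sample metrics named above, computed for a pair of feature vectors:

```python
# Euclidean distance and Pearson correlation coefficient between two vectors.
import numpy as np

def euclidean(x, y):
    return np.linalg.norm(x - y)

def pearson(x, y):
    xc, yc = x - x.mean(), y - y.mean()
    return float(xc @ yc / (np.linalg.norm(xc) * np.linalg.norm(yc)))

x, y = np.random.default_rng(2).normal(size=(2, 64))
print(euclidean(x, y), pearson(x, y))
```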
Correlation Embedding Analysis
Objective Function
Y. Fu, et al., IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
High-Order Data Structure
– m-th order tensors
– Here, tensor means a multilinear representation.
Illustration: vector, matrix, and tensor as first-, second-, and higher-order data.
Y. Fu, et al., IEEE Transactions on Circuits and Systems for Video Technology, 2009.
Correlation Tensor Analysis
Given two m-th order tensors, the Pearson Correlation Coefficient (PCC) is defined over m different subspaces.
Y. Fu, et al., IEEE Transactions on Image Processing, 2008.
Manifold with Noise Effect
Robust Manifold by Low-Rank Recovery
Real-world ATR data are large scale, unbalanced in dynamic sampling, and easily affected by noise and outliers, which are difficult to represent.
Automated, real-time, and robust description of the ATR data space under uncertainty.
Seung-Hee Bae, Jong Youl Choi, Judy Qiu, Geoffrey Fox: Dimension reduction and visualization of large high-dimensional data via interpolation. HPDC 2010: 203-214
Applications
Biology data visualization
DR algorithm: principal component analysis (PCA)
Andreas Lehrmann, Michael Huber, Aydin Can Polatkan, Albert Pritzkau, Kay Nieselt: Visualizing dimensionality reduction of systems biology data. Data Min. Knowl. Discov. 27(1): 146-165 (2013)
Applications
Biology data visualization
DR algorithm: locally linear embedding (LLE)
Andreas Lehrmann, Michael Huber, Aydin Can Polatkan, Albert Pritzkau, Kay Nieselt: Visualizing dimensionality reduction of systems biology data. Data Min. Knowl. Discov. 27(1): 146-165 (2013)
Applications
Bioinformatics
DR algorithm: multidimensional scaling (MDS)
Adam Hughes, Yang Ruan, Saliya Ekanayake, Seung-Hee Bae, Qunfeng Dong, Mina Rho, Judy Qiu, Geoffrey Fox: Interpolative multidimensional scaling techniques for the identification of clusters in very large sequence sets. BMC Bioinformatics 13(S-2): S9 (2012)
Applications
Metagenomic data visualization
DR algorithm: stochastic neighbor embedding (SNE)
CC Laczny, N Pinel, N Vlassis, P Wilmes: Alignment-free Visualization of Metagenomic Data by Nonlinear Dimension Reduction. Scientific Reports, 4 (2014).
Applications
Neuroscience
DR algorithm: multiple algorithms
J. P. Cunningham and B. M. Yu: Dimensionality reduction for large-scale neural recordings. Nature Neuroscience, (2014), doi:10.1038/nn.3776.
Applications
Semantic visualization in data mining
DR algorithm: spherical semantic embedding (SSE).
Tuan M. V. Le, Hady Wirawan Lauw: Semantic visualization for spherical representation. KDD (2014): 1007-1016.
Applications
Visualization of machine learning datasets
DR algorithm: stochastic neighbor embedding (SNE)
Zhirong Yang, Jaakko Peltonen, Samuel Kaski: Scalable Optimization of Neighbor Embedding for Visualization. ICML (2) 2013: 127-135
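For reference, a hedged illustration of SNE-style visualization using scikit-learn's t-SNE variant (not the cited papers' exact methods) on a small machine-learning dataset:

```python
# Embed a machine-learning dataset into 2D with t-SNE and color points by class.
import matplotlib.pyplot as plt
from sklearn.datasets import load_digits
from sklearn.manifold import TSNE

digits = load_digits()
Y = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(digits.data)

plt.scatter(Y[:, 0], Y[:, 1], c=digits.target, cmap="tab10", s=8)
plt.title("Digits embedded in 2D by t-SNE")
plt.show()
```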
Transfer Learning in Dimension Reduction
Recent Advances: Transfer Learning in DR
Learning framework spanning object recognition and face recognition tasks.
Ming Shao, Carlos Castillo, Zhenghong Gu, Yun Fu: Low-Rank Transfer Subspace Learning. ICDM (2012): 1104-1109.
Recent Advances: Robust Subspace Discovery
Low-rank matrix recovery
Learning Framework
Sheng Li, Yun Fu: Robust Subspace Discovery through Supervised Low-Rank Constraints. SDM 2014: 163-171
Self-Taught Low-Rank Coding for Visual Learning