Things To Remember - Principal Component Analysis
What is PCA?
Let’s say that you want to predict what the gross domestic product (GDP) of India will be for 2019. You have lots of information available: GDP for the first quarter of 2019, GDP for the entirety of 2018, 2017, and so on. You have every publicly available economic indicator, like the unemployment rate, inflation rate, and so on. You have Census data from 2012 estimating how many Indians work in each industry, and Indian statistical data updating those estimates between censuses. You could gather stock price data, or the number of IPOs occurring in a year. This is already an overwhelming number of variables to consider, and it only scratches the surface.
With so many variables at hand, it would be difficult to decide which ones to focus on. In technical terms, you want to reduce the dimension of your feature space. Reducing the dimension of the feature space is called “dimensionality reduction.”
Principal component analysis is a technique for dimensionality reduction: it combines the input variables in a specific way so that the “least important” variables can be dropped while the most valuable parts of all of the variables are still retained. As an added benefit, the “new” variables produced by PCA are all uncorrelated with one another. This is useful because linear models assume the predictors are free of multicollinearity; if we decide to fit a linear regression model with these “new” variables (see “principal component regression” below), that assumption will necessarily be satisfied.
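The uncorrelated-components claim is easy to verify numerically. Below is a minimal NumPy sketch (using synthetic data, not the GDP example above) that builds two strongly correlated variables, projects them onto the eigenvectors of their covariance matrix, and checks the correlation before and after:

```python
import numpy as np

rng = np.random.default_rng(0)

# Two strongly correlated input variables.
x = rng.normal(size=500)
X = np.column_stack([x, 0.8 * x + 0.2 * rng.normal(size=500)])

# Centre the data, then project onto the eigenvectors of the covariance matrix.
Xc = X - X.mean(axis=0)
eigvals, eigvecs = np.linalg.eigh(np.cov(Xc, rowvar=False))
scores = Xc @ eigvecs  # the "new" variables (principal components)

# The original variables are highly correlated...
print(abs(np.corrcoef(X, rowvar=False)[0, 1]) > 0.9)       # → True
# ...but the principal components are uncorrelated (up to float precision).
print(abs(np.corrcoef(scores, rowvar=False)[0, 1]) < 1e-8)  # → True
```

This works because projecting onto the eigenvectors diagonalises the covariance matrix, so the off-diagonal (cross-component) covariances vanish.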
Before using PCA, ask yourself three questions:
1. Do you want to reduce the number of variables, but aren’t able to identify variables to remove from consideration entirely?
2. Do you want to ensure your variables are independent of one another?
3. Are you comfortable making your independent variables less interpretable?
If you answered “yes” to all three questions, then PCA is a good method to use. If you answered “no” to
question 3, you should not use PCA.
Steps for PCA
1. Standardise the data: subtract each dimension’s mean to shift the data points so that the data is centred on the origin.
2. Compute the covariance matrix (or correlation matrix) across all the dimensions.
3. Perform eigendecomposition, that is, compute the eigenvectors, which are the principal components, and the corresponding eigenvalues, which are the magnitudes of variance each component captures.
4. Sort the eigenpairs in descending order of eigenvalue and select the top ones. The eigenvector with the largest eigenvalue is the first principal component, which captures the most information from the original data.
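The four steps above can be sketched directly in NumPy. This is an illustrative implementation on synthetic data (the mixing matrix and sample sizes are arbitrary choices), not a production routine:

```python
import numpy as np

def pca(X, k):
    """Sketch of the steps above: centre, covariance, eigendecompose, sort."""
    # Step 1: centre the data (subtract each column's mean).
    Xc = X - X.mean(axis=0)
    # Step 2: covariance matrix of the centred data.
    cov = np.cov(Xc, rowvar=False)
    # Step 3: eigendecomposition (eigh, since covariance matrices are symmetric).
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Step 4: sort eigenpairs by descending eigenvalue and keep the top k.
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    return Xc @ eigvecs[:, :k], eigvals

# Synthetic data where one direction carries most of the variance.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 3)) @ np.array([[2.0, 0, 0],
                                          [0.5, 1.0, 0],
                                          [0, 0, 0.1]])
scores, eigvals = pca(X, k=1)

# The eigenvalues come back sorted, largest (first component) first.
print(eigvals[0] > eigvals[1] > eigvals[2])  # → True
```

Keeping `k` components reduces the feature space from three dimensions to `k` while discarding the directions with the least variance.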
Limitations of PCA
1. PCA’s effectiveness depends upon the scales of the attributes. If the attributes are on different scales, PCA will pick the variable with the highest variance rather than weighing attributes by their correlation structure.
2. Changing the scales of the variables can therefore change the principal components.
3. Interpreting the principal components can become challenging when discrete data are present.
4. Skew in the data, with a long thick tail, can impact the effectiveness of PCA (related to point 1, since skew inflates variance).
5. PCA assumes a linear relationship between attributes; it is ineffective when the relationships are non-linear.
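The scaling limitation is easy to demonstrate. In this sketch (the height/weight example and its distributions are illustrative assumptions), the first principal component of the raw data is dominated by the high-variance variable, while standardising lets both variables contribute:

```python
import numpy as np

rng = np.random.default_rng(2)

# Height in metres (tiny variance) and weight in kg (large variance).
height_m = rng.normal(1.7, 0.1, size=300)
weight_kg = rng.normal(70.0, 12.0, size=300)
X = np.column_stack([height_m, weight_kg])

def first_pc(X):
    """Eigenvector of the covariance matrix with the largest eigenvalue."""
    Xc = X - X.mean(axis=0)
    eigvals, eigvecs = np.linalg.eigh(np.cov(Xc, rowvar=False))
    return eigvecs[:, np.argmax(eigvals)]

# Unscaled: the first PC points almost entirely along the weight axis.
print(np.round(np.abs(first_pc(X)), 2))   # → [0. 1.]

# Standardised to unit variance: both variables contribute roughly equally.
Xs = X / X.std(axis=0)
print(np.round(np.abs(first_pc(Xs)), 2))  # → [0.71 0.71]
```

This is why standardising (step 1 above, extended to divide by the standard deviation) is usually applied before PCA when the attributes are measured in different units.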