Week 6 (PCA, SVD, LDA)
0.1 WEEK-6:
Feature Extraction (use applicable packages):
1. Principal Component Analysis (PCA)
2. Singular Value Decomposition (SVD)
3. Linear Discriminant Analysis (LDA)
4. Feature Subset Selection
[8]: from sklearn import datasets

df = datasets.load_iris()
df
[8]: {'data': array([[5.1, 3.5, 1.4, 0.2],
        [4.9, 3. , 1.4, 0.2],
        …,
        [6.2, 3.4, 5.4, 2.3],
        [5.9, 3. , 5.1, 1.8]]),
 'target': array([0, 0, …, 0, 1, 1, …, 1, 2, 2, …, 2]),
 'frame': None,
 'target_names': array(['setosa', 'versicolor', 'virginica'], dtype='<U10'),
 'DESCR': '.. _iris_dataset:\n\nIris plants dataset\n--------------------\n\n
 **Data Set Characteristics:**\n\n :Number of Instances: 150 (50 in each of
 three classes)\n :Number of Attributes: 4 numeric, predictive attributes and
 the class\n :Attribute Information:\n  - sepal length in cm\n  - sepal width
 in cm\n  - petal length in cm\n  - petal width in cm\n  - class: Iris-Setosa,
 Iris-Versicolour, Iris-Virginica\n\n :Summary Statistics:\n\n
                Min  Max  Mean  SD    Class Correlation\n
  sepal length: 4.3  7.9  5.84  0.83   0.7826\n
  sepal width:  2.0  4.4  3.05  0.43  -0.4194\n
  petal length: 1.0  6.9  3.76  1.76   0.9490 (high!)\n
  petal width:  0.1  2.5  1.20  0.76   0.9565 (high!)\n\n
 :Missing Attribute Values: None\n :Class Distribution: 33.3% for each of 3
 classes.\n :Creator: R.A. Fisher\n :Donor: Michael Marshall
 (MARSHALL%PLU@io.arc.nasa.gov)\n :Date: July, 1988\n\nThe famous Iris
 database, first used by Sir R.A. Fisher. The dataset is taken from Fisher's
 paper. Note that it's the same as in R, but not as in the UCI Machine Learning
 Repository, which has two wrong data points.\n\nThis is perhaps the best known
 database to be found in the pattern recognition literature. Fisher's paper is
 a classic in the field and is referenced frequently to this day. (See Duda &
 Hart, for example.) The data set contains 3 classes of 50 instances each,
 where each class refers to a type of iris plant. One class is linearly
 separable from the other 2; the latter are NOT linearly separable from each
 other.\n\n.. topic:: References\n\n - Fisher, R.A. "The use of multiple
 measurements in taxonomic problems", Annual Eugenics, 7, Part II, 179-188
 (1936); also in "Contributions to Mathematical Statistics" (John Wiley, NY,
 1950).\n - Duda, R.O., & Hart, P.E. (1973) Pattern Classification and Scene
 Analysis. (Q327.D83) John Wiley & Sons. ISBN 0-471-22361-1. See page 218.\n
 - Dasarathy, B.V. (1980) "Nosing Around the Neighborhood: A New System
 Structure and Classification Rule for Recognition in Partially Exposed
 Environments". IEEE Transactions on Pattern Analysis and Machine Intelligence,
 Vol. PAMI-2, No. 1, 67-71.\n - Gates, G.W. (1972) "The Reduced Nearest
 Neighbor Rule". IEEE Transactions on Information Theory, May 1972, 431-433.\n
 - See also: 1988 MLC Proceedings, 54-64. Cheeseman et al's AUTOCLASS II
 conceptual clustering system finds 3 classes in the data.\n - Many, many
 more …',
 'feature_names': ['sepal length (cm)',
  'sepal width (cm)',
  'petal length (cm)',
  'petal width (cm)'],
 'filename': 'iris.csv',
 'data_module': 'sklearn.datasets.data'}
0.2 1. Principal Component Analysis (PCA)
[9]: # Determining the initial dimensions of the dataset
x = df.data
y = df.target
print(x.shape, y.shape)
(150, 4) (150,)
from sklearn.decomposition import PCA

pca = PCA()
X_new = pca.fit_transform(x)
[12]: cov_mat = pca.get_covariance()
[13]: cov_mat
[13]: array([…,
       [ 0.51627069, -0.12163937,  1.2956094 ,  0.58100626]])
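The cell that produced the eigenvalues and eigenvectors below isn't shown; a minimal sketch, assuming NumPy's eigendecomposition of the covariance matrix computed above:

[ ]: import numpy as np

# Eigenvalues give the variance along each principal axis;
# eigenvectors (one per column) give the axes themselves
eig_vals, eig_vecs = np.linalg.eig(cov_mat)
print('Eigenvalues\n', eig_vals)
print('Eigenvectors\n', eig_vecs)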
Eigenvalues
[4.22824171 0.24267075 0.0782095 0.02383509]
Eigenvectors
[[ 0.36138659 -0.65658877 -0.58202985 0.31548719]
[-0.08452251 -0.73016143 0.59791083 -0.3197231 ]
[ 0.85667061 0.17337266 0.07623608 -0.47983899]
[ 0.3582892 0.07548102 0.54583143 0.75365743]]
[15]: # Refit PCA keeping only the first two components
pca = PCA(n_components=2)
pca.fit(x)
[15]: PCA(n_components=2)
[17]: # Transforming the dataset from 4 dimensions into 2 dimensions using PCA
z = pca.transform(x)
z.shape
[17]: (150, 2)
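The plotting cell behind the observation below isn't shown; a minimal sketch, assuming matplotlib and the 2-D projection z from above:

[ ]: import matplotlib.pyplot as plt

# Scatter plot of the two principal components, colored by class
plt.scatter(z[:, 0], z[:, 1], c=y)
plt.show()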
0.2.1 Observation: The three classes appear to be well separated
0.3 Observation
0.3.1 Together, the first two principal components contain 97.76% of the information.
The first principal component contains 92.46% of the variance and the second principal component
contains 5.3% of the variance. The third and fourth principal components contain the rest of the
variance of the dataset.
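The cell behind these percentages isn't shown; a minimal sketch, assuming the fitted PCA object's explained_variance_ratio_ attribute:

[ ]: # Fraction of total variance captured by each principal component
print(pca.explained_variance_ratio_)
print(pca.explained_variance_ratio_[:2].sum())   # first two PCs together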
0.4 2. Singular Value Decomposition (SVD)
[22]: (150, 4)
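The setup cells for this section aren't shown; a minimal sketch, assuming the Iris data is reloaded under the name iris1 used below (its shape matches the output above):

[ ]: import numpy as np
import pandas as pd
from sklearn import datasets

iris1 = datasets.load_iris()
iris1.data.shape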
[25]: df_iris = pd.DataFrame(iris1.data, columns=iris1.feature_names)
df_iris.shape
[25]: (150, 4)
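The decomposition cell itself isn't shown; a minimal sketch, assuming numpy.linalg.svd with full_matrices=False, which matches the U shape reported below:

[ ]: # Thin SVD of the 150x4 data matrix: U (150x4), S (4,), Vt (4x4)
U_iris, S_iris, Vt_iris = np.linalg.svd(df_iris, full_matrices=False)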
[27]: U_iris.shape
[27]: (150, 4)
NOTE: numpy.linalg.svd does not return Σ as a diagonal matrix; it returns a 1-D array of
the entries on the diagonal (the singular values).
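So reconstructing the data matrix requires re-expanding Σ with np.diag; a quick check, assuming the arrays from the sketch above:

[ ]: # Verify the factorization: A ≈ U @ diag(S) @ Vt
approx = U_iris @ np.diag(S_iris) @ Vt_iris
print(np.allclose(approx, df_iris.values))   # True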
[28]: S_iris
[29]: Vt_iris
# Scatter plot of the first two original features, colored by class
import matplotlib.pyplot as plt
print(x.shape, y.shape)
plt.scatter(x[:, 0], x[:, 1], c=y)
(150, 4) (150,)
[33]: # After SVD: scatter plot of the projected data, colored by class
import matplotlib.pyplot as plt
# assuming the plot used the first two left-singular vectors (columns of U)
plt.scatter(U_iris[:, 0], U_iris[:, 1], c=y)
0.5 3. Linear Discriminant Analysis (LDA)
(150, 4)
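The cells that fit the model aren't shown; a minimal sketch, assuming scikit-learn's LinearDiscriminantAnalysis produced the X_r2 used in the cell below:

[ ]: from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# With 3 classes, LDA can produce at most 2 discriminant components
lda = LinearDiscriminantAnalysis(n_components=2)
X_r2 = lda.fit(x, y).transform(x)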
[50]: # Visualizing the 2D data in the form of a scatter plot
import numpy as np
colors = ['royalblue', 'red', 'tan']
vectorizer = np.vectorize(lambda cls: colors[cls % len(colors)])
plt.scatter(X_r2[:, 0], X_r2[:, 1], c=vectorizer(y))
0.6 Observation
0.6.1 LDA is able to separate the classes very well after dimensionality reduction
0.7 4. Feature Subset Selection
0.7.1 Filter approach
iris
       Id  SepalLengthCm  SepalWidthCm  PetalLengthCm  PetalWidthCm  \
0       1            5.1           3.5            1.4           0.2
1       2            4.9           3.0            1.4           0.2
2       3            4.7           3.2            1.3           0.2
3       4            4.6           3.1            1.5           0.2
4       5            5.0           3.6            1.4           0.2
..    ...            ...           ...            ...           ...
145   146            6.7           3.0            5.2           2.3
146   147            6.3           2.5            5.0           1.9
147   148            6.5           3.0            5.2           2.0
148   149            6.2           3.4            5.4           2.3
149   150            5.9           3.0            5.1           1.8
Species
0 Iris-setosa
1 Iris-setosa
2 Iris-setosa
3 Iris-setosa
4 Iris-setosa
.. …
145 Iris-virginica
146 Iris-virginica
147 Iris-virginica
148 Iris-virginica
149 Iris-virginica
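The selection cells themselves aren't shown; a minimal sketch of a filter approach, assuming scikit-learn's SelectKBest with the ANOVA F-score:

[ ]: from sklearn.feature_selection import SelectKBest, f_classif

# Filter approach: score each feature against the class label,
# independently of any model, then keep the k best features
selector = SelectKBest(score_func=f_classif, k=2)
x_selected = selector.fit_transform(x, y)
print(selector.scores_)     # per-feature ANOVA F-scores
print(x_selected.shape)     # (150, 2)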
0.9 Observation