0% found this document useful (0 votes)

93 views13 pages

Knn1 MinMaxScalar

The document loads and preprocesses the iris dataset for machine learning modeling. It loads the dataset, explores the data distribution, scales the feature variables using min-max scaling, and splits the data into features and target for modeling. The scaling is performed to standardize the variable ranges which may improve model performance.

Uploaded by

Joe1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

93 views13 pages

Knn1 MinMaxScalar

Uploaded by

Joe1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

9/7/2018 komal_knn1_minMaxScalar

In [36]: import numpy as np

import pandas as pd

import matplotlib.pyplot as plt

from sklearn.preprocessing import MinMaxScaler

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassifier
from sklearn import metrics

import seaborn as sns

sns.set(font_scale=1.5)
sns.set(style='white',color_codes=True)

In [2]: location = r"D:\komal\SIMPLILEARN\MY COURSES\IN PROGRESS\DATA SCIENCE WITH PYT

HON\Live class downloads\Aug 11 Sat - Sep 15 Sat - Attending\datasets\iris.cs
v"

In [3]: # load the training data from breast cancer data set
df_iris = pd.read_csv(location)
df_iris.head()

Out[3]:
sepal_length sepal_width petal_length petal_width class

0 5.1 3.5 1.4 0.2 Iris-setosa

1 4.9 3.0 1.4 0.2 Iris-setosa

2 4.7 3.2 1.3 0.2 Iris-setosa

3 4.6 3.1 1.5 0.2 Iris-setosa

4 5.0 3.6 1.4 0.2 Iris-setosa

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 1/13
9/7/2018 komal_knn1_minMaxScalar

In [4]: # Check the available styles

plt.style.available

Out[4]: ['bmh',
'classic',
'dark_background',
'fast',
'fivethirtyeight',
'ggplot',
'grayscale',
'seaborn-bright',
'seaborn-colorblind',
'seaborn-dark-palette',
'seaborn-dark',
'seaborn-darkgrid',
'seaborn-deep',
'seaborn-muted',
'seaborn-notebook',
'seaborn-paper',
'seaborn-pastel',
'seaborn-poster',
'seaborn-talk',
'seaborn-ticks',
'seaborn-white',
'seaborn-whitegrid',
'seaborn',
'Solarize_Light2',
'tableau-colorblind10',
'_classic_test']

In [5]: plt.style.use('ggplot')

In [6]: # Means are in the same order of magnitude for all features so scaling
# might not be beneficial.
# If mean values were of different orders of magnitude, scaling could
# significantly improve accuracy of a classifier.

df_iris.describe()

Out[6]:
sepal_length sepal_width petal_length petal_width

count 150.000000 150.000000 150.000000 150.000000

mean 5.843333 3.054000 3.758667 1.198667

std 0.828066 0.433594 1.764420 0.763161

min 4.300000 2.000000 1.000000 0.100000

25% 5.100000 2.800000 1.600000 0.300000

50% 5.800000 3.000000 4.350000 1.300000

75% 6.400000 3.300000 5.100000 1.800000

max 7.900000 4.400000 6.900000 2.500000

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 2/13
9/7/2018 komal_knn1_minMaxScalar

In [7]: X = df_iris.drop('class' , 1).values # drop target variable

y1 = df_iris['class'].values
y = df_iris['class']

In [8]: scaler = MinMaxScaler()

scaler

Out[8]: MinMaxScaler(copy=True, feature_range=(0, 1))

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 3/13
9/7/2018 komal_knn1_minMaxScalar

In [9]: X_scaled = scaler.fit_transform(X)

print('X_scaled type is', type(X_scaled))

X_scaled

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 4/13
9/7/2018 komal_knn1_minMaxScalar

X_scaled type is <class 'numpy.ndarray'>

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 5/13
9/7/2018 komal_knn1_minMaxScalar

Out[9]: array([[0.22222222, 0.625 , 0.06779661, 0.04166667],

[0.16666667, 0.41666667, 0.06779661, 0.04166667],
[0.11111111, 0.5 , 0.05084746, 0.04166667],
[0.08333333, 0.45833333, 0.08474576, 0.04166667],
[0.19444444, 0.66666667, 0.06779661, 0.04166667],
[0.30555556, 0.79166667, 0.11864407, 0.125 ],
[0.08333333, 0.58333333, 0.06779661, 0.08333333],
[0.19444444, 0.58333333, 0.08474576, 0.04166667],
[0.02777778, 0.375 , 0.06779661, 0.04166667],
[0.16666667, 0.45833333, 0.08474576, 0. ],
[0.30555556, 0.70833333, 0.08474576, 0.04166667],
[0.13888889, 0.58333333, 0.10169492, 0.04166667],
[0.13888889, 0.41666667, 0.06779661, 0. ],
[0. , 0.41666667, 0.01694915, 0. ],
[0.41666667, 0.83333333, 0.03389831, 0.04166667],
[0.38888889, 1. , 0.08474576, 0.125 ],
[0.30555556, 0.79166667, 0.05084746, 0.125 ],
[0.22222222, 0.625 , 0.06779661, 0.08333333],
[0.38888889, 0.75 , 0.11864407, 0.08333333],
[0.22222222, 0.75 , 0.08474576, 0.08333333],
[0.30555556, 0.58333333, 0.11864407, 0.04166667],
[0.22222222, 0.70833333, 0.08474576, 0.125 ],
[0.08333333, 0.66666667, 0. , 0.04166667],
[0.22222222, 0.54166667, 0.11864407, 0.16666667],
[0.13888889, 0.58333333, 0.15254237, 0.04166667],
[0.19444444, 0.41666667, 0.10169492, 0.04166667],
[0.19444444, 0.58333333, 0.10169492, 0.125 ],
[0.25 , 0.625 , 0.08474576, 0.04166667],
[0.25 , 0.58333333, 0.06779661, 0.04166667],
[0.11111111, 0.5 , 0.10169492, 0.04166667],
[0.13888889, 0.45833333, 0.10169492, 0.04166667],
[0.30555556, 0.58333333, 0.08474576, 0.125 ],
[0.25 , 0.875 , 0.08474576, 0. ],
[0.33333333, 0.91666667, 0.06779661, 0.04166667],
[0.16666667, 0.45833333, 0.08474576, 0. ],
[0.19444444, 0.5 , 0.03389831, 0.04166667],
[0.33333333, 0.625 , 0.05084746, 0.04166667],
[0.16666667, 0.45833333, 0.08474576, 0. ],
[0.02777778, 0.41666667, 0.05084746, 0.04166667],
[0.22222222, 0.58333333, 0.08474576, 0.04166667],
[0.19444444, 0.625 , 0.05084746, 0.08333333],
[0.05555556, 0.125 , 0.05084746, 0.08333333],
[0.02777778, 0.5 , 0.05084746, 0.04166667],
[0.19444444, 0.625 , 0.10169492, 0.20833333],
[0.22222222, 0.75 , 0.15254237, 0.125 ],
[0.13888889, 0.41666667, 0.06779661, 0.08333333],
[0.22222222, 0.75 , 0.10169492, 0.04166667],
[0.08333333, 0.5 , 0.06779661, 0.04166667],
[0.27777778, 0.70833333, 0.08474576, 0.04166667],
[0.19444444, 0.54166667, 0.06779661, 0.04166667],
[0.75 , 0.5 , 0.62711864, 0.54166667],
[0.58333333, 0.5 , 0.59322034, 0.58333333],
[0.72222222, 0.45833333, 0.66101695, 0.58333333],
[0.33333333, 0.125 , 0.50847458, 0.5 ],
[0.61111111, 0.33333333, 0.61016949, 0.58333333],
[0.38888889, 0.33333333, 0.59322034, 0.5 ],
[0.55555556, 0.54166667, 0.62711864, 0.625 ],
file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 6/13
9/7/2018 komal_knn1_minMaxScalar

[0.16666667, 0.16666667, 0.38983051, 0.375 ],

[0.63888889, 0.375 , 0.61016949, 0.5 ],
[0.25 , 0.29166667, 0.49152542, 0.54166667],
[0.19444444, 0. , 0.42372881, 0.375 ],
[0.44444444, 0.41666667, 0.54237288, 0.58333333],
[0.47222222, 0.08333333, 0.50847458, 0.375 ],
[0.5 , 0.375 , 0.62711864, 0.54166667],
[0.36111111, 0.375 , 0.44067797, 0.5 ],
[0.66666667, 0.45833333, 0.57627119, 0.54166667],
[0.36111111, 0.41666667, 0.59322034, 0.58333333],
[0.41666667, 0.29166667, 0.52542373, 0.375 ],
[0.52777778, 0.08333333, 0.59322034, 0.58333333],
[0.36111111, 0.20833333, 0.49152542, 0.41666667],
[0.44444444, 0.5 , 0.6440678 , 0.70833333],
[0.5 , 0.33333333, 0.50847458, 0.5 ],
[0.55555556, 0.20833333, 0.66101695, 0.58333333],
[0.5 , 0.33333333, 0.62711864, 0.45833333],
[0.58333333, 0.375 , 0.55932203, 0.5 ],
[0.63888889, 0.41666667, 0.57627119, 0.54166667],
[0.69444444, 0.33333333, 0.6440678 , 0.54166667],
[0.66666667, 0.41666667, 0.6779661 , 0.66666667],
[0.47222222, 0.375 , 0.59322034, 0.58333333],
[0.38888889, 0.25 , 0.42372881, 0.375 ],
[0.33333333, 0.16666667, 0.47457627, 0.41666667],
[0.33333333, 0.16666667, 0.45762712, 0.375 ],
[0.41666667, 0.29166667, 0.49152542, 0.45833333],
[0.47222222, 0.29166667, 0.69491525, 0.625 ],
[0.30555556, 0.41666667, 0.59322034, 0.58333333],
[0.47222222, 0.58333333, 0.59322034, 0.625 ],
[0.66666667, 0.45833333, 0.62711864, 0.58333333],
[0.55555556, 0.125 , 0.57627119, 0.5 ],
[0.36111111, 0.41666667, 0.52542373, 0.5 ],
[0.33333333, 0.20833333, 0.50847458, 0.5 ],
[0.33333333, 0.25 , 0.57627119, 0.45833333],
[0.5 , 0.41666667, 0.61016949, 0.54166667],
[0.41666667, 0.25 , 0.50847458, 0.45833333],
[0.19444444, 0.125 , 0.38983051, 0.375 ],
[0.36111111, 0.29166667, 0.54237288, 0.5 ],
[0.38888889, 0.41666667, 0.54237288, 0.45833333],
[0.38888889, 0.375 , 0.54237288, 0.5 ],
[0.52777778, 0.375 , 0.55932203, 0.5 ],
[0.22222222, 0.20833333, 0.33898305, 0.41666667],
[0.38888889, 0.33333333, 0.52542373, 0.5 ],
[0.55555556, 0.54166667, 0.84745763, 1. ],
[0.41666667, 0.29166667, 0.69491525, 0.75 ],
[0.77777778, 0.41666667, 0.83050847, 0.83333333],
[0.55555556, 0.375 , 0.77966102, 0.70833333],
[0.61111111, 0.41666667, 0.81355932, 0.875 ],
[0.91666667, 0.41666667, 0.94915254, 0.83333333],
[0.16666667, 0.20833333, 0.59322034, 0.66666667],
[0.83333333, 0.375 , 0.89830508, 0.70833333],
[0.66666667, 0.20833333, 0.81355932, 0.70833333],
[0.80555556, 0.66666667, 0.86440678, 1. ],
[0.61111111, 0.5 , 0.69491525, 0.79166667],
[0.58333333, 0.29166667, 0.72881356, 0.75 ],
[0.69444444, 0.41666667, 0.76271186, 0.83333333],
[0.38888889, 0.20833333, 0.6779661 , 0.79166667],

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 7/13
9/7/2018 komal_knn1_minMaxScalar

[0.41666667, 0.33333333, 0.69491525, 0.95833333],

[0.58333333, 0.5 , 0.72881356, 0.91666667],
[0.61111111, 0.41666667, 0.76271186, 0.70833333],
[0.94444444, 0.75 , 0.96610169, 0.875 ],
[0.94444444, 0.25 , 1. , 0.91666667],
[0.47222222, 0.08333333, 0.6779661 , 0.58333333],
[0.72222222, 0.5 , 0.79661017, 0.91666667],
[0.36111111, 0.33333333, 0.66101695, 0.79166667],
[0.94444444, 0.33333333, 0.96610169, 0.79166667],
[0.55555556, 0.29166667, 0.66101695, 0.70833333],
[0.66666667, 0.54166667, 0.79661017, 0.83333333],
[0.80555556, 0.5 , 0.84745763, 0.70833333],
[0.52777778, 0.33333333, 0.6440678 , 0.70833333],
[0.5 , 0.41666667, 0.66101695, 0.70833333],
[0.58333333, 0.33333333, 0.77966102, 0.83333333],
[0.80555556, 0.41666667, 0.81355932, 0.625 ],
[0.86111111, 0.33333333, 0.86440678, 0.75 ],
[1. , 0.75 , 0.91525424, 0.79166667],
[0.58333333, 0.33333333, 0.77966102, 0.875 ],
[0.55555556, 0.33333333, 0.69491525, 0.58333333],
[0.5 , 0.25 , 0.77966102, 0.54166667],
[0.94444444, 0.41666667, 0.86440678, 0.91666667],
[0.55555556, 0.58333333, 0.77966102, 0.95833333],
[0.58333333, 0.45833333, 0.76271186, 0.70833333],
[0.47222222, 0.41666667, 0.6440678 , 0.70833333],
[0.72222222, 0.45833333, 0.74576271, 0.83333333],
[0.66666667, 0.45833333, 0.77966102, 0.95833333],
[0.72222222, 0.45833333, 0.69491525, 0.91666667],
[0.41666667, 0.29166667, 0.69491525, 0.75 ],
[0.69444444, 0.5 , 0.83050847, 0.91666667],
[0.66666667, 0.54166667, 0.79661017, 1. ],
[0.66666667, 0.41666667, 0.71186441, 0.91666667],
[0.55555556, 0.20833333, 0.6779661 , 0.75 ],
[0.61111111, 0.41666667, 0.71186441, 0.79166667],
[0.52777778, 0.58333333, 0.74576271, 0.91666667],
[0.44444444, 0.41666667, 0.69491525, 0.70833333]])

In [10]: # transform back to df for easier exploration/plotting (output of scaler)

X_scaled_df = pd.DataFrame(X_scaled, columns=['s_SepalLength','s_SepalWidth',
's_PetalLength','s_PetalWidth'])

X_scaled_df.head()

Out[10]:
s_SepalLength s_SepalWidth s_PetalLength s_PetalWidth

0 0.222222 0.625000 0.067797 0.041667

1 0.166667 0.416667 0.067797 0.041667

2 0.111111 0.500000 0.050847 0.041667

3 0.083333 0.458333 0.084746 0.041667

4 0.194444 0.666667 0.067797 0.041667

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 8/13
9/7/2018 komal_knn1_minMaxScalar

In [11]: df_iris_scaled = pd.concat([X_scaled_df,y],axis=1)

df_iris_scaled.head()

Out[11]:
s_SepalLength s_SepalWidth s_PetalLength s_PetalWidth class

0 0.222222 0.625000 0.067797 0.041667 Iris-setosa

1 0.166667 0.416667 0.067797 0.041667 Iris-setosa

2 0.111111 0.500000 0.050847 0.041667 Iris-setosa

3 0.083333 0.458333 0.084746 0.041667 Iris-setosa

4 0.194444 0.666667 0.067797 0.041667 Iris-setosa

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 9/13
9/7/2018 komal_knn1_minMaxScalar

In [12]: # Notice x-axis on subplots are all the same for all features (0 to 1)
# after scaling.
fig = plt.figure(figsize=(14,9))
fig.suptitle('Frequency Distribution of Features by Species ',fontsize=20)

ax1 = fig.add_subplot(221)
df_iris_scaled.groupby("class").s_PetalLength.plot(kind='hist',
alpha=0.8,
legend=True,
title='s_PetalLength')

ax2 = fig.add_subplot(222,sharey=ax1)
df_iris_scaled.groupby("class").s_PetalWidth.plot(kind='hist',
alpha=0.8,
legend=True,
title='s_PetalWidth')

ax3 = fig.add_subplot(223,sharey=ax1)
df_iris_scaled.groupby("class").s_SepalLength.plot(kind='hist',
alpha=0.8,
legend=True,
title='s_SepalLength')

ax4 = fig.add_subplot(224,sharey=ax1)
df_iris_scaled.groupby("class").s_SepalWidth.plot(kind='hist',
alpha=0.8,
legend=True,
title='s_SepalWidth');

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 10/13
9/7/2018 komal_knn1_minMaxScalar

In [13]: X_scaled_df.describe()

Out[13]:
s_SepalLength s_SepalWidth s_PetalLength s_PetalWidth

count 150.000000 150.000000 150.000000 150.000000

mean 0.428704 0.439167 0.467571 0.457778

std 0.230018 0.180664 0.299054 0.317984

min 0.000000 0.000000 0.000000 0.000000

25% 0.222222 0.333333 0.101695 0.083333

50% 0.416667 0.416667 0.567797 0.500000

75% 0.583333 0.541667 0.694915 0.708333

max 1.000000 1.000000 1.000000 1.000000

In [18]: # train and test split

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, rando

m_state = 0)

In [19]: print("train sample size",X_train.shape, type(X_train))

print("test sample size",X_test.shape, type(X_test))

train sample size (105, 4) <class 'numpy.ndarray'>

test sample size (45, 4) <class 'numpy.ndarray'>

In [23]: clf = KNeighborsClassifier(n_neighbors=5)

clf.fit(X_train, y_train)

Out[23]: KNeighborsClassifier(algorithm='auto', leaf_size=30, metric='minkowski',

metric_params=None, n_jobs=1, n_neighbors=5, p=2,
weights='uniform')

In [24]: y_pred = clf.predict(X_test)

In [28]: # Creates a confusion matrix

cm = metrics.confusion_matrix(y_test, y_pred)

In [29]: cm

Out[29]: array([[16, 0, 0],

[ 0, 17, 1],
[ 0, 0, 11]], dtype=int64)

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 11/13
9/7/2018 komal_knn1_minMaxScalar

In [32]: CT=pd.crosstab(y_test, y_pred, rownames=['True'], colnames=['Predicted'], marg

ins=True)
CT

Out[32]:
Predicted Iris-setosa Iris-versicolor Iris-virginica All

True

Iris-setosa 16 0 0 16

Iris-versicolor 0 17 1 18

Iris-virginica 0 0 11 11

All 16 17 12 45

In [38]: from sklearn.metrics import accuracy_score

An insight we can get from the matrix is that the model was very accurate at classifying setosa and
versicolor (True Positive/All = 1.0). However, accuracy for virginica was lower (11/12 = 0.917).

In [39]: plt.figure(figsize=(6,4))
sns.heatmap(CT, annot=True)
plt.title('KNN classification model \nAccuracy:{0:.3f}'.format(accuracy_score(
y_test, y_pred)))
plt.ylabel('True label')
plt.xlabel('Predicted label')

Out[39]: Text(0.5,16,'Predicted label')

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 12/13
9/7/2018 komal_knn1_minMaxScalar

In [42]: from sklearn.metrics import classification_report

print(classification_report(y_test,y_pred))

precision recall f1-score support

Iris-setosa 1.00 1.00 1.00 16

Iris-versicolor 1.00 0.94 0.97 18
Iris-virginica 0.92 1.00 0.96 11

avg / total 0.98 0.98 0.98 45

In [43]: # Classification accuracy : Overall how often is the classifier correct?

print(metrics.accuracy_score(y_test, y_pred))

# classification error : Overall how often is the classifier incorrect?

print(1-metrics.accuracy_score(y_test, y_pred))

0.9777777777777777
0.022222222222222254

In [45]: # Sensitivity : when the actual value is +ve, how often is the predication cor
rect
# Also known as "True Positive Rate" or 'Recall"
# should be MAXIMIZED
#print(metrics.recall_score(y_test, y_pred, average='none'))

# Specificity: When the actual value is -ve, how often the prediction correct
# Also known as "Selective"
# should be MAXIMIZED

# False Positive Rate : when the actual value is negative, how often is the
# prediction incorrect
# 1- Specificity

# Precision: when a +ve value is predicted, how often is the prediction correc
t?
# print(metrics.precision_score(y_test, y_pred, average='none'))

file:///D:/komal/SIMPLILEARN/MY%20COURSES/IN%20PROGRESS/My%20Codes_ML_DS/codes%20in%20pdf/komal_knn1_minMaxScalar.html 13/13

SMDM Project
87% (15)
SMDM Project
23 pages
Assignment 1 Sta301
100% (1)
Assignment 1 Sta301
3 pages
Employees Mod DB PDF
No ratings yet
Employees Mod DB PDF
1 page
TP5 Ex1 Kmeans
No ratings yet
TP5 Ex1 Kmeans
7 pages
Stock-MArket-Forecasting - Untitled - Ipynb at Master Krishnaik06 - Stock-MArket-Forecasting
No ratings yet
Stock-MArket-Forecasting - Untitled - Ipynb at Master Krishnaik06 - Stock-MArket-Forecasting
37 pages
Minor Project
No ratings yet
Minor Project
92 pages
PCA
No ratings yet
PCA
23 pages
Import As
100% (1)
Import As
27 pages
Python Tut Gradient Descent Algos MLR - Jupyter Notebook
No ratings yet
Python Tut Gradient Descent Algos MLR - Jupyter Notebook
40 pages
K Means Clustering
No ratings yet
K Means Clustering
6 pages
Upyter Notebook1
No ratings yet
Upyter Notebook1
5 pages
DL Lab 3
No ratings yet
DL Lab 3
5 pages
Data AMMM
No ratings yet
Data AMMM
27 pages
DATA SCIENCE IDC 302 End Sem Project
No ratings yet
DATA SCIENCE IDC 302 End Sem Project
1 page
7 Output
No ratings yet
7 Output
4 pages
Merged
No ratings yet
Merged
35 pages
HW1
No ratings yet
HW1
2 pages
Dimentionality Reduction Implementation
No ratings yet
Dimentionality Reduction Implementation
8 pages
Bob AttackCue01 CritHit
No ratings yet
Bob AttackCue01 CritHit
175 pages
Numpy TE2D6
No ratings yet
Numpy TE2D6
8 pages
DL Lab2
No ratings yet
DL Lab2
38 pages
Optimals Newest ROGUE2.5speed
No ratings yet
Optimals Newest ROGUE2.5speed
3 pages
April 23, 2025: Pandas PD
No ratings yet
April 23, 2025: Pandas PD
11 pages
Assignment-07-DBSCAN Clustering (Crimes) - Jupyter Notebook
No ratings yet
Assignment-07-DBSCAN Clustering (Crimes) - Jupyter Notebook
11 pages
Numpy TE2
No ratings yet
Numpy TE2
12 pages
Bob AttackCue01 Hit
No ratings yet
Bob AttackCue01 Hit
152 pages
Tarea 8
No ratings yet
Tarea 8
7 pages
Bob SitGround Out
No ratings yet
Bob SitGround Out
238 pages
Bob WalkAimShotgun DiagR
No ratings yet
Bob WalkAimShotgun DiagR
148 pages
CSV To Array Pythonn
No ratings yet
CSV To Array Pythonn
6 pages
Task Digits Numbers
No ratings yet
Task Digits Numbers
1 page
Bob AttackChainsaw01 CritHit
No ratings yet
Bob AttackChainsaw01 CritHit
226 pages
Keeratsi HW8
No ratings yet
Keeratsi HW8
17 pages
KNN052
No ratings yet
KNN052
5 pages
Implementation of Image Processing Algorithms For Fracture Detection On Different Human Body Parts. (Minor 02)
No ratings yet
Implementation of Image Processing Algorithms For Fracture Detection On Different Human Body Parts. (Minor 02)
22 pages
Kmeans Example Mnnit
No ratings yet
Kmeans Example Mnnit
23 pages
Bob Crowbar DoorLeft
No ratings yet
Bob Crowbar DoorLeft
332 pages
Grin 4
No ratings yet
Grin 4
4 pages
Bob WalkAim1Hand DiagR
No ratings yet
Bob WalkAim1Hand DiagR
143 pages
Kmeans and Apriori
No ratings yet
Kmeans and Apriori
20 pages
Unit 2
No ratings yet
Unit 2
12 pages
Bob WindowSmash
No ratings yet
Bob WindowSmash
162 pages
Important Steps: Gensim: A Python Library For NLP and Word Embeddings
No ratings yet
Important Steps: Gensim: A Python Library For NLP and Word Embeddings
31 pages
Kerr - Solve Ivp
No ratings yet
Kerr - Solve Ivp
8 pages
Interpolatingfunction : Bla Ndsolve ( (F ''' (T) + F (T) F '' (T) 0, F (0) 0, F ' (0) 0, F ' (100 000) 1), F, T)
No ratings yet
Interpolatingfunction : Bla Ndsolve ( (F ''' (T) + F (T) F '' (T) 0, F (0) 0, F ' (0) 0, F ' (100 000) 1), F, T)
7 pages
Flores
No ratings yet
Flores
4 pages
Un Modelo de Colas
No ratings yet
Un Modelo de Colas
5 pages
03 Multiple Linear Regression
No ratings yet
03 Multiple Linear Regression
7 pages
Grin 5
No ratings yet
Grin 5
4 pages
Practical 5
No ratings yet
Practical 5
13 pages
Nelson-Siegel Model
No ratings yet
Nelson-Siegel Model
50 pages
Data Covid-19 Jakarta: Numpy NP Matplotlib - Pyplot PLT Ipython - Display
No ratings yet
Data Covid-19 Jakarta: Numpy NP Matplotlib - Pyplot PLT Ipython - Display
3 pages
Optimals Newest AprilVirtualCircuitnewspeed
No ratings yet
Optimals Newest AprilVirtualCircuitnewspeed
3 pages
Fuzzy Set
No ratings yet
Fuzzy Set
21 pages
Untitled 1
No ratings yet
Untitled 1
9 pages
Numpy
No ratings yet
Numpy
1 page
Fuzzy Set
No ratings yet
Fuzzy Set
20 pages
Actividad Fenomenos
No ratings yet
Actividad Fenomenos
10 pages
Augmented Solow Vietnam Philippines
No ratings yet
Augmented Solow Vietnam Philippines
22 pages
Sys Prop
No ratings yet
Sys Prop
2,756 pages
22
No ratings yet
22
7 pages
4.4. Data Standardization - Ipynb - Colaboratory
No ratings yet
4.4. Data Standardization - Ipynb - Colaboratory
1 page
The Digital Guide
From Everand
The Digital Guide
Raylene Egbert
No ratings yet
Selenium Testing Process
No ratings yet
Selenium Testing Process
9 pages
Selenium Java Environment Setup
No ratings yet
Selenium Java Environment Setup
7 pages
Java For Selenium
No ratings yet
Java For Selenium
9 pages
Windows Quickstart Instructions: Step 1: Download Anaconda
No ratings yet
Windows Quickstart Instructions: Step 1: Download Anaconda
7 pages
HDFS and YARN
No ratings yet
HDFS and YARN
91 pages
SELECT From WORLD Tutorial
No ratings yet
SELECT From WORLD Tutorial
13 pages
Hive and Impala
No ratings yet
Hive and Impala
46 pages
Worksheet 2
No ratings yet
Worksheet 2
3 pages
SELECT From Nobel
No ratings yet
SELECT From Nobel
13 pages
Decision Tree and EDA With Functions: Import Pandas As PD
No ratings yet
Decision Tree and EDA With Functions: Import Pandas As PD
9 pages
Regular Expressions in Python
No ratings yet
Regular Expressions in Python
16 pages
Knn1 HouseVotes
No ratings yet
Knn1 HouseVotes
2 pages
Random Forest/Roc&Auc - Hyperparamer Tuning With For Loop - TITANIC DB
No ratings yet
Random Forest/Roc&Auc - Hyperparamer Tuning With For Loop - TITANIC DB
17 pages
Random Forest: Random Forest Has Classifier For Classification and Regressor For Regression
No ratings yet
Random Forest: Random Forest Has Classifier For Classification and Regressor For Regression
9 pages
Digits Recognition Dataset
No ratings yet
Digits Recognition Dataset
4 pages
How To Analyze Data Using The Average: Link For
No ratings yet
How To Analyze Data Using The Average: Link For
1 page
# Import Plotting Libraries: in (1) : Import Pandas As PD
No ratings yet
# Import Plotting Libraries: in (1) : Import Pandas As PD
13 pages
Ibook - Pub Seasonal Adjustment Methods and Real Time Trend Cycle Estimation
No ratings yet
Ibook - Pub Seasonal Adjustment Methods and Real Time Trend Cycle Estimation
293 pages
Biostatistics
No ratings yet
Biostatistics
10 pages
Lecture 7 - CH 3 Forecasting - 1spp
No ratings yet
Lecture 7 - CH 3 Forecasting - 1spp
58 pages
Non-Stationarity and Unit Roots
No ratings yet
Non-Stationarity and Unit Roots
25 pages
TMM 1
No ratings yet
TMM 1
4 pages
ANOVA Calculator - One Way ANOVA and Tukey HSD Test
No ratings yet
ANOVA Calculator - One Way ANOVA and Tukey HSD Test
5 pages
Shrout Bolger 2002
No ratings yet
Shrout Bolger 2002
26 pages
Bayesian Lecture Notes
No ratings yet
Bayesian Lecture Notes
28 pages
MCA (Revised) Term-End Examination February, 2021 Mcse-004: Numerical and Statistical Computing
No ratings yet
MCA (Revised) Term-End Examination February, 2021 Mcse-004: Numerical and Statistical Computing
6 pages
Stats Poster Project 1
No ratings yet
Stats Poster Project 1
3 pages
Topic1.4-Functions of Random Variables
No ratings yet
Topic1.4-Functions of Random Variables
41 pages
Statistical Inference
No ratings yet
Statistical Inference
148 pages
Problem 5.1: Piscataquis River: Exceedence Probability (%)
No ratings yet
Problem 5.1: Piscataquis River: Exceedence Probability (%)
7 pages
Psychological Statistics Exam
No ratings yet
Psychological Statistics Exam
3 pages
CH 4 - Problems
No ratings yet
CH 4 - Problems
72 pages
Midterm Exam Formula PDF
No ratings yet
Midterm Exam Formula PDF
6 pages
Codebasics DS AI Bootcamp Brochure v1
No ratings yet
Codebasics DS AI Bootcamp Brochure v1
41 pages
Hubungan Persepsi Mahasiswa Tentang Keluarga Harmonis Dengan Kesiapan Menikah
No ratings yet
Hubungan Persepsi Mahasiswa Tentang Keluarga Harmonis Dengan Kesiapan Menikah
7 pages
An End-To-End Project On Time Series Analysis and Forecasting With Python
No ratings yet
An End-To-End Project On Time Series Analysis and Forecasting With Python
23 pages
Sentiment Analysis IMDB Review - Presentation
No ratings yet
Sentiment Analysis IMDB Review - Presentation
19 pages
Pertemuan 7z
No ratings yet
Pertemuan 7z
31 pages
Regression Problems in Python PDF
No ratings yet
Regression Problems in Python PDF
34 pages
Violation of Assumptions of CLR Model:: Multicollinearity
No ratings yet
Violation of Assumptions of CLR Model:: Multicollinearity
28 pages
Chapter Three Ond Four
No ratings yet
Chapter Three Ond Four
11 pages
Intervalo de Confianza y Dummy Variables 1
No ratings yet
Intervalo de Confianza y Dummy Variables 1
13 pages
Quartiles
No ratings yet
Quartiles
8 pages
Leveraging Data For Analyses - Radio PM - Makkah 4G KPIs
No ratings yet
Leveraging Data For Analyses - Radio PM - Makkah 4G KPIs
957 pages
Lab 02 - Introduction To Pandas
No ratings yet
Lab 02 - Introduction To Pandas
6 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Knn1 MinMaxScalar

Uploaded by

Knn1 MinMaxScalar

Uploaded by

9/7/2018 komal_knn1_minMaxScalar

In [36]: import numpy as np

import matplotlib.pyplot as plt

from sklearn.preprocessing import MinMaxScaler

from sklearn.model_selection import train_test_split

import seaborn as sns

In [2]: location = r"D:\komal\SIMPLILEARN\MY COURSES\IN PROGRESS\DATA SCIENCE WITH PYT

0 5.1 3.5 1.4 0.2 Iris-setosa

1 4.9 3.0 1.4 0.2 Iris-setosa

2 4.7 3.2 1.3 0.2 Iris-setosa

3 4.6 3.1 1.5 0.2 Iris-setosa

4 5.0 3.6 1.4 0.2 Iris-setosa

In [4]: # Check the available styles

count 150.000000 150.000000 150.000000 150.000000

mean 5.843333 3.054000 3.758667 1.198667

std 0.828066 0.433594 1.764420 0.763161

min 4.300000 2.000000 1.000000 0.100000

25% 5.100000 2.800000 1.600000 0.300000

50% 5.800000 3.000000 4.350000 1.300000

75% 6.400000 3.300000 5.100000 1.800000

max 7.900000 4.400000 6.900000 2.500000

In [7]: X = df_iris.drop('class' , 1).values # drop target variable

In [8]: scaler = MinMaxScaler()

Out[8]: MinMaxScaler(copy=True, feature_range=(0, 1))

In [9]: X_scaled = scaler.fit_transform(X)

print('X_scaled type is', type(X_scaled))

X_scaled type is <class 'numpy.ndarray'>

Out[9]: array([[0.22222222, 0.625 , 0.06779661, 0.04166667],

[0.16666667, 0.16666667, 0.38983051, 0.375 ],

[0.41666667, 0.33333333, 0.69491525, 0.95833333],

In [10]: # transform back to df for easier exploration/plotting (output of scaler)

0 0.222222 0.625000 0.067797 0.041667

1 0.166667 0.416667 0.067797 0.041667

2 0.111111 0.500000 0.050847 0.041667

3 0.083333 0.458333 0.084746 0.041667

4 0.194444 0.666667 0.067797 0.041667

In [11]: df_iris_scaled = pd.concat([X_scaled_df,y],axis=1)

0 0.222222 0.625000 0.067797 0.041667 Iris-setosa

1 0.166667 0.416667 0.067797 0.041667 Iris-setosa

2 0.111111 0.500000 0.050847 0.041667 Iris-setosa

3 0.083333 0.458333 0.084746 0.041667 Iris-setosa

4 0.194444 0.666667 0.067797 0.041667 Iris-setosa

count 150.000000 150.000000 150.000000 150.000000

mean 0.428704 0.439167 0.467571 0.457778

std 0.230018 0.180664 0.299054 0.317984

min 0.000000 0.000000 0.000000 0.000000

25% 0.222222 0.333333 0.101695 0.083333

50% 0.416667 0.416667 0.567797 0.500000

75% 0.583333 0.541667 0.694915 0.708333

max 1.000000 1.000000 1.000000 1.000000

In [18]: # train and test split

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, rando

In [19]: print("train sample size",X_train.shape, type(X_train))

train sample size (105, 4) <class 'numpy.ndarray'>

In [23]: clf = KNeighborsClassifier(n_neighbors=5)

Out[23]: KNeighborsClassifier(algorithm='auto', leaf_size=30, metric='minkowski',

In [24]: y_pred = clf.predict(X_test)

In [28]: # Creates a confusion matrix

Out[29]: array([[16, 0, 0],

In [32]: CT=pd.crosstab(y_test, y_pred, rownames=['True'], colnames=['Predicted'], marg

In [38]: from sklearn.metrics import accuracy_score

Out[39]: Text(0.5,16,'Predicted label')

In [42]: from sklearn.metrics import classification_report

precision recall f1-score support

Iris-setosa 1.00 1.00 1.00 16

avg / total 0.98 0.98 0.98 45

In [43]: # Classification accuracy : Overall how often is the classifier correct?

# classification error : Overall how often is the classifier incorrect?

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.