Distance and Normalization
A = (3,4)
B = (5,6)
Find Euclidean Distance and Manhattan Distance
Perform Normalization:
i) Z-Score
ii) Min-Max
iii) Decimal scaling by a factor of 100
Euclidean Distance:
The Euclidean distance between two points A(x₁, y₁) and B(x₂, y₂) is given by the formula:
d(A, B) = √((x₂ − x₁)² + (y₂ − y₁)²) = √((5 − 3)² + (6 − 4)²) = √(4 + 4) = √8 ≈ 2.83
Manhattan Distance:
The Manhattan distance between two points is the sum of the absolute differences of their coordinates:
Manhattan Distance = ∣5 − 3∣ + ∣6 − 4∣ = 2 + 2 = 4
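As a quick check, both distances can be computed with a minimal Python sketch (variable names are my own):

```python
import math

A = (3, 4)
B = (5, 6)

# Euclidean: square root of the sum of squared coordinate differences.
euclidean = math.sqrt((B[0] - A[0]) ** 2 + (B[1] - A[1]) ** 2)
# Manhattan: sum of absolute coordinate differences.
manhattan = abs(B[0] - A[0]) + abs(B[1] - A[1])

print(euclidean)  # 2.8284271247461903 (= √8)
print(manhattan)  # 4
```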
Normalization:
i) Z-Score Normalization:
Z = (x − μ) / σ
Where μ is the mean and σ is the standard deviation of the coordinate values.
For the x-coordinates (3 and 5):
μx = (3 + 5) / 2 = 4
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 1/5
σx = √(((3 − 4)² + (5 − 4)²) / 2) = √((1 + 1) / 2) = √1 = 1
For the y-coordinates (4 and 6), the mean (μy) and standard deviation (σy) are:
μy = (4 + 6) / 2 = 5
σy = √(((4 − 5)² + (6 − 5)²) / 2) = √1 = 1
For point A = (3, 4):
Zx = (3 − 4) / 1 = −1, Zy = (4 − 5) / 1 = −1
So, Az-score = (−1, −1)
For point B = (5, 6):
Zx = (5 − 4) / 1 = 1, Zy = (6 − 5) / 1 = 1
So, Bz-score = (1, 1)
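The z-score steps above can be verified with a short Python sketch (z_scores is a hypothetical helper; it uses the population standard deviation, dividing by n, as in the worked solution):

```python
def z_scores(values):
    """Z-score normalize using the population mean and standard deviation."""
    n = len(values)
    mu = sum(values) / n
    sigma = (sum((v - mu) ** 2 for v in values) / n) ** 0.5
    return [(v - mu) / sigma for v in values]

xs = z_scores([3, 5])  # x-coordinates of A and B
ys = z_scores([4, 6])  # y-coordinates of A and B
print(list(zip(xs, ys)))  # [(-1.0, -1.0), (1.0, 1.0)]
```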
ii) Min-Max Normalization:
Min-Max(x) = (x − xmin) / (xmax − xmin)
Here xmin = 3, xmax = 5, ymin = 4, ymax = 6.
For point A = (3, 4): ((3 − 3) / 2, (4 − 4) / 2), so Amin-max = (0, 0)
For point B = (5, 6): ((5 − 3) / 2, (6 − 4) / 2), so Bmin-max = (1, 1)
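A minimal Python sketch of the same min-max formula (min_max is a hypothetical helper name):

```python
def min_max(values):
    """Rescale values linearly into [0, 1]."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

xs = min_max([3, 5])  # x-coordinates of A and B
ys = min_max([4, 6])  # y-coordinates of A and B
print(list(zip(xs, ys)))  # [(0.0, 0.0), (1.0, 1.0)]
```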
iii) Decimal Scaling Normalization (by a factor of 100):
In general, Decimal Scaling(x) = x / 10^k, where k is the smallest integer such that max(∣x∣) < 10^k. Here the problem fixes the scaling factor at 100 (k = 2).
For point A = (3, 4):
Adecimal scaling = (3/100, 4/100) = (0.03, 0.04)
For point B = (5, 6):
Bdecimal scaling = (5/100, 6/100) = (0.05, 0.06)
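Decimal scaling is a one-line transformation; a quick Python sketch (decimal_scale is a hypothetical helper, with k = 2 giving the factor of 100 specified in the problem):

```python
def decimal_scale(values, k=2):
    """Divide each value by 10**k (k = 2 means a factor of 100)."""
    return [v / 10 ** k for v in values]

print(decimal_scale([3, 4]))  # A -> [0.03, 0.04]
print(decimal_scale([5, 6]))  # B -> [0.05, 0.06]
```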
Summary:
Euclidean Distance = √8 ≈ 2.83
Manhattan Distance = 4
Normalization:
Z-Score:
Az-score = (−1, −1)
Bz-score = (1, 1)
Min-Max:
Amin-max = (0, 0)
Bmin-max = (1, 1)
Decimal Scaling:
Adecimal scaling = (0.03, 0.04)
Bdecimal scaling = (0.05, 0.06)
To find the best-fitted regression line for the given points (1, 4), (2, 8), (3, 12), (4, 16), (5, 20), (6, 24), we can use the
linear regression formula:
y = mx + b
Where:
1. Slope (m):
m = [n(∑xᵢyᵢ) − (∑xᵢ)(∑yᵢ)] / [n(∑xᵢ²) − (∑xᵢ)²]
2. Intercept (b):
b = (∑yᵢ − m∑xᵢ) / n
Where n is the number of data points, xi and yi are the coordinates of the points.
Let’s calculate the slope and intercept for the given points. Here n = 6, ∑xᵢ = 21, ∑yᵢ = 84, ∑xᵢyᵢ = 364, and ∑xᵢ² = 91, so:
m = (6 · 364 − 21 · 84) / (6 · 91 − 21²) = 420 / 105 = 4
b = (84 − 4 · 21) / 6 = 0
The best-fitted regression line for the given points is:
y = 4x + 0
Or simply:
y = 4x
This indicates that the slope of the line is 4, and the y-intercept is 0.
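The same least-squares formulas can be checked with a short Python sketch (fit_line is a hypothetical helper name):

```python
def fit_line(points):
    """Ordinary least squares for y = m*x + b over (x, y) pairs."""
    n = len(points)
    sx = sum(x for x, _ in points)
    sy = sum(y for _, y in points)
    sxy = sum(x * y for x, y in points)
    sxx = sum(x * x for x, _ in points)
    m = (n * sxy - sx * sy) / (n * sxx - sx ** 2)
    b = (sy - m * sx) / n
    return m, b

points = [(1, 4), (2, 8), (3, 12), (4, 16), (5, 20), (6, 24)]
print(fit_line(points))  # (4.0, 0.0), i.e. y = 4x
```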
Equal-Frequency
Equal-Width
To perform Equal-Frequency and Equal-Width binning on the given data points {5, 10, 12, 20, 25, 32, 64, 78, 81}, let's first
define what these methods mean:
1. Equal-Frequency Binning:
Divide the data into bins such that each bin has the same number of data points.
The number of bins is typically chosen based on the number of data points.
2. Equal-Width Binning:
Divide the range of data values into equal-width intervals.
The number of bins is specified, and the width of each bin is calculated by dividing the total range of the data by
the number of bins.
Let’s determine:
Number of bins for each method (we’ll use 3 bins for illustration).
Perform both binning techniques.
Equal-Frequency Binning:
With 9 sorted data points and 3 bins, each bin contains 3 data points:
Bin 1: {5, 10, 12}
Bin 2: {20, 25, 32}
Bin 3: {64, 78, 81}
Equal-Width Binning:
In this case, the width of each bin is (81 − 5) / 3 ≈ 25.33, giving the intervals [5, 30.33), [30.33, 55.67), and [55.67, 81]:
Bin 1: {5, 10, 12, 20, 25}
Bin 2: {32}
Bin 3: {64, 78, 81}
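Both binning schemes can be sketched in Python (variable names are my own; the last equal-width interval is treated as closed so the maximum value 81 lands in the final bin):

```python
data = [5, 10, 12, 20, 25, 32, 64, 78, 81]
k = 3  # number of bins

# Equal-frequency: sort, then slice into runs of equal size.
sorted_data = sorted(data)
size = len(sorted_data) // k
eq_freq = [sorted_data[i * size:(i + 1) * size] for i in range(k)]
print(eq_freq)  # [[5, 10, 12], [20, 25, 32], [64, 78, 81]]

# Equal-width: split [min, max] into k intervals of equal width.
width = (max(data) - min(data)) / k  # (81 - 5) / 3 ≈ 25.33
eq_width = [[] for _ in range(k)]
for v in data:
    idx = min(int((v - min(data)) / width), k - 1)  # clamp max into last bin
    eq_width[idx].append(v)
print(eq_width)  # [[5, 10, 12, 20, 25], [32], [64, 78, 81]]
```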