
CSE 422: Artificial Intelligence


Logistic Regression

Swakkhar Shatabda

BRAC University

December 2, 2024


Contents

1 Classification

2 Sigmoid

3 Cross Entropy

4 Stochastic Gradient Descent


Classification

1 In classification problems, we are given data X and labels y.
2 We aim to learn a model in which y is predicted as a function of X.
3 In classification, the label y is categorical, i.e., discrete in value.
4 For example, suppose you are given several features of a fish, such as its length, weight, whether it has eggs, and the month it was caught, and you have to predict whether it is legal to catch. This problem can be formulated as a classification problem.


Data

Here is how the data looks in a supervised setting:


features: x1, x2, x3, x4; label: y

instance no   x1 (length)   x2 (weight)   x3 (has eggs)   x4 (month)   y (legal?)
1             10            250           1               12           No
2             20            1250          0               1            Yes
3             15            750           1               2            No
...           ...           ...           ...             ...          ...
m             17            550           0               3            Yes
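For concreteness, here is the same toy data in Python, a minimal sketch using only the rows shown above (the elided rows are left out; the array names X and y are mine):

import numpy as np

# Rows 1, 2, 3 and m of the fish table above.
# Columns: x1 = length, x2 = weight, x3 = has eggs, x4 = month.
X = np.array([
    [10,  250, 1, 12],
    [20, 1250, 0,  1],
    [15,  750, 1,  2],
    [17,  550, 0,  3],
])
# Label y: 1 = legal ("Yes"), 0 = not legal ("No").
y = np.array([0, 1, 0, 1])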


Experiments
We will first try to predict the class using only two features, x1 and x2.
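The original slide shows a scatter plot of these two features here; a minimal matplotlib sketch of such a plot, assuming the toy values from the data table (the actual figure is not reproduced):

import matplotlib.pyplot as plt
import numpy as np

X = np.array([[10, 250], [20, 1250], [15, 750], [17, 550]])
y = np.array([0, 1, 0, 1])
plt.scatter(X[y == 0, 0], X[y == 0, 1], label="No (0)")   # negative class
plt.scatter(X[y == 1, 0], X[y == 1, 1], label="Yes (1)")  # positive class
plt.xlabel("x1 (length)")
plt.ylabel("x2 (weight)")
plt.legend()
plt.show()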


Logistic Regression

At first, we are going to try a linear classifier called logistic regression. We can apply logistic regression when the data is linearly separable.
The relationship will be modeled as:

y = w_0 + w_1 x_1

This is again the equation of a straight line.

We need the best line that separates the blue points from the orange ones, i.e., we need to learn w_0, w_1, ...
Can we use gradient descent here? A little trick is required!


Gradient Descent for Logistic Regression

The cost (loss) function we minimized with gradient descent:

e = \frac{1}{2} \sum_{i=1}^{m} \left( \hat{y}^{(i)} - y^{(i)} \right)^2

This time too, the predicted label \hat{y} is a function of \vec{x} and \vec{w}.
The labels are discrete; for this binary classification there are two labels, 0 (no, or negative) and 1 (yes, or positive).
Now we try to define \hat{y} with the help of the weights, or coefficients, of the line.


Linear Classification

This linear classifier divides instances based on their location with respect to the line: positive on the right, negative on the left.
Any point on the line satisfies the equation exactly. A point on the right, such as (3, 1), yields a positive value, and a point on the left, such as (1, 1), yields a negative value.
Based on this we can define a linear classifier.

Linear Classification

The following function will help us make the decision:

f(\vec{x}) = w_0 + w_1 x_1 + w_2 x_2 + \cdots + w_n x_n

LinearClassifier(\vec{x})

    if f(\vec{x}) > 0, i.e., w_0 + w_1 x_1 + w_2 x_2 + \cdots + w_n x_n > 0
        return 1
    else
        return 0

This simple classifier just checks whether a point is on the left or the right of the line.
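A minimal Python sketch of such a classifier, assuming the weights are already known. The separating line x1 = 2 (w0 = -2, w1 = 1, w2 = 0) is a made-up example, chosen so that the slide's two test points fall on opposite sides:

import numpy as np

def linear_classifier(x, w):
    # f(x) = w0 + w1*x1 + ... + wn*xn; return 1 on the positive side.
    f = w[0] + np.dot(w[1:], x)
    return 1 if f > 0 else 0

w = np.array([-2.0, 1.0, 0.0])                 # hypothetical line x1 - 2 = 0
print(linear_classifier(np.array([3, 1]), w))  # (3, 1) is on the right -> 1
print(linear_classifier(np.array([1, 1]), w))  # (1, 1) is on the left  -> 0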


A step function!

Alas! This is not a continuous function and thus not differentiable. We can't calculate gradients! We need to find an alternative!


A sigmoid function!

\sigma(z) = \frac{1}{1 + \exp(-z)}, where z = f(\vec{x})

Good things about the sigmoid!

1 It is continuous and differentiable.
2 \sigma'(z) = \sigma(z)(1 - \sigma(z))
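A small NumPy sketch of the sigmoid and its derivative (the clipping is a safeguard added here against overflow in exp; it is not part of the formula):

import numpy as np

def sigmoid(z):
    z = np.clip(z, -500, 500)      # avoid overflow for very negative z
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_prime(z):
    s = sigmoid(z)
    return s * (1.0 - s)           # sigma'(z) = sigma(z) * (1 - sigma(z))

print(sigmoid(0.0))                # 0.5
print(sigmoid_prime(0.0))          # 0.25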

Let's go back to the loss function now.


A new loss function - Cross-entropy

Cross-entropy loss, or log loss, measures the performance of a classification model whose output is a probability value between 0 and 1.

e = \sum_{i=1}^{m} \left( -y^{(i)} \log \hat{y}^{(i)} - (1 - y^{(i)}) \log(1 - \hat{y}^{(i)}) \right)


Cross Entropy Loss Function

How does it work?

e = \sum_{i=1}^{m} \left( -y^{(i)} \log \hat{y}^{(i)} - (1 - y^{(i)}) \log(1 - \hat{y}^{(i)}) \right)

When y^{(i)} = 1, only the -\log \hat{y}^{(i)} term is active: it is near 0 when \hat{y}^{(i)} is close to 1 and grows without bound as \hat{y}^{(i)} approaches 0. The case y^{(i)} = 0 is symmetric, penalizing \hat{y}^{(i)} close to 1.
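A sketch of the loss in Python; the epsilon clip is an added safeguard against log(0), not part of the formula:

import numpy as np

def cross_entropy(y, y_hat, eps=1e-12):
    # Sum over instances of -y*log(y_hat) - (1 - y)*log(1 - y_hat).
    y_hat = np.clip(y_hat, eps, 1.0 - eps)
    return np.sum(-y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat))

y = np.array([1.0, 0.0])
print(cross_entropy(y, np.array([0.9, 0.2])))  # confident and right: small loss
print(cross_entropy(y, np.array([0.1, 0.8])))  # confident and wrong: much larger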


Cross Entropy Loss Function


How to find the gradient? Let's try! We use \hat{y}^{(i)} = \sigma(z^{(i)}) with z^{(i)} = w_0 + w_1 x_1^{(i)} + \cdots + w_n x_n^{(i)}, so that \partial z^{(i)} / \partial w_0 = 1:

\frac{\partial e}{\partial w_0}
  = \frac{\partial}{\partial w_0} \sum_{i=1}^{m} \left( -y^{(i)} \log \hat{y}^{(i)} - (1 - y^{(i)}) \log(1 - \hat{y}^{(i)}) \right)
  = \sum_{i=1}^{m} \left( -y^{(i)} \frac{1}{\hat{y}^{(i)}} \hat{y}^{(i)} (1 - \hat{y}^{(i)}) \cdot 1 - (1 - y^{(i)}) \frac{1}{1 - \hat{y}^{(i)}} (-1) \hat{y}^{(i)} (1 - \hat{y}^{(i)}) \cdot 1 \right)
  = \sum_{i=1}^{m} \left( -y^{(i)} + y^{(i)} \hat{y}^{(i)} + \hat{y}^{(i)} - y^{(i)} \hat{y}^{(i)} \right) \cdot 1
  = \sum_{i=1}^{m} \left( \hat{y}^{(i)} - y^{(i)} \right) \cdot 1    (1)

In a similar way,

\frac{\partial e}{\partial w_j} = \sum_{i=1}^{m} \left( \hat{y}^{(i)} - y^{(i)} \right) x_j^{(i)}    (2)

Now the same gradient descent will work!
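The result can be sanity-checked numerically. This sketch (the data and names are made up purely for the check) compares the analytic gradient X^T (ŷ − y) against a central finite-difference estimate of the loss:

import numpy as np

rng = np.random.default_rng(0)
m, n = 5, 3
X = np.hstack([np.ones((m, 1)), rng.normal(size=(m, n))])  # x0 = 1 bias column
y = rng.integers(0, 2, size=m).astype(float)
w = rng.normal(size=n + 1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def loss(w):
    p = sigmoid(X @ w)
    return np.sum(-y * np.log(p) - (1 - y) * np.log(1 - p))

analytic = X.T @ (sigmoid(X @ w) - y)          # equations (1) and (2)

h = 1e-6
numeric = np.array([
    (loss(w + h * np.eye(n + 1)[j]) - loss(w - h * np.eye(n + 1)[j])) / (2 * h)
    for j in range(n + 1)
])
print(np.max(np.abs(analytic - numeric)))      # tiny (~1e-8): the derivation checks out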


Comments on Gradient Descent

1 It is slow when the dataset is too large!
2 Rather than learning from the whole dataset at once, it is possible to learn in chunks!
3 What if we process only a single item at each iteration?
4 Let's have another look!


Gradient Descent Algorithm

GradientDescent(X, y, α, maxIter)

    for i = 1 to m
        x0(i) = 1                              // add the bias feature to every instance
    initialize w0, w1, ..., wn randomly
    for iter = 1 to maxIter
        for j = 0 to n
            slopej = 0
        for i = 1 to m
            ŷ = σ(w0 + w1 x1(i) + w2 x2(i) + ... + wn xn(i))
            e = ŷ − y(i)
            for j = 0 to n
                slopej = slopej + e × xj(i)
        for j = 0 to n
            wj = wj − α × slopej
    return w0, w1, ..., wn
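A runnable NumPy version of this algorithm, as a sketch (the function and variable names are mine); the sigmoid is applied to the linear combination, matching the derivation above:

import numpy as np

def gradient_descent(X, y, alpha=0.01, max_iter=1000):
    # Batch gradient descent for logistic regression.
    # X: (m, n) feature matrix, y: (m,) labels in {0, 1}.
    m, n = X.shape
    Xb = np.hstack([np.ones((m, 1)), X])           # prepend x0 = 1 for w0
    w = np.random.default_rng(0).normal(size=n + 1)
    for _ in range(max_iter):
        y_hat = 1.0 / (1.0 + np.exp(-(Xb @ w)))    # sigmoid of the linear part
        slope = Xb.T @ (y_hat - y)                 # gradient, eqs. (1)-(2)
        w = w - alpha * slope
    return w

In practice, scaling the features first helps here: a large raw feature (like the fish weight) would otherwise dominate the gradient.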


Lighter Gradient Descent Algorithm

LighterGradientDescent(X, y, α, maxIter)

    for i = 1 to m
        x0(i) = 1
    initialize w0, w1, ..., wn randomly
    for iter = 1 to maxIter
        i = ((iter − 1) mod m) + 1             // one instance per iteration, cycling through the data
        ŷ = σ(w0 + w1 x1(i) + w2 x2(i) + ... + wn xn(i))
        e = ŷ − y(i)
        for j = 0 to n
            slopej = e × xj(i)
        for j = 0 to n
            wj = wj − α × slopej
    return w0, w1, ..., wn

Processing a single instance per update is exactly stochastic gradient descent.
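A matching sketch of the stochastic variant (again with made-up names): the only change from the batch version is that each iteration updates from a single instance, so one update costs O(n) instead of O(mn).

import numpy as np

def lighter_gradient_descent(X, y, alpha=0.01, max_iter=1000):
    # Stochastic gradient descent: one instance per iteration.
    m, n = X.shape
    Xb = np.hstack([np.ones((m, 1)), X])
    w = np.random.default_rng(0).normal(size=n + 1)
    for it in range(max_iter):
        i = it % m                                  # cycle through the data
        y_hat = 1.0 / (1.0 + np.exp(-(Xb[i] @ w)))
        w = w - alpha * (y_hat - y[i]) * Xb[i]      # update from one example
    return w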


That's it!

Thank you

