Learning: Book: Artificial Intelligence, A Modern Approach (Russell & Norvig)
Learning
Forms of learning
• Any component of an agent can be improved by learning from
data. The improvements, and the techniques used to make them,
depend on four major factors:
– Which component is to be improved.
– What prior knowledge the agent already has.
– What representation is used for the data and the component.
– What feedback is available to learn from.
Agent components to be improved
– A direct mapping from conditions on the current state to actions
– A means to infer relevant properties of the world from the percept sequence
– Information about the way the world evolves and about the results of
possible actions the agent can take
– Utility information indicating the desirability of world states
– Action-value information indicating the desirability of actions
– Goals that describe classes of states whose achievement maximizes the
agent’s utility
Example:
• An agent training to become a taxi driver.
– Every time the instructor shouts “Brake!” the agent might learn a condition–action
rule for when to brake (component 1); the agent also learns every time the instructor
does not shout.
– By seeing many camera images that it is told contain buses, it can learn to recognize
them (2).
– By trying actions and observing the results—for example, braking hard on a wet
road—it can learn the effects of its actions (3).
– Then, when it receives no tip from passengers who have been thoroughly shaken up
during the trip, it can learn a useful component of its overall utility function (4).
Learning from Observations
• Supervised Learning – learn a function from a set of training examples, which are
pre-classified feature vectors.
– Data – instantiations of some or all of the random variables describing the domain;
they are evidence
– Hypotheses – probabilistic theories of how the domain works
feature vector (shape, color)    class
(square, red)                    I
(square, blue)                   I
(circle, red)                    II
(circle, blue)                   II
(triangle, red)                  I
(triangle, green)                I
(ellipse, blue)                  II
(ellipse, red)                   II

Unseen: (circle, green) ?   (triangle, blue) ?
Given a previously unseen feature vector, what is the rule that tells us
if it is in class I or class II?
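As an illustrative sketch (not part of the slides), one hypothesis consistent with this table classifies by shape alone; the tiny rule learner below simply memorizes the shape-to-class mapping seen in training and applies it to the unseen vectors.

```python
# Minimal sketch of supervised learning on the (shape, color) table above.
# Assumption (not stated in the slides): shape alone determines the class here.

training_data = [
    (("square",   "red"),   "I"),
    (("square",   "blue"),  "I"),
    (("circle",   "red"),   "II"),
    (("circle",   "blue"),  "II"),
    (("triangle", "red"),   "I"),
    (("triangle", "green"), "I"),
    (("ellipse",  "blue"),  "II"),
    (("ellipse",  "red"),   "II"),
]

# "Learn" a rule: record which class each shape was labeled with.
shape_to_class = {}
for (shape, color), label in training_data:
    shape_to_class[shape] = label

# Apply the learned rule to the unseen feature vectors.
for unseen in [("circle", "green"), ("triangle", "blue")]:
    shape, color = unseen
    print(unseen, "->", shape_to_class.get(shape, "unknown"))
# (circle, green) -> II and (triangle, blue) -> I under this hypothesis
```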
Learning from Observations
• Unsupervised Learning – No classes are given. The idea is to find
patterns in the data. This generally involves clustering.
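As a rough sketch of what clustering means (purely illustrative, not from the slides), a very small k-means style procedure groups 1-D points without any class labels:

```python
# Tiny k-means sketch on 1-D data: group points without any class labels.
# Illustrative only; real clustering would use a library and richer data.

points = [1.0, 1.2, 0.8, 5.1, 4.9, 5.3]
centers = [points[0], points[3]]           # naive initialization: two seed points

for _ in range(10):                        # a few refinement iterations
    clusters = [[], []]
    for p in points:                       # assign each point to the nearest center
        nearest = min(range(2), key=lambda i: abs(p - centers[i]))
        clusters[nearest].append(p)
    for i in range(2):                     # move each center to its cluster's mean
        if clusters[i]:
            centers[i] = sum(clusters[i]) / len(clusters[i])

print("centers:", centers)                 # roughly [1.0, 5.1]
print("clusters:", clusters)
```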
Bayesian Learning
• Bayesian learning calculates the probability of each hypothesis, given the data, and
makes predictions on that basis.
• That is, predictions are made by using all the hypotheses, weighted by their
probabilities, rather than by using just a single “best” hypothesis.
• The probability of each hypothesis is obtained by Bayes’ rule.
Bayes’ Rule
• This simple equation underlies most modern AI systems for probabilistic inference.

    P(h | X) = P(X | h) P(h) / P(X)

  (P(X) is often assumed constant and left out.)

• h is the hypothesis (such as the class).
• X is the feature vector to be classified.
• P(X | h) is the probability that this feature vector occurs, given that h is true (the likelihood).
• P(h) is the prior probability of hypothesis h.
• P(X) is the prior probability of the feature vector X.
Example
• Say that you have this (tiny) dataset that classifies animals into two classes:
cat and dog.
• For example: the probability of the example being a cat, given that hair color is black,
body length is 18 inches, height is 9.2, weight is 8.1 lb, …
• The conditional probability is, generically, P(class | feature set). In our example, classes =
{cat, dog} and feature set = {hair color, body length, height, weight, ear length, claws}.
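As an illustrative sketch (with made-up numbers, not from the slides): a naive Bayes classifier applies Bayes’ rule with the simplifying assumption that features are independent given the class, so P(class | features) is proportional to P(class) times the product of P(feature | class).

```python
# Hedged sketch of naive Bayes for P(class | features) on a cat/dog style problem.
# The priors and per-feature likelihoods below are invented for illustration only.

priors = {"cat": 0.5, "dog": 0.5}

# P(feature value | class), assumed conditionally independent given the class.
likelihoods = {
    "cat": {"hair=black": 0.40, "length<=20in": 0.90, "weight<=10lb": 0.85},
    "dog": {"hair=black": 0.30, "length<=20in": 0.35, "weight<=10lb": 0.30},
}

observed = ["hair=black", "length<=20in", "weight<=10lb"]

# Unnormalized posterior: P(class) * product of P(feature | class).
scores = {}
for cls in priors:
    score = priors[cls]
    for f in observed:
        score *= likelihoods[cls][f]
    scores[cls] = score

total = sum(scores.values())                  # P(X), the normalizer
posteriors = {cls: s / total for cls, s in scores.items()}
print(posteriors)                             # cat is far more probable here
```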
Choosing Hypothesis
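One standard criterion for this choice, following Bayes’ rule above, is to pick the maximum a posteriori (MAP) hypothesis; with a uniform prior over hypotheses this reduces to the maximum likelihood (ML) hypothesis. A sketch of the usual formulation:

```latex
% MAP and ML hypotheses (standard formulation; notation follows the Bayes' rule slide)
h_{MAP} = \arg\max_{h \in H} P(h \mid X) = \arg\max_{h \in H} P(X \mid h)\, P(h)

h_{ML}  = \arg\max_{h \in H} P(X \mid h)   % when the prior P(h) is uniform over H
```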
Cancer Test Example
• Does patient have cancer or not?
– A patient takes a lab test and the result comes back positive. The test returns a correct
positive result in only 98% of the cases in which the disease is actually present, and a
correct negative result in only 97% of the cases in which the disease is not present.
Furthermore, .008 of the entire population have this cancer.
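Plugging these numbers into Bayes’ rule gives a perhaps surprising answer: the posterior probability of cancer given a positive test is only about 21%. A small sketch of the arithmetic:

```python
# Bayes' rule on the cancer-test numbers above.
p_cancer = 0.008                      # prior: P(cancer)
p_pos_given_cancer = 0.98             # correct positive rate: P(+ | cancer)
p_pos_given_no_cancer = 1 - 0.97      # false-positive rate: P(+ | no cancer) = 0.03

# P(+) by the law of total probability.
p_pos = (p_pos_given_cancer * p_cancer
         + p_pos_given_no_cancer * (1 - p_cancer))

# Posterior P(cancer | +) = P(+ | cancer) P(cancer) / P(+).
p_cancer_given_pos = p_pos_given_cancer * p_cancer / p_pos
print(round(p_cancer_given_pos, 2))   # about 0.21
```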
Neural Networks
• Neural networks don’t have clear decision rules like decision trees, but they are highly
successful in many different applications (e.g. face detection).
• Knowledge is represented in numeric form.
Biological Neuron
• NET = X1W1 + X2W2 + ... + XnWn
• Out = f(NET)
  f(NET) = 0 if NET < T (the threshold), 1 otherwise
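A minimal sketch of this threshold unit in code (variable names and example values are assumed, not from the slides):

```python
# Threshold unit: weighted sum NET, then a step activation with threshold T.
def neuron_output(inputs, weights, threshold):
    net = sum(x * w for x, w in zip(inputs, weights))   # NET = x1*w1 + ... + xn*wn
    return 1 if net >= threshold else 0                 # f(NET): 0 below T, 1 otherwise

# Example: two inputs behaving like a logical AND when T = 1.5.
print(neuron_output([1, 1], [1.0, 1.0], 1.5))  # 1
print(neuron_output([1, 0], [1.0, 1.0], 1.5))  # 0
```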
Activation functions
• Activation functions are mathematical functions that determine the output of each
node (neuron) in a neural network.
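The slides do not list specific functions here; as an illustrative sketch, three commonly used activations:

```python
import math

# Three commonly used activation functions (illustrative, not an exhaustive list).
def step(x, threshold=0.0):           # hard threshold, as in the simple neuron above
    return 1.0 if x >= threshold else 0.0

def sigmoid(x):                        # squashes any real number into (0, 1)
    return 1.0 / (1.0 + math.exp(-x))

def relu(x):                           # passes positives through, zeroes out negatives
    return max(0.0, x)

for x in (-2.0, 0.0, 2.0):
    print(x, step(x), round(sigmoid(x), 3), relu(x))
```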
Architectures of NN
What do we mean by the architecture of a NN?
• The way in which neurons are connected together, for example:
  – Feed-forward NN
  – Recurrent NN
  – Symmetrically connected NN
Feed-forward example
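As a rough sketch (the layer sizes and weights below are assumed for illustration), a tiny feed-forward pass in which each layer computes a weighted sum of its inputs followed by an activation, and information flows strictly from inputs to outputs:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def layer(inputs, weights):
    # Each output neuron: sigmoid of its weighted sum of the inputs.
    return [sigmoid(sum(x * w for x, w in zip(inputs, row))) for row in weights]

# Illustrative 2-input -> 2-hidden -> 1-output network; weights are made up.
x = [0.5, -1.0]
w_hidden = [[0.8, -0.2],
            [0.4,  0.9]]
w_output = [[1.0, -1.5]]

hidden = layer(x, w_hidden)        # activations of the hidden layer
output = layer(hidden, w_output)   # final output, computed strictly forward
print(output)
```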
Perceptron
• The perceptron (or single-layer perceptron) is the simplest model of a
neuron that illustrates how a neural network works.
• The perceptron is a machine learning algorithm developed in 1957 by
Frank Rosenblatt and first implemented on the IBM 704.
How the Perceptron Works
• Example:
– The perceptron has three inputs x1, x2 and x3 and one output.
• Since the output of the perceptron can be either 0 or 1, this perceptron is an example
of a binary classifier.
The Formula
Let’s write out the formula that joins the inputs and the weights together to produce the output:
  weighted sum = w1x1 + w2x2 + w3x3
  Output = 1 if the weighted sum exceeds the threshold, 0 otherwise
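A minimal sketch of this computation (the inputs, weights, and threshold below are placeholders chosen only to show the call):

```python
# Perceptron forward pass: weighted sum of three inputs, then a 0/1 threshold.
def perceptron(x, w, threshold=0.0):
    weighted_sum = w[0] * x[0] + w[1] * x[1] + w[2] * x[2]
    return 1 if weighted_sum > threshold else 0

# Placeholder values: 0.5*1 + (-0.4)*0 + 0.3*1 = 0.8, which exceeds 0.6.
print(perceptron(x=[1, 0, 1], w=[0.5, -0.4, 0.3], threshold=0.6))  # 1
```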
END