AI Module 3
Informed
Search
Chapter 4
Adapted from slides by Tim Finin and Marie desJardins.
Some material adopted from notes by Charles R. Dyer,
University of Wisconsin-Madison.
Outline
• Heuristic search
• Best-first search
– Greedy search
– Beam search
– A, A*
– Examples
• Memory-conserving variations of A*
• Heuristic functions
• Iterative improvement methods
– Hill climbing
– Simulated annealing
– Local beam search
– Genetic algorithms
• Online search
Heuristic
Webster's Revised Unabridged Dictionary (1913) (web1913)
Heuristic \Heu*ris"tic\, a. [Gr. εὑρίσκειν to discover.] Serving to discover or find
out.
The Free On-line Dictionary of Computing (15Feb98)
heuristic 1. <programming> A rule of thumb, simplification or educated
guess that reduces or limits the search for solutions in domains that are
difficult and poorly understood. Unlike algorithms, heuristics do not
guarantee feasible solutions and are often used with no theoretical
guarantee. 2. <algorithm> approximation algorithm.
From WordNet (r) 1.6
heuristic adj 1: (computer science) relating to or using a heuristic rule 2:
of or relating to a general formulation that serves to guide investigation
[ant: algorithmic] n : a commonsense rule (or set of rules) intended to
increase the probability of solving some problem [syn: heuristic rule,
heuristic program]
Informed methods add
domain-specific information
• Add domain-specific information to select the best path
along which to continue searching
• Define a heuristic function, h(n), that estimates the
“goodness” of a node n.
• Specifically, h(n) = estimated cost (or distance) of minimal
cost path from n to a goal state.
• The heuristic function is an estimate, computable from the current
state description using domain-specific information, of how close
we are to a goal.
Heuristics
• All domain knowledge used in the search is encoded in the
heuristic function h.
• Heuristic search is an example of a “weak method” because
of the limited way that domain-specific information is used to
solve the problem.
• Examples:
– Missionaries and Cannibals: Number of people on starting river bank
– 8-puzzle: Number of tiles out of place
– 8-puzzle: Sum of distances each tile is from its goal position
• In general:
– h(n) ≥ 0 for all nodes n
– h(n) = 0 implies that n is a goal node
– h(n) = infinity implies that n is a dead-end from which a goal cannot
be reached
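The two 8-puzzle heuristics listed above can be sketched directly. This is a minimal illustration; the tuple encoding of the board and the particular GOAL layout are assumptions for the example, not part of the slides.

```python
# The board is a 9-tuple read row by row, with 0 as the blank.
GOAL = (1, 2, 3, 8, 0, 4, 7, 6, 5)  # a goal layout common in this course's examples

def h_misplaced(state, goal=GOAL):
    """Number of tiles out of place (the blank is not counted)."""
    return sum(1 for s, g in zip(state, goal) if s != 0 and s != g)

def h_manhattan(state, goal=GOAL):
    """Sum of each tile's horizontal + vertical distance to its goal cell."""
    total = 0
    for i, tile in enumerate(state):
        if tile == 0:
            continue
        j = goal.index(tile)
        total += abs(i // 3 - j // 3) + abs(i % 3 - j % 3)
    return total
```

For the start state used later in the hill-climbing example (2 8 3 / 1 6 4 / 7 _ 5), `h_misplaced` gives 4, matching the slide's f = -4.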
Weak vs. strong methods
• We use the term weak methods to refer to methods that are
extremely general and not tailored to a specific situation.
• Examples of weak methods include
– Means-ends analysis is a strategy in which we try to represent the
current situation and where we want to end up and then look for ways to
shrink the differences between the two.
– Space splitting is a strategy in which we try to list the possible solutions
to a problem and then try to rule out classes of these possibilities.
– Subgoaling means to split a large problem into several smaller ones that
can be solved one at a time.
• Called “weak” methods because they do not take advantage of
more powerful domain-specific heuristics
Best-first search
• Order nodes on the nodes list by increasing
value of an evaluation function, f(n), that
incorporates domain-specific information in
some way.
• This is a generic way of referring to the class
of informed methods.
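The class can be sketched as a single loop around a priority queue ordered by f. The function names and successor interface below are illustrative assumptions, not from the slides; plugging in different f's yields greedy search (f = h), A* (f = g + h), and so on.

```python
import heapq
from itertools import count

def best_first_search(start, successors, is_goal, f):
    """Generic best-first search: repeatedly expand the open node with the
    smallest f value. successors(n) returns n's children."""
    tie = count()  # tie-breaker so heapq never has to compare nodes directly
    frontier = [(f(start), next(tie), start)]
    closed = set()
    while frontier:
        _, _, node = heapq.heappop(frontier)
        if is_goal(node):
            return node
        if node in closed:
            continue
        closed.add(node)
        for child in successors(node):
            if child not in closed:
                heapq.heappush(frontier, (f(child), next(tie), child))
    return None  # no goal reachable
```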
Greedy search
• Use as an evaluation function f(n) = h(n), sorting nodes by
increasing values of f.
• Selects the node believed to be closest (hence “greedy”) to a
goal node, i.e., the node with the smallest f value.
• Not complete.
(Figure: a small tree illustrating incompleteness, and the example
search graph with edge costs (g values) and heuristic estimates
(h values); G is the goal state.)
Example
n g(n) h(n) f(n) h*(n)
S 0 8 8 9
A 1 8 9 9
B 5 4 9 4
C 8 3 11 5
D 4 inf inf inf
E 8 inf inf inf
G 9 0 9 0
• h*(n) is the (hypothetical) perfect heuristic.
• Since h(n) ≤ h*(n) for all n, h is admissible
• Optimal path = S B G with cost 9.
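The admissibility claim can be checked mechanically against the table; the dictionaries below simply transcribe its h and h* columns.

```python
INF = float('inf')  # marks dead ends (D and E in the table)
H     = {'S': 8, 'A': 8, 'B': 4, 'C': 3, 'D': INF, 'E': INF, 'G': 0}
HSTAR = {'S': 9, 'A': 9, 'B': 4, 'C': 5, 'D': INF, 'E': INF, 'G': 0}

def is_admissible(h, hstar):
    """h is admissible iff it never overestimates the true cost h*."""
    return all(h[n] <= hstar[n] for n in h)
```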
Greedy search
f(n) = h(n)
node expanded nodes list
{ S(8) }
S { C(3) B(4) A(8) }
C { G(0) B(4) A(8) }
G { B(4) A(8) }
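The trace above can be reproduced by running best-first search with f(n) = h(n). The graph below is a partial reconstruction of the slide's example: the h values come from the table, while the successor lists are inferred from the trace, so treat them as assumptions.

```python
import heapq

GRAPH = {'S': ['A', 'B', 'C'], 'A': ['D', 'E'], 'B': ['G'], 'C': ['G'],
         'D': [], 'E': [], 'G': []}
H = {'S': 8, 'A': 8, 'B': 4, 'C': 3,
     'D': float('inf'), 'E': float('inf'), 'G': 0}

def greedy_search(start='S', goal='G'):
    """Greedy best-first search with f(n) = h(n); returns the node order
    S, C, G as in the trace (B and A are left unexpanded on the list)."""
    frontier = [(H[start], start)]
    order = []
    while frontier:
        _, node = heapq.heappop(frontier)
        if node == goal:
            return order + [node]
        order.append(node)
        for child in GRAPH[node]:
            heapq.heappush(frontier, (H[child], child))
    return order
```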
Height Defined by Evaluation Function
(Figure: a search landscape whose height is given by the evaluation function.)
Hill-climbing search
• If there exists a successor s for the current state n such that
– h(s) < h(n)
– h(s) ≤ h(t) for all the successors t of n,
• then move from n to s. Otherwise, halt at n.
• Looks one step ahead to determine if any successor is better
than the current state; if one is, move to the best successor.
• Similar to Greedy search in that it uses h, but does not
allow backtracking or jumping to an alternative path since it
doesn’t “remember” where it has been.
• Corresponds to Beam search with a beam width of 1 (i.e.,
the maximum size of the nodes list is 1).
• Not complete since the search will terminate at "local
minima," "plateaus," and "ridges."
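The halting rule above can be sketched in a few lines. This is an illustrative minimisation version (smaller h is better, matching the slide's condition h(s) < h(n)); the interface is an assumption, not from the slides.

```python
def hill_climb(state, successors, h):
    """Steepest-descent hill climbing: move to the best successor only if it
    strictly improves on the current state; otherwise halt. May stop at a
    local minimum, plateau, or ridge rather than a goal."""
    while True:
        succs = successors(state)
        if not succs:
            return state
        best = min(succs, key=h)
        if h(best) >= h(state):
            return state  # no successor strictly improves: halt here
        state = best
```

For example, on integer states with successors n-1 and n+1 and h(n) = |n - 5|, the climb from 0 walks straight to 5.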
Hill climbing example
(Figure: hill-climbing trace on the 8-puzzle with
f(n) = -(number of tiles out of place). The start state
  2 8 3
  1 6 4
  7 _ 5
has f = -4; successive best moves raise f step by step until the goal
  1 2 3
  8 _ 4
  7 6 5
is reached with f = 0.)
Exploring the Landscape
• Local maxima: peaks that aren’t the highest point in the space
• Plateaus: broad, flat regions of the space that give the search
algorithm no direction (random walk)
• Ridges: flat like a plateau, but with drop-offs to the sides
(Image from: http://classes.yale.edu/fractals/CA/GA/Fitness/Fitness.html)
(Figure: a local maximum on the 8-puzzle with f = -(Manhattan
distance). The start state has f = -6, but both available moves
(blank up, blank right) lead to states with f = -7, so hill
climbing halts even though the goal, with f = 0, is not reached.)
Gradient ascent / descent
• Competitive ratio = (path cost actually found*) / (path cost that could be found**)
  * on average, or in an adversarial (worst-case) scenario
  ** if the agent knew the nature of the space and could use offline search