
IT3160E

Introduction to Artificial Intelligence

Chapter 3: Problem solving


Advanced search methods

Lê Thanh Hương
School of Information and Communication Technology - HUST
Outline

• Local beam search


• Games and search
• Alpha-beta pruning

2
Local beam search

• Like greedy search, but keep k states at all times:


• Initially: k random states
• Next: determine all successors of the k states
• If any successor is a goal → finished
• Else select the k best successors and repeat (see the sketch below).

[Figure: Greedy Search vs. Beam Search]
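A minimal Python sketch of this loop, assuming problem-specific helpers random_state, successors, value, and is_goal (illustrative names, not from the slides):

```python
def local_beam_search(random_state, successors, value, is_goal, k=10, max_iters=1000):
    """Keep the k best states at each step instead of a single current state."""
    beam = [random_state() for _ in range(k)]          # initially: k random states
    for _ in range(max_iters):
        # determine all successors of the k states, pooled together
        candidates = [s for state in beam for s in successors(state)]
        for s in candidates:
            if is_goal(s):                             # any successor is a goal -> finished
                return s
        if not candidates:
            break
        # else select the k best successors and repeat
        beam = sorted(candidates, key=value, reverse=True)[:k]
    return max(beam, key=value)                        # fall back to the best state seen
```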

3
Local beam search

• Major difference from random-restart search:


• Information is shared among the k search threads: if one state generates a good successor
but the others do not → “come here, the grass is greener!”

• Can suffer from a lack of diversity.


• Stochastic variant: choose k successors randomly, with probability proportional to state value.

• The best choice in MANY practical settings

4
Games and search

• Why study games?


• Why is search a good idea?

• Major assumptions about games:


• Only an agent’s actions change the world
• World is deterministic and accessible

5
Why study games?

• Machines are better than humans at: Othello
• Humans are better than machines at: Go
• Here: perfect-information, zero-sum games

6
Why study games?

• Games are a form of multi-agent environment


• What do other agents do and how do they affect our success?
• Cooperative vs. competitive multi-agent environments.
• Competitive multi-agent environments give rise to adversarial search a.k.a. games

• Why study games?


• Fun; historically entertaining
• Interesting subject of study because they are hard
• Easy to represent, and agents are restricted to a small number of actions

7
Relation of Games to Search

• Search – no adversary
• Solution is a (heuristic) method for finding a goal
• Heuristics and CSP techniques can find the optimal solution
• Evaluation function: estimate of the cost from start to goal through a given node
• Examples: path planning, scheduling activities
• Games – adversary
• Solution is a strategy (a strategy specifies a move for every possible opponent reply)
• Time limits force an approximate solution
• Evaluation function: evaluates the “goodness” of a game position
• Examples: chess, checkers, Othello, backgammon
• Ignoring computational complexity, games are a perfect application for a complete search.
• Of course, ignoring complexity is a bad idea, so games are a good place to study resource-
bounded searches.

8
Types of Games

                        deterministic                     chance
perfect information     chess, checkers, go, othello      backgammon, monopoly
imperfect information   battleships, blind tic-tac-toe    bridge, poker, scrabble, nuclear war

9
Minimax

• Two players: MAX and MIN


• MAX moves first and they take turns until the game is over. The winner gets a reward, the
loser gets a penalty.
• Games as search:
• Initial state: e.g. board configuration of chess
• Successor function: list of (move, state) pairs specifying legal moves
• Terminal test: is the game finished?
• Utility function: gives a numerical value for terminal states
• E.g. win (+1), lose (−1) and draw (0) in tic-tac-toe
• MAX uses the search tree to determine its next move.
• Perfect play for deterministic games

10
Minimax

• From among the moves available to you, take the best one
• The best one is determined by a search using the MiniMax strategy

11
Optimal strategies

◼ MAX maximizes a function: find a move corresponding to the max value


◼ MIN minimizes the same function: find a move corresponding to the min value
At each step:
◼ If a state/node corresponds to a MAX move, its value is the maximum value of its
children
◼ If a state/node corresponds to a MIN move, its value is the minimum value of its
children
Given a game tree, the optimal strategy can be determined by using the minimax value of
each node:

MINIMAX-VALUE(n) =
  UTILITY(n)                                   if n is a terminal node
  max_{s ∈ successors(n)} MINIMAX-VALUE(s)     if n is a MAX node
  min_{s ∈ successors(n)} MINIMAX-VALUE(s)     if n is a MIN node

12
Minimax

13
Minimax algorithm
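A minimal Python sketch of the minimax recurrence above, assuming illustrative helpers is_terminal, utility, and successors (returning (move, state) pairs, as in the game formulation on slide 10):

```python
def minimax_value(state, is_terminal, utility, successors, maximizing):
    # Return the minimax value of `state`: MAX maximizes, MIN minimizes.
    if is_terminal(state):
        return utility(state)
    values = [minimax_value(s, is_terminal, utility, successors, not maximizing)
              for _, s in successors(state)]      # successors yields (move, state) pairs
    return max(values) if maximizing else min(values)

def minimax_decision(state, is_terminal, utility, successors):
    # MAX picks the move whose resulting state has the highest minimax value.
    move, _ = max(successors(state),
                  key=lambda ms: minimax_value(ms[1], is_terminal, utility,
                                               successors, False))
    return move
```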

14
Properties of minimax

• Complete? Yes (if tree is finite)


• Optimal? Yes (against an optimal opponent)
• Time complexity? O(b^m)
• Space complexity? O(bm) (depth-first exploration)

• For chess, b ≈ 35, m ≈100 for "reasonable" games


→ exact solution completely infeasible

15
Problem of minimax search

• The number of game states is exponential in the number of moves.


➢ Solution: do not examine every node
➢ Alpha-beta pruning:
• Remove branches that do not influence the final decision
• Revisit the example …

16
α-β pruning

◼ Alpha value (α): the best value achievable so far for MAX, hence the max value so far

◼ Beta value (β): the best value achievable so far for MIN, hence the min value so far

◼ At a MIN level: compare the current value V of the node to α. If V ≤ α, pass the value
to the parent node and BREAK (MAX already has a better option elsewhere)

◼ At a MAX level: compare the current value V of the node to β. If V ≥ β, pass the value
to the parent node and BREAK (MIN already has a better option elsewhere)

17
α-β pruning

α: the best value achievable for MAX

β: the best value achievable for MIN

18
α-β pruning example

At a MAX node: compare the current value V to β. If V ≥ β, pass the value to the parent
node and BREAK (prune)

19
α-β pruning example

20
α-β pruning example

21
α-β pruning example

22
Properties of α-β

• Pruning does not affect final result


• Entire sub-trees can be pruned.
• Good move ordering improves effectiveness of pruning. With "perfect ordering":
➢ time complexity = O(b^(m/2))
→ doubles the depth of search
➢ effective branching factor of √b !!
➢ Alpha-beta pruning can look twice as far as minimax in the same amount of time

• Repeated states are again possible.


➢ Store them in memory = transposition table

• A simple example of the value of reasoning about which computations are relevant (a
form of metareasoning)

23
Why is it called α-β?

• α is the value of the best (i.e., highest-value) choice found so far at any
choice point along the path for max
• If v is worse than α, max will avoid it
→ prune that branch
• Define β similarly for min

24
The α-β algorithm
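A minimal Python sketch of alpha-beta, with the same illustrative helpers as the minimax sketch above; the two `return v` lines are the MIN-level and MAX-level cutoffs from slide 17:

```python
import math

def alphabeta(state, is_terminal, utility, successors, maximizing,
              alpha=-math.inf, beta=math.inf):
    """Minimax with alpha-beta pruning; returns the same value as plain minimax."""
    if is_terminal(state):
        return utility(state)
    if maximizing:
        v = -math.inf
        for _, s in successors(state):
            v = max(v, alphabeta(s, is_terminal, utility, successors, False, alpha, beta))
            if v >= beta:           # MIN above will never let play reach here: prune
                return v
            alpha = max(alpha, v)
        return v
    v = math.inf
    for _, s in successors(state):
        v = min(v, alphabeta(s, is_terminal, utility, successors, True, alpha, beta))
        if v <= alpha:              # MAX above already has a better option: prune
            return v
        beta = min(beta, v)
    return v
```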

25
The α-β algorithm

26
Imperfect, real-time decisions

• Minimax and alpha-beta pruning require too many leaf-node evaluations.

• This may be impractical within a reasonable amount of time.

• Suppose we have 100 secs and can explore 10^4 nodes/sec


→ 10^6 nodes per move

• Standard approach (Shannon, 1950):


• Cut off search earlier (replace TERMINAL-TEST by CUTOFF-TEST)
• Apply heuristic evaluation function EVAL (replacing utility function of alpha-beta)

27
Cut-off search

• Change:
if TERMINAL-TEST(state) then return UTILITY(state)
into:
if CUTOFF-TEST(state,depth) then return EVAL(state)

• Introduces a fixed depth limit


• The limit is selected so that the amount of time used will not exceed what the rules of
the game allow.

• When the cut-off occurs, the evaluation function is applied (see the sketch below).
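A minimal sketch of the resulting depth-limited alpha-beta, reusing the same illustrative helpers as before; only the test and the leaf value change relative to the previous sketch:

```python
import math

def h_alphabeta(state, depth, cutoff_test, eval_fn, successors, maximizing,
                alpha=-math.inf, beta=math.inf):
    """Alpha-beta in which CUTOFF-TEST replaces TERMINAL-TEST and EVAL replaces UTILITY."""
    if cutoff_test(state, depth):                 # e.g. depth limit reached or terminal
        return eval_fn(state)
    if maximizing:
        v = -math.inf
        for _, s in successors(state):
            v = max(v, h_alphabeta(s, depth + 1, cutoff_test, eval_fn,
                                   successors, False, alpha, beta))
            if v >= beta:
                return v                          # prune
            alpha = max(alpha, v)
        return v
    v = math.inf
    for _, s in successors(state):
        v = min(v, h_alphabeta(s, depth + 1, cutoff_test, eval_fn,
                               successors, True, alpha, beta))
        if v <= alpha:
            return v                              # prune
        beta = min(beta, v)
    return v

# A typical fixed-depth cutoff test; the limit is chosen to respect the time budget.
def make_cutoff_test(limit, is_terminal):
    return lambda state, depth: depth >= limit or is_terminal(state)
```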

28
Heuristic evaluation (EVAL)

• Idea: produce an estimate of the expected utility of the game from a given
position.
• Requirements:
➢ EVAL should order terminal-nodes in the same way as UTILITY.
➢ Computation may not take too long.
➢ For non-terminal states the EVAL should be strongly correlated with the actual chance of
winning.
• Example (tic-tac-toe):
Evaluation e(p) for each state p:
e(p) = (# open rows, columns, diagonals for MAX)
     − (# open rows, columns, diagonals for MIN)
• A line is open for MAX if it contains no O, and open for MIN if it contains no X
(a sketch follows below)
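A small Python sketch of this evaluation, assuming the board is a 3×3 list of lists holding 'X' (MAX), 'O' (MIN), or None (an illustrative encoding):

```python
# All 8 lines of a 3x3 board: 3 rows, 3 columns, 2 diagonals
LINES = [[(r, c) for c in range(3)] for r in range(3)] + \
        [[(r, c) for r in range(3)] for c in range(3)] + \
        [[(i, i) for i in range(3)], [(i, 2 - i) for i in range(3)]]

def e(board):
    """e(p) = (# lines open for MAX/X) - (# lines open for MIN/O).
    A line is open for X if it contains no O, and open for O if it contains no X."""
    open_x = sum(all(board[r][c] != 'O' for r, c in line) for line in LINES)
    open_o = sum(all(board[r][c] != 'X' for r, c in line) for line in LINES)
    return open_x - open_o

# Example: X alone in the centre leaves all 8 lines open for X
# but only 4 (two rows, two columns) open for O -> e = 8 - 4 = +4
board = [[None] * 3 for _ in range(3)]
board[1][1] = 'X'
print(e(board))  # 4
```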

29
[Figure: the first two plies of the tic-tac-toe game tree, reduced by exploiting the
symmetry of the states. MAX (X) goes first, MIN (O) replies; the evaluation
e(p) = (# open lines for MAX) − (# open lines for MIN) is shown under each leaf.]

→ A kind of depth-first search

30
Evaluation function example

• For chess, typically linear weighted sum of features


Eval(s) = w1 f1(s) + w2 f2(s) + … + wn fn(s)
• e.g., w1 = 9 with
f1(s) = (number of white queens) – (number of black queens), etc.
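As a toy illustration of this weighted sum, a material-only evaluation; encoding a position as a plain string of piece letters (uppercase = white) is purely an assumption for the sketch:

```python
# Illustrative state encoding: a string of piece letters, uppercase = white.
def material_features(s):
    # f_i(s) = (# white pieces of a kind) - (# black pieces of that kind)
    return [s.count(w) - s.count(b)
            for w, b in [('Q', 'q'), ('R', 'r'), ('B', 'b'), ('N', 'n'), ('P', 'p')]]

WEIGHTS = [9, 5, 3, 3, 1]   # classical piece values: queen, rook, bishop, knight, pawn

def eval_chess(s):
    """Eval(s) = w1*f1(s) + w2*f2(s) + ... + wn*fn(s)"""
    return sum(w * f for w, f in zip(WEIGHTS, material_features(s)))

print(eval_chess("QRRBBNNPPPPPPPP" + "qrrbbnnppppppp"))  # white up one pawn -> +1
```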

31
Chess complexity

• A PC can search 200 million nodes per 3 minutes.


• Branching factor: ~35
• 35^5 ≈ 50 million
➢ using minimax, it could look ahead only ~5 plies, and would be defeated by an average
player, who plans 6–8 plies ahead.
• Does it work in practice?
• 4-ply ≈ human novice → hopeless chess player
• 8-ply ≈ typical PC, human master
• 12-ply ≈ Deep Blue, Kasparov
• To reach grandmaster level, one needs an extensively tuned evaluation function and a
large database of optimal opening and endgame play

32
Deterministic games in practice

• Checkers: Chinook ended 40-year-reign of human world champion Marion Tinsley in 1994.
Used a precomputed endgame database defining perfect play for all positions involving 8 or
fewer pieces on the board, a total of 444 billion positions.

• Chess: Deep Blue defeated human world champion Garry Kasparov in a six-game match in
1997. Deep Blue searches 200 million positions per second, uses very sophisticated
evaluation, and undisclosed methods for extending some lines of search up to 40 ply.

• Othello: human champions refuse to compete against computers, who are too good.

• Go: human champions refuse to compete against computers, who are too bad. In go, b > 300,
so most programs use pattern knowledge bases to suggest plausible moves.

33
Nondeterministic games

• Chance is introduced by dice, card-shuffling, coin-flipping, ...


• Example with coin-flipping:

[Figure: a coin-flipping game tree with chance nodes]

34
Backgammon

Possible moves: (5-10,5-11), (5-11,19-24),(5-10,10-16) and (5-11,11-16)

35
Expected minimax value

EXPECTED-MINIMAX-VALUE(n) =
  UTILITY(n)                                               if n is a terminal node
  max_{s ∈ successors(n)} EXPECTED-MINIMAX-VALUE(s)        if n is a MAX node
  min_{s ∈ successors(n)} EXPECTED-MINIMAX-VALUE(s)        if n is a MIN node
  Σ_{s ∈ successors(n)} P(s) · EXPECTED-MINIMAX-VALUE(s)   if n is a chance node

where P(s) is the probability that s occurs
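A minimal Python sketch of this recurrence, using an illustrative tuple encoding for the game tree:

```python
# A node is a tuple: ('terminal', value) | ('max', [children]) |
# ('min', [children]) | ('chance', [(prob, child), ...])  -- illustrative encoding
def expectiminimax(node):
    kind, data = node
    if kind == 'terminal':
        return data
    if kind == 'max':
        return max(expectiminimax(c) for c in data)
    if kind == 'min':
        return min(expectiminimax(c) for c in data)
    # chance node: expectation over successors, weighted by P(s)
    return sum(p * expectiminimax(c) for p, c in data)

# 50/50 coin flip between a terminal worth 4 and a MIN choice between 0 and 2
tree = ('chance', [(0.5, ('terminal', 4)),
                   (0.5, ('min', [('terminal', 0), ('terminal', 2)]))])
print(expectiminimax(tree))  # 0.5*4 + 0.5*0 = 2.0
```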

36
Games of imperfect information

• E.g., card games, where opponent's initial cards are unknown


• Typically we can calculate a probability for each possible deal
• Seems just like having one big dice roll at the beginning of the game
• Idea: compute the minimax value of each action in each deal, then choose the action with
highest expected value over all deals
• Special case: if an action is optimal for all deals, it's optimal.
• GIB, the current best bridge program, approximates this idea by:
➢ generating 100 deals consistent with the bidding information
➢ picking the action that wins the most tricks on average (see the sketch below)
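A sketch of this sampling scheme; sample_deal and score are illustrative placeholders for deal generation and per-deal evaluation (not GIB's actual API):

```python
def monte_carlo_action(actions, sample_deal, score, n_deals=100):
    """Approximate 'best average over all deals': sample deals consistent with
    what is known, score each action in each sampled deal, pick the best mean."""
    totals = {a: 0.0 for a in actions}
    for _ in range(n_deals):
        deal = sample_deal()            # e.g. a deal consistent with the bidding
        for a in actions:
            totals[a] += score(a, deal) # e.g. minimax value / tricks won in this deal
    return max(actions, key=lambda a: totals[a])
```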

37
