IA c06 NoAnim
(some slides adapted from A. Dragan, N. Kitaev, N. Lambert, S. Levine, S. Rao, S. Russell)
Outline
Adversarial Search
Minimax algorithm
Examples
I games: chess, Go, poker
I economics: actors can increase demand, supply, etc.
Two-player zero-sum games
Note on terminology
I move = action
I ply = one move by one player
I position = state
I players: MAX, MIN (MAX moves first, then alternate)
Formal definition of a game
Elements
I S0 : initial state (game setup at start)
I TO-MOVE(s): The player whose turn it is to move in state s
I ACTIONS(s): The set of legal moves in state s
I RESULT(s,a): The transition model (defines the state
resulting from taking action a in state s)
I IS-TERMINAL(s): A test which is true when in a terminal
state (the game is over) and false otherwise
I UTILITY(s,p): A utility function (aka objective function, aka
payoff function), defining the final numeric value to player p
when the game ends in terminal state s
E.g., chess: 1, 0, or 1/2 (and chess is a zero-sum game!)
Tic-Tac-Toe Game tree
Minimax values
MINIMAX(s) =
  UTILITY(s, MAX)  if IS-TERMINAL(s)
  max_{a∈ACTIONS(s)} MINIMAX(RESULT(s, a))  if TO-MOVE(s) = MAX
  min_{a∈ACTIONS(s)} MINIMAX(RESULT(s, a))  if TO-MOVE(s) = MIN
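The recursion above can be sketched directly in Python. The explicit game tree, its leaf values, and the `minimax` helper below are illustrative assumptions, not from the slides:

```python
# Minimax over an explicit game tree: a terminal node is a number
# (the utility for MAX), an internal node is a list of child nodes.
# Players alternate, MAX moving first, as in the slides.

def minimax(state, to_move="MAX"):
    if isinstance(state, (int, float)):   # IS-TERMINAL: utility for MAX
        return state
    next_player = "MIN" if to_move == "MAX" else "MAX"
    values = [minimax(child, next_player) for child in state]
    return max(values) if to_move == "MAX" else min(values)

# Three MIN branches with MIN-values 3, 2, 2; MAX picks the first.
tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
print(minimax(tree))  # 3
```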
Minimax and optimality
Iterative deepening
I search one ply deep and record a ranking of the moves
based on their evaluations
I then search one ply deeper, using the previous ranking to
inform move ordering; and so on
This also makes it easy to respect time limits: when time runs
out, return the best move from the last completed iteration
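A minimal sketch of this control loop, assuming a placeholder `evaluate(move, depth)` that stands in for a real fixed-depth search:

```python
import time

def iterative_deepening(moves, evaluate, time_budget=1.0, max_depth=8):
    """Search depth 1, 2, 3, ... until time runs out (or max_depth),
    reordering the root moves by the previous iteration's scores."""
    deadline = time.monotonic() + time_budget
    ordering = list(moves)
    best = ordering[0]
    for depth in range(1, max_depth + 1):
        if time.monotonic() >= deadline:
            break                          # keep the last completed result
        # evaluate(move, depth) stands in for a fixed-depth minimax search
        scored = sorted(((evaluate(m, depth), m) for m in ordering),
                        reverse=True)
        ordering = [m for _, m in scored]  # best moves first next iteration
        best = ordering[0]
    return best
```

With a toy evaluator that prefers moves close to 5, `iterative_deepening([1, 5, 9], lambda m, d: -abs(m - 5))` returns 5.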
Killer moves
Killer moves = moves that proved best elsewhere at the same
depth (e.g., caused a beta cutoff).
These moves should be tried first
Effectiveness of alpha-beta pruning
Transpositions
Different permutations of a move sequence that end up in the
same position
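Alpha-beta pruning with a simple transposition table can be sketched as below. The explicit-tree representation (terminals are numbers, internal nodes are tuples of children) is an illustrative assumption, and a real table would also record whether a stored value is exact or only a bound:

```python
# Alpha-beta with a transposition table. Here the (hashable) state
# itself serves as the table key; a real engine would use a position
# hash (e.g., Zobrist) and store exact/bound flags with each value.

def alphabeta(state, alpha=float("-inf"), beta=float("inf"),
              maximizing=True, table=None):
    if table is None:
        table = {}
    if isinstance(state, (int, float)):    # terminal: utility for MAX
        return state
    if state in table:                     # transposition: seen before
        return table[state]
    if maximizing:
        value = float("-inf")
        for child in state:
            value = max(value, alphabeta(child, alpha, beta, False, table))
            alpha = max(alpha, value)
            if alpha >= beta:              # beta cutoff: MIN avoids this line
                break
    else:
        value = float("inf")
        for child in state:
            value = min(value, alphabeta(child, alpha, beta, True, table))
            beta = min(beta, value)
            if alpha >= beta:              # alpha cutoff
                break
    table[state] = value
    return value

tree = ((3, 12, 8), (2, 4, 6), (14, 5, 2))
print(alphabeta(tree))  # 3, as with plain minimax, but with fewer visits
```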
Types of strategies
I Type A: consider all possible moves to a certain depth,
then use a heuristic evaluation function to estimate the
utility of states at that depth. It explores a wide but shallow
portion of the tree.
I Type B: ignore moves that look bad and follow promising
lines "as far as possible". It explores a deep but narrow
portion of the tree.
Heuristic Alpha-Beta Tree Search
Terminal states:
EVAL(s,p) = UTILITY(s,p)
Non-terminal states:
UTILITY(loss,p) ≤ EVAL(s,p) ≤ UTILITY(win,p)
Notes:
I the weights w_i of the evaluation function can be estimated
via Machine Learning
I the correlation between evaluation and chances of winning
need not be linear: if s is twice as likely to win as s',
we only require EVAL(s) > EVAL(s'),
not EVAL(s) = 2*EVAL(s')
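The w_i here are typically the weights of a weighted linear evaluation, EVAL(s) = Σ_i w_i · f_i(s) — a standard form, assumed here. A toy material-count sketch (piece values and features are assumptions for illustration):

```python
# Weighted linear evaluation: EVAL(s) = sum_i w_i * f_i(s).
# Features f_i are material differences (White count minus Black count);
# the weights are the textbook piece values, an assumption.
WEIGHTS = {"pawn": 1, "knight": 3, "bishop": 3, "rook": 5, "queen": 9}

def evaluate(features):
    """features[piece] = (white count) - (black count)."""
    return sum(WEIGHTS[piece] * diff for piece, diff in features.items())

# White up a rook, Black up two pawns: 5 - 2 = +3 for White.
print(evaluate({"rook": 1, "pawn": -2}))  # 3
```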
Cutting off search
Quiescent positions
Apply the evaluation function only to quiescent positions:
positions with no pending move (e.g., capturing the queen)
that might swing the evaluation
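A common sketch of quiescence search: at the depth cutoff, instead of calling EVAL immediately, keep expanding only the "noisy" moves (captures) until the position is quiet. `evaluate` and `captures` are hypothetical placeholders for a real evaluator and capture generator:

```python
def quiescence(state, evaluate, captures, maximizing=True):
    """Extend the search through capture moves only; either side may
    also 'stand pat', i.e. accept the static evaluation as-is."""
    stand_pat = evaluate(state)            # value if we stop here
    best = stand_pat
    for child in captures(state):          # only pending tactical moves
        value = quiescence(child, evaluate, captures, not maximizing)
        best = max(best, value) if maximizing else min(best, value)
    return best

# Toy example: position 10 has two captures leading to values 4 and 7.
caps = {10: [4, 7]}
print(quiescence(10, lambda s: s, lambda s: caps.get(s, []), True))   # 10
print(quiescence(10, lambda s: s, lambda s: caps.get(s, []), False))  # 4
```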
Cutting off search
Quiescence
In (b), Black is ahead by K+2P, but the queen capture will
change this
Cutting off search
Horizon effect
The black bishop is doomed, but pawn sacrifices can push the
loss over the horizon (→ so the search considers them good)
Cutting off search
Horizon effect
Chess
I Branching factor: 35 → 35^5 ≈ 5 ∗ 10^7
I minimax search: 5 ply, not more → average human player
I alpha–beta search + large transposition table → 14 ply
(expert level)
I for grandmaster status, we need: an extensively tuned
evaluation function + a large database of endgame moves
I STOCKFISH: all of the above → 30 ply (> the ability of any
human player)
Search v. lookup