0% found this document useful (0 votes)

65 views22 pages

Minimax and Alpha-Beta Reduction: Borrows From Spring 2006 CS 440 Lecture Slides

Minimax and Alpha-Beta Reduction are algorithms used to find the best move for games. Minimax searches a game tree to find the optimal move assuming the opponent plays optimally. Alpha-Beta Reduction prunes parts of the tree that don't affect the result, speeding up Minimax. It uses alpha and beta values to avoid searching subtrees where the result is already known. This allows games like chess to be played by computer in a reasonable time despite the large search space.

Uploaded by

Nguyen Khac Chien

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

65 views22 pages

Minimax and Alpha-Beta Reduction: Borrows From Spring 2006 CS 440 Lecture Slides

Uploaded by

Nguyen Khac Chien

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

Minimax and Alpha-Beta Reduction

Borrows from Spring 2006 CS 440 Lecture Slides

Motivation
Want to create programs to play games

Want to play optimally

Want to be able to do this in a reasonable amount of

time

Types of Games
Deterministic

Nondeterministic
(Chance)

Fully
Observable

Chess
Checkers
Go
Othello

Backgammon
Monopoly

Partially
Observable

Battleship

Card Games

Minimax is for deterministic, fully observable games

Basic Idea

Search problem

Searching a tree of the possible moves in order to find

the move that produces the best result
Depth First Search algorithm

Assume the opponent is also playing optimally

Try to guarantee a win anyway!

Required Pieces for Minimax

An initial state

Operators

Legal moves the player can make

Terminal Test

The positions of all the pieces

Whose turn it is

Determines if a state is a final state

Utility Function

Gives the utility of a game state

utility(State)

Examples

-1, 0, and +1, for Player 1 loses, draw, Player 1 wins,

respectively
Difference between the point totals for the two players
Weighted sum of factors (e.g. Chess)

utility(S) = w1f1(S) + w2f2(S) + ... + wnfn(S)

f1(S) = (Number of white queens) (Number of black queens),

w1 = 9
f2(S) = (Number of white rooks) (Number of black rooks),
w2 = 5
...

Two Agents

MAX

Wants to maximize the result of the utility function

Winning strategy if, on MIN's turn, a win is obtainable
for MAX for all moves that MIN can make

MIN

Wants to minimize the result of the evaluation function

Winning strategy if, on MAX's turn, a win is
obtainable for MIN for all moves that MAX can make

Basic Algorithm

Example

Coins game

There is a stack of N coins

In turn, players take 1, 2, or 3 coins from the stack
The player who takes the last coin loses

Coins Game: Formal Definition

Initial State: The number of coins in the stack

Operators:
1. Remove one coin
2. Remove two coins
3. Remove three coins

Terminal Test: There are no coins left on the stack

Utility Function: F(S)

F(S) = 1 if MAX wins, 0 if MIN wins

MAX
MIN

N=4
K=

1
N=3
K=

3
2

3
N=0
K=

2
N=1
K=

F(S)=1

1
N=3
K=

1
N=1
K=

2
N=0
K=

F(S)=0

1
N=0
K=
F(S)=1

F(S)=0

N=2
K=

N=1
K=

2
N=0
K=

1
N=1
K=

1
N=0
K=

F(S)=1

1
N=0
K=

F(S)=1

F(S)=0

MAX
MIN

1
N=3
K=0

Solution

N=4
K= 1

3
2

3
N=0
K= 1

2
N=1
K=0

F(S)=1

1
N=3
K= 0

1
N=1
K=1

2
N=0
K=0

F(S)=0

1
N=0
K= 1
F(S)=1

F(S)=0

N=2
K=1

N=2
K=0

N=1
K=1

2
N=0
K=1

1
N=1
K=0

1
N=0
K=1

F(S)=1

1
N=0
K=0

F(S)=1

F(S)=0

Analysis

Max Depth: 5
Branch factor: 3
Number of nodes: 15
Even with this trivial example, you can see that
these trees can get very big

Generally, there are O(bd) nodes to search for

Branch factor b: maximum number of moves from each

node
Depth d: maximum depth of the tree

Exponential time to run the algorithm!

How can we make it faster?

Alpha-Beta Pruning

Main idea: Avoid processing subtrees that have

no effect on the result
Two new parameters

: The best value for MAX seen so far

: The best value for MIN seen so far

is used in MIN nodes, and is assigned

in MAX nodes
is used in MAX nodes, and is assigned
in MIN nodes

Alpha-Beta Pruning

MAX (Not at level 0)

If a subtree is found with a value k greater

than the value of , then we do not need to
continue searching subtrees

MAX can do at least as good as k in this node,

so MIN would never choose to go here!

MIN

If a subtree is found with a value k less than

the value of , then we do not need to
continue searching subtrees

MIN can do at least as good as k in this node, so

MAX would never choose to go here!

Algorithm

MAX
MIN

1
N=3 =
K= =

=
=

3
N=0
K=

F(S)=1
=
=

N=4 =
K= =

2
N=1 =
K= =
1
N=3
K=
F(S)=0

2
1

=
=

N=2 =
K= =
1
N=1 =
K= =

3
N=2 =
K= =
2
N=0
K=

2
F(S)=1
N=0 =
K= =

1
F(S)=0
N=0 =
K= =
F(S)=1

=
=

1
N=1 =
K= =

N=1
K=
1
N=0
K=

1
F(S)=1

N=0 =
=
K= =

F(S)=0

MAX
MIN

N=4 =0
K=01 =

=0
=1

= 1
=

3
N=0
K=1

F(S)=1
=0
=1

N=3 =
N=2 =0
K=10 = 10
K = 1 0 = 0
=0
1
2
2
1
=
N=1 =0
N=2 =
N=0
N=1 =0
K=0 = 10 K=1 =0
K=1
K= =0

N=1
K=1
1
N=0
K= 1

1
N=3
K=0

1
2
F(S)=1
N=1 =1 N=0 =
K=1 =0 K= =

1
F(S)=1
N=0 =0 =0
K=0 =0 =

F(S)=0

1
F(S)=0
N=0 =1
K=1 =0
F(S)=1

F(S)=0

Nondeterministic Games

Minimax can also be used for nondeterministic

games (those that have an element of chance)
There is an additional node added (Random node)
Random node is between MIN and MAX (and
vice versa)
Make subtrees over all of the possibilities,and
average the results

Weighted coin
.6 Heads (1)
.4 Tails (0)

Example
N=2
K = 8.6

Random Node

0
K=5

K = .45 + .611 = 8.6

1
K = 11

K = .4*2 + .6*7 = 5
0
K=2

1
K=7

Our Project

We will focus on deterministic, two-player, fully

observable games
We will be trying to learn the evaluator function,
in order to save time when playing the game

Training on data from Minimax runs (Neural Network)

Having the program play against itself (Genetic
Algorithms)

Conclusion

Minimax finds optimal play for deterministic, fully

observable, two-player games
Alpha-Beta reduction makes it faster

Adversarial Search
No ratings yet
Adversarial Search
49 pages
Seminar PPT (Minimax Algorithm)
100% (1)
Seminar PPT (Minimax Algorithm)
35 pages
Mini Max
100% (1)
Mini Max
9 pages
AI Unit - 2
No ratings yet
AI Unit - 2
103 pages
Lec3-Adversarial Search
No ratings yet
Lec3-Adversarial Search
73 pages
New8 11
No ratings yet
New8 11
70 pages
Artificial Intelligence-Unit2
No ratings yet
Artificial Intelligence-Unit2
17 pages
CH 5 Adversarial Search
No ratings yet
CH 5 Adversarial Search
20 pages
05 Adversarial Search
No ratings yet
05 Adversarial Search
51 pages
Ai Unit 3
No ratings yet
Ai Unit 3
138 pages
Chapter05 4e
No ratings yet
Chapter05 4e
40 pages
Module 10
No ratings yet
Module 10
8 pages
Game Theory Unit IV
No ratings yet
Game Theory Unit IV
6 pages
21CSC206T Unit3
100% (1)
21CSC206T Unit3
138 pages
AI Unit3 Gameplaying
No ratings yet
AI Unit3 Gameplaying
43 pages
Adversarial Search: Game Playing: But, First Let's Talk About Heuristic Function
No ratings yet
Adversarial Search: Game Playing: But, First Let's Talk About Heuristic Function
33 pages
Chapter 3:game Theory: 3.1optimal Decision in Games
No ratings yet
Chapter 3:game Theory: 3.1optimal Decision in Games
17 pages
1 GamePlaying
No ratings yet
1 GamePlaying
30 pages
AAI Lecture 7 SP 25
No ratings yet
AAI Lecture 7 SP 25
51 pages
CS2201 7
No ratings yet
CS2201 7
56 pages
Minimax Alpha Beta Pruning
No ratings yet
Minimax Alpha Beta Pruning
15 pages
Unit2e Adversarial Search
No ratings yet
Unit2e Adversarial Search
26 pages
Lec03 Ai Chapter6 Adversarial Search and Game Playing Aima
No ratings yet
Lec03 Ai Chapter6 Adversarial Search and Game Playing Aima
52 pages
Artificial Intelligence: Gaming Algorithms
No ratings yet
Artificial Intelligence: Gaming Algorithms
26 pages
GamePlaying Minimax Unit-2 SPS
No ratings yet
GamePlaying Minimax Unit-2 SPS
72 pages
Game Playing - AI
No ratings yet
Game Playing - AI
25 pages
Chapter. 06 - Adversarial Search and Games - No Embedded Videos
No ratings yet
Chapter. 06 - Adversarial Search and Games - No Embedded Videos
51 pages
6CS4 AI Unit-2
No ratings yet
6CS4 AI Unit-2
77 pages
Lecture11 AdversarialSearch
No ratings yet
Lecture11 AdversarialSearch
74 pages
Min Max and Alpha Beta
No ratings yet
Min Max and Alpha Beta
43 pages
Min-Max and Alpha-Beta Pruning Algorithms
No ratings yet
Min-Max and Alpha-Beta Pruning Algorithms
7 pages
Unit 5 AI
No ratings yet
Unit 5 AI
80 pages
Unit 5
No ratings yet
Unit 5
15 pages
Module 3
No ratings yet
Module 3
18 pages
Adversarial Search
No ratings yet
Adversarial Search
20 pages
Adversial Search
No ratings yet
Adversial Search
101 pages
Adversarial Search MinMax Alpha Beta Pruning
No ratings yet
Adversarial Search MinMax Alpha Beta Pruning
43 pages
Recruiting Toolbox
100% (3)
Recruiting Toolbox
55 pages
AI Unit 4
No ratings yet
AI Unit 4
25 pages
Module-2 Lecture 7
100% (1)
Module-2 Lecture 7
21 pages
Games
No ratings yet
Games
41 pages
Adversarial Search
No ratings yet
Adversarial Search
36 pages
Basic 05 Games
No ratings yet
Basic 05 Games
74 pages
Game Playing in AI
No ratings yet
Game Playing in AI
12 pages
Unit 2 MinMaxScaling With Alpha Beta Pruning
No ratings yet
Unit 2 MinMaxScaling With Alpha Beta Pruning
24 pages
Lecture 7
No ratings yet
Lecture 7
62 pages
04 Games PDF
No ratings yet
04 Games PDF
77 pages
Game Playing
No ratings yet
Game Playing
60 pages
MCS 3201-Intelligent Systems: Gihan Seneviratne Gps@ucsc - LK
No ratings yet
MCS 3201-Intelligent Systems: Gihan Seneviratne Gps@ucsc - LK
88 pages
AI - Module 2 - Min Max & Alpha Beta Pruning
No ratings yet
AI - Module 2 - Min Max & Alpha Beta Pruning
11 pages
Unit 2 - Part 2
No ratings yet
Unit 2 - Part 2
18 pages
Adveserial Search
No ratings yet
Adveserial Search
29 pages
Adversarial Search Two - Persons Game: Russel Norvig (Text) Book and Patrick Henry Winston (Reference Book)
No ratings yet
Adversarial Search Two - Persons Game: Russel Norvig (Text) Book and Patrick Henry Winston (Reference Book)
71 pages
CSC-411-AI-lec6-Adversarial Search
No ratings yet
CSC-411-AI-lec6-Adversarial Search
38 pages
Automatix Art of RPA
50% (2)
Automatix Art of RPA
25 pages
CC511 Week 4
No ratings yet
CC511 Week 4
57 pages
Game Playing
No ratings yet
Game Playing
24 pages
Lec11&12-Adversarial Search
No ratings yet
Lec11&12-Adversarial Search
30 pages
4 Adversarial Search
No ratings yet
4 Adversarial Search
14 pages
اعطال شارب PDF
100% (1)
اعطال شارب PDF
13 pages
Kaspersky Key and Instruction
No ratings yet
Kaspersky Key and Instruction
4 pages
116 hw1
No ratings yet
116 hw1
5 pages
3 IT 35 Design Principles and Design Patterns
No ratings yet
3 IT 35 Design Principles and Design Patterns
8 pages
The Ultimate Beginners Guide To Fuzzy Logic
No ratings yet
The Ultimate Beginners Guide To Fuzzy Logic
17 pages
Resume Anitha (1 PDF
No ratings yet
Resume Anitha (1 PDF
2 pages
Automata: The Methods & The Madness: Angkor Wat, Cambodia
No ratings yet
Automata: The Methods & The Madness: Angkor Wat, Cambodia
7 pages
Data Acquisition Catalog en
No ratings yet
Data Acquisition Catalog en
17 pages
5marks C Programming Important Qa
No ratings yet
5marks C Programming Important Qa
54 pages
Digital Marketing and Social Networking in Business Environtment
No ratings yet
Digital Marketing and Social Networking in Business Environtment
32 pages
Floating Dry Dock Specification
No ratings yet
Floating Dry Dock Specification
2 pages
OOPs Concepts - What Is Aggregation in Java
No ratings yet
OOPs Concepts - What Is Aggregation in Java
15 pages
Vishwakarma Institute of Information Technology
No ratings yet
Vishwakarma Institute of Information Technology
4 pages
Chapter 1 NonlinearAdaptiveControl
No ratings yet
Chapter 1 NonlinearAdaptiveControl
28 pages
Deep Reinforcement Learning Based Computation Offloading and Resource Allocation For MEC
No ratings yet
Deep Reinforcement Learning Based Computation Offloading and Resource Allocation For MEC
6 pages
Sophos Enterprise Console Quick Startup Guide: 5.2 Product Version: March 2015 Document Date
No ratings yet
Sophos Enterprise Console Quick Startup Guide: 5.2 Product Version: March 2015 Document Date
28 pages
Alphabeta PDF
No ratings yet
Alphabeta PDF
4 pages
Code Clone Detection Using Sequential Pattern Mining: October 2015
No ratings yet
Code Clone Detection Using Sequential Pattern Mining: October 2015
10 pages
HybridDimensional Correa PDF
No ratings yet
HybridDimensional Correa PDF
9 pages
Intelligent Generic Statistical Query Mode: Article
No ratings yet
Intelligent Generic Statistical Query Mode: Article
6 pages
Class1 Cs
No ratings yet
Class1 Cs
3 pages
C Token's: Tarun Sharma Lecturer (Computer Science)
No ratings yet
C Token's: Tarun Sharma Lecturer (Computer Science)
30 pages
LAB 10-Loops and Files
No ratings yet
LAB 10-Loops and Files
7 pages
FMCG Market Share Global
No ratings yet
FMCG Market Share Global
1 page
p51 PDF
No ratings yet
p51 PDF
3 pages
Cyber Hunter-Installer
No ratings yet
Cyber Hunter-Installer
18 pages
Connection Overview Wincc V7.0 Sp2 Update3 (02/2011)
No ratings yet
Connection Overview Wincc V7.0 Sp2 Update3 (02/2011)
4 pages
16 April 2012 Nse
No ratings yet
16 April 2012 Nse
7 pages
Jake S Resume Anonymous
No ratings yet
Jake S Resume Anonymous
1 page
EQloc2 1
No ratings yet
EQloc2 1
5 pages
National Chung Cheng University Student Personal Information
No ratings yet
National Chung Cheng University Student Personal Information
1 page
FInal Quiz TQM
No ratings yet
FInal Quiz TQM
2 pages
Simulation and Modelling (R18 Syllabus) (19.06.2019)
No ratings yet
Simulation and Modelling (R18 Syllabus) (19.06.2019)
2 pages
The Psecret Psociety VAFL
From Everand
The Psecret Psociety VAFL
Mike Bozart
No ratings yet
The Virtual Reality Network Elimination Game: A Science Fiction Role Playing Game
From Everand
The Virtual Reality Network Elimination Game: A Science Fiction Role Playing Game
Rik Hunik
No ratings yet
Fun Online Games For Teens with Tips and Tricks: Ages 13 And Up: Games for Kids and Teens
From Everand
Fun Online Games For Teens with Tips and Tricks: Ages 13 And Up: Games for Kids and Teens
Baby Professor
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Minimax and Alpha-Beta Reduction: Borrows From Spring 2006 CS 440 Lecture Slides

Uploaded by

Minimax and Alpha-Beta Reduction: Borrows From Spring 2006 CS 440 Lecture Slides

Uploaded by

Minimax and Alpha-Beta Reduction

Borrows from Spring 2006 CS 440 Lecture Slides

Want to play optimally

Want to be able to do this in a reasonable amount of

Minimax is for deterministic, fully observable games

Searching a tree of the possible moves in order to find

Assume the opponent is also playing optimally

Try to guarantee a win anyway!

Required Pieces for Minimax

Legal moves the player can make

The positions of all the pieces

Determines if a state is a final state

Gives the utility of a game state

-1, 0, and +1, for Player 1 loses, draw, Player 1 wins,

utility(S) = w1f1(S) + w2f2(S) + ... + wnfn(S)

f1(S) = (Number of white queens) (Number of black queens),

Wants to maximize the result of the utility function

Wants to minimize the result of the evaluation function

There is a stack of N coins

Coins Game: Formal Definition

Initial State: The number of coins in the stack

Terminal Test: There are no coins left on the stack

F(S) = 1 if MAX wins, 0 if MIN wins

Generally, there are O(bd) nodes to search for

Branch factor b: maximum number of moves from each

Exponential time to run the algorithm!

Main idea: Avoid processing subtrees that have

: The best value for MAX seen so far

is used in MIN nodes, and is assigned

MAX (Not at level 0)

If a subtree is found with a value k greater

MAX can do at least as good as k in this node,

If a subtree is found with a value k less than

MIN can do at least as good as k in this node, so

Minimax can also be used for nondeterministic

K = .4*5 + .6*11 = 8.6

We will focus on deterministic, two-player, fully

Training on data from Minimax runs (Neural Network)

Minimax finds optimal play for deterministic, fully

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

K = .45 + .611 = 8.6