
CS 188 Introduction to Artificial Intelligence, Fall 2008 Midterm Exam


INSTRUCTIONS

• You have 80 minutes. 70 points total. Don’t panic!


• The exam is closed book, closed notes except a one-page crib sheet, non-programmable calculators only.

• Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a
brief explanation. All short answer sections can be successfully answered in a few sentences at most.
• Question 0: Fill out the following grid and write your name, SID, login, and GSI at the top of
each subsequent page. (-1 points if done incorrectly!)

Last Name: ____________________

First Name: ____________________

SID: ____________________

Login: ____________________

GSI: ____________________

All the work on this exam is my own. (please sign): ____________________

For staff use only:

        Q. 1    Q. 2    Q. 3    Q. 4    Q. 5    Total
        /12     /11     /15     /17     /15     /70



1. (12 points.) Search: Mr. and Ms. Pacman

Pacman and Ms. Pacman are lost in an N × N maze and would like to meet; they don’t care where. In each time
step, both simultaneously move in one of the following directions: {NORTH, SOUTH, EAST, WEST, STOP}.
They do not alternate turns. You must devise a plan which positions them together, somewhere, in as few
time steps as possible. Passing each other does not count as meeting; they must occupy the same square at
the same time.
(a) (4 points) Formally state this problem as a single-agent state-space search problem.
States:

Answer: The set of pairs of positions for Pacman and Ms. Pacman:
{((x1, y1), (x2, y2)) | x1, x2, y1, y2 ∈ {1, 2, . . . , N}}
Maximum size of state space:

Answer: N² positions for each pacman, hence N⁴ total


Maximum branching factor:

Answer: Each pacman has a choice of 5 actions, hence 5² = 25 total


Goal test:

Answer: isGoal((x1, y1), (x2, y2)) := (x1 = x2) ∧ (y1 = y2)


(b) (3 points) Give a non-trivial admissible heuristic for this problem.
Answer: Manhattan distance between Pacman and Ms. Pacman DIVIDED BY 2 (since both take a step
simultaneously)
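
To make the formulation concrete, here is a minimal Python sketch of the successor function, goal test, and halved-Manhattan heuristic; it assumes a wall-free N × N grid, and the helper names are hypothetical rather than part of the exam:

def meet_successors(state, N):
    # state = ((x1, y1), (x2, y2)); both pacmen move simultaneously
    moves = {'NORTH': (0, 1), 'SOUTH': (0, -1), 'EAST': (1, 0),
             'WEST': (-1, 0), 'STOP': (0, 0)}
    (x1, y1), (x2, y2) = state
    succs = []
    for dx1, dy1 in moves.values():
        for dx2, dy2 in moves.values():          # up to 5 * 5 = 25 joint actions
            nxt = ((x1 + dx1, y1 + dy1), (x2 + dx2, y2 + dy2))
            if all(1 <= c <= N for pos in nxt for c in pos):
                succs.append(nxt)
    return succs

def is_goal(state):
    return state[0] == state[1]                  # same square at the same time

def meet_heuristic(state):
    (x1, y1), (x2, y2) = state
    # each time step can shrink the Manhattan distance by at most 2,
    # so half the distance never overestimates the true cost
    return (abs(x1 - x2) + abs(y1 - y2)) / 2.0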

(c) (3 points) Circle all of the following graph search methods which are guaranteed to output optimal
solutions to this problem:

(i) DFS
(ii) BFS
(iii) UCS
(iv) A* (with a consistent and admissible heuristic)
(v) A* (with heuristic that returns zero for each state)
(vi) Greedy search (with a consistent and admissible heuristic)

Answer: BFS, UCS, A* (with a consistent and admissible heuristic), A* (with heuristic that returns zero for
each state)
(d) (2 points) If h1 and h2 are admissible, which of the following are also guaranteed to be admissible? Circle
all that apply:

(i) h1 + h2
(ii) h1 ∗ h2
(iii) max(h1 , h2 )
(iv) min(h1 , h2 )
(v) (α)h1 + (1 − α)h2 , for α ∈ [0, 1]

Answer: max(h1 , h2 ), min(h1 , h2 ), (α)h1 + (1 − α)h2 , for α ∈ [0, 1]
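
As a quick check of why these combinations work: if h1(s) ≤ h*(s) and h2(s) ≤ h*(s) for every state s, then max(h1, h2)(s) ≤ h*(s), min(h1, h2)(s) ≤ h*(s), and α·h1(s) + (1 − α)·h2(s) ≤ α·h*(s) + (1 − α)·h*(s) = h*(s), so all three remain admissible. By contrast, h1 + h2 can be as large as 2·h*(s) (take h1 = h2 = h*), and h1 · h2 can also overestimate, so neither is guaranteed admissible.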



2. (11 points.) CSPs: Finicky Feast

You are designing a menu for a special event. There are several choices, each represented as a variable:
(A)ppetizer, (B)everage, main (C)ourse, and (D)essert. The domains of the variables are as follows:

A: (v)eggies, (e)scargot
B: (w)ater, (s)oda, (m)ilk
C: (f)ish, (b)eef, (p)asta
D: (a)pple pie, (i)ce cream, (ch)eese
Because all of your guests get the same menu, it must obey the following dietary constraints:

(i) Vegetarian options: The appetizer must be veggies or the main course must be pasta or fish (or both).
(ii) Total budget: If you serve the escargot, you cannot afford any beverage other than water.
(iii) Calcium requirement: You must serve at least one of milk, ice cream, or cheese.

(a) (3 points) Draw the constraint graph over the variables A, B, C, and D.

Answer: Nodes A, B, C, and D, with edges A-C (from constraint i), A-B (from constraint ii), and B-D (from constraint iii).

(b) (2 points) Imagine we first assign A=e. Cross out eliminated values to show the domains of the variables
after forward checking.
A [ e ]
B [ w s m ]
C [ f b p ]
D [ a i ch ]
Answer: The values s, m, and b should be crossed off. “s” and “m” are eliminated due to being incompatible
with “e” based on constraint (ii). “b” is eliminated due to constraint (i).

(c) (3 points) Again imagine we first assign A=e. Cross out eliminated values to show the domains of the
variables after arc consistency has been enforced.
A [ e ]
B [ w s m ]
C [ f b p ]
D [ a i ch ]
Answer: The values s, m, b, and a should be eliminated. The first three are crossed off for the reasons above,
and “a” is eliminated because there is no value for (B) that is compatible with “a” (based on constraint (iii)).

(d) (1 point) Give a solution for this CSP or state that none exists.
Answer: Multiple solutions exist. One is A=e, B=w, C=f, and D=i.
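
The whole CSP is small enough to check by brute force; a minimal Python sketch (the encoding of constraints (i)-(iii) mirrors the problem statement, everything else is a convenience):

from itertools import product

domains = {'A': ['v', 'e'], 'B': ['w', 's', 'm'],
           'C': ['f', 'b', 'p'], 'D': ['a', 'i', 'ch']}

def satisfies(A, B, C, D):
    veg     = (A == 'v') or (C in ('p', 'f'))    # (i) vegetarian options
    budget  = (A != 'e') or (B == 'w')           # (ii) total budget
    calcium = (B == 'm') or (D in ('i', 'ch'))   # (iii) calcium requirement
    return veg and budget and calcium

solutions = [combo for combo in product(*(domains[v] for v in 'ABCD'))
             if satisfies(*combo)]
# ('e', 'w', 'f', 'i') is one of several tuples in `solutions`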

(e) (2 points) For general CSPs, will enforcing arc consistency after an assignment always prune at least as
many domain values as forward checking? Briefly explain why or why not.
Answer: Two answers are possible:
Yes. The first step of arc consistency is equivalent to forward checking, so arc consistency removes all values
that forward checking does.
No. While forward checking's prunings are a subset of arc consistency's, if arc consistency is enforced after every
assignment it may already have eliminated, at an earlier step, values that forward checking would only remove at
the current step. Thus enforcing arc consistency never leaves more domain values than forward checking overall,
but on a given step forward checking might prune more values than arc consistency does, simply because those
values were already pruned by arc consistency earlier.

3. (15 points.) Game Trees: The Balancer

Consider the following zero-sum game, in which the utilities UA (s) are shown for the first player (A). Assume
the second player (B) is a minimizer: B holds the opposite utilities to A, UB (s) = −UA (s). In this case, B’s
maximization of UB is equivalent to minimization of UA (i.e. the computation is standard minimax).

(a) (2 points) In each node, write UA (s), the (minimax) utility of that state for player A, assuming that B is
a minimizer.
Answer: Displayed above.

(b) (3 points) Cross off any nodes which will be skipped by alpha-beta pruning, assuming left-to-right ordering.
Answer: Displayed above.

Assume now that B is not a minimizer, but a balancer. A balancer does not try to minimize A’s score, but
rather wishes the outcome of the game to be as balanced as possible. Formally, assume B’s utility for a state
s is defined as UB (s) = −|UA (s)|. The game tree is shown here, with hexagons indicating player B’s control.

(c) (3 points) In each node, write UA (s), the utility of that state for player A, assuming that B is a balancer.
Answer: Displayed above.

(d) (3 points) Write pseudocode for the functions which compute the UA (s) values of game states in the
general case of multi-turn games where B is a balancer. Assume you have access to the following functions:
successors(s) gives the possible next states, isTerminal(s) checks whether a state is a terminal state, and
terminalValue(s) returns A’s utility for a terminal state. Careful: As in minimax, be sure that both functions
compute and return player A’s utilities for states – B’s utility can always be computed from A’s utility.

Answer: Below. Note that balanceValue(s) must still return the utility from the maximizer's (A's) perspective.

def maxValue(s):
    # compute U_A(s) assuming that A (the maximizer) is next to move
    if isTerminal(s):
        return terminalValue(s)
    return max(balanceValue(succ) for succ in successors(s))

def balanceValue(s):
    # compute U_A(s) assuming that B (the balancer) is next to move;
    # B picks the successor whose U_A value is closest to zero
    if isTerminal(s):
        return terminalValue(s)
    best = None
    for succ in successors(s):
        val = maxValue(succ)
        if best is None or abs(val) < abs(best):
            best = val
    return best
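
As a quick sanity check, the functions above can be exercised on a hypothetical toy tree (values chosen arbitrarily, not taken from the exam's figure), with stand-in definitions for the assumed helpers:

# A moves at the root; B (the balancer) moves at B1 and B2
TREE = {'root': ['B1', 'B2'], 'B1': ['L1', 'L2'], 'B2': ['L3', 'L4']}
LEAVES = {'L1': 3, 'L2': -1, 'L3': -5, 'L4': 2}

def successors(s):    return TREE[s]
def isTerminal(s):    return s in LEAVES
def terminalValue(s): return LEAVES[s]

# B picks -1 under B1 (|-1| < |3|) and 2 under B2 (|2| < |-5|),
# so A's value at the root is max(-1, 2) = 2
print(maxValue('root'))   # prints 2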

(e) (2 points) Consider pruning children of a B node in this scenario. On the tree on the bottom of the
previous page, cross off any nodes which can be pruned, again assuming left-to-right ordering.

Answer: Answers above.

(f) (2 points) Again consider pruning children of a B node s. Let α be the best option for an A node higher in
the tree, just as in alpha-beta pruning, and let v be the UA value of the best action B has found so far from s.
Give a general condition under which balanceValue(s) can return without examining any more of its children.

Answer: |v| < α.
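
A sketch of how that test might slot into balanceValue, assuming the best guaranteed value alpha for the A node above is threaded down the recursion (the parameter and its plumbing are assumptions, not part of the exam's pseudocode):

def balanceValue(s, alpha):
    # alpha: best U_A value some A node higher in the tree is already guaranteed
    if isTerminal(s):
        return terminalValue(s)
    best = None
    for succ in successors(s):
        val = maxValue(succ)              # maxValue would need to pass alpha along too
        if best is None or abs(val) < abs(best):
            best = val
        # remaining children can only shrink |best|, so this node's final value
        # lies in [-|best|, |best|]; if |best| < alpha, A will never choose it
        if abs(best) < alpha:
            return best
    return best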



4. (17 points.) MDPs and RL: Wandering Merchant


There are N cities along a major highway numbered 1 through N . You are a merchant from city 1 (that’s
where you start). Each day, you can either travel to a neighboring city (actions East or West) or stay and do
business in the current city (action Stay). If you choose to travel from city i, you successfully reach the next
city with probability pi , but there is probability 1 − pi that you hit a storm, in which case you waste the day
and do not go anywhere. If you stay to do business in city i, you get ri > 0 in reward; a travel day has reward
0 regardless of whether or not you succeed in changing cities.
The diagram below shows the actions and transitions from city i. Solid arrows are actions; dashed arrows are
resulting transitions labeled with their probability and reward, in that order.

(a) (2 points) If for all i, ri = 1, pi = 1, and there is a discount γ = 0.5, what is the value V^stay(1) of being
in city 1 under the policy that always chooses Stay? Your answer should be a real number.
Answer: For all cities (states) i = 1, . . . , N, the value under this fixed policy satisfies:

V^stay(i) = ri + γ V^stay(i)

(this is the Bellman equation for a fixed policy). Plugging in values, we get V^stay(i) = 1 + 0.5 V^stay(i). Solving
gives V^stay(i) = 2. In particular, V^stay(1) = 2.
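Equivalently, staying forever yields a geometric series: V^stay(1) = Σ_{t=0..∞} γ^t · r1 = 1/(1 − 0.5) = 2.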
(b) (2 points) If for all i, ri = 1, pi = 1, and there is a discount γ = 0.5, what is the optimal value V ∗ (1) of
being in city 1?
Intuitive Answer: since all the cities offer the same reward (ri = 1), there is no incentive to move to another
city to do business, so the optimal policy is to always stay, yielding V ∗ (1) = 2.
More Formal Answer:
For all cities (states) i = 1, . . . , N, writing out the Bellman equations (for i = 1, omit the left action; for i = N, omit the right action):

V∗(i) = max{ ri + γV∗(i),                          (stay)
             pi γV∗(i−1) + (1 − pi) γV∗(i),        (left)
             pi γV∗(i+1) + (1 − pi) γV∗(i) }       (right)

Since pi = 1, this drastically simplifies:

V∗(i) = max{ ri + γV∗(i),       (stay)
             γV∗(i−1),          (left)
             γV∗(i+1) }         (right)

From this, since all ri = 1 the equations are symmetric across cities, so V∗(i) is the same for all i; the max is then always attained by the stay action, giving V∗(i) = 2.
(c) (2 points) If the ri ’s and pi ’s are known positive numbers and there is almost no discount, i.e. γ ≈ 1,
describe the optimal policy. You may define it formally or in words, e.g. “always go east,” but your answer
should precisely define how an agent should act in any given state. Hint: You should not need to do any
computation to answer this question.
Basically Right Answer: the optimal policy is to always move towards the city with the highest reward. Once
there, stay there and do business forever.
Technical Answer: The only complication is due to possible ties. Let r∗ = max_{1≤i≤N} ri be the maximum reward
out of all the cities. The optimal policy from city i is as follows: if ri = r∗, stay; otherwise, move towards the
closest city j with rj = r∗, where the distance from i to a city j > i is the expected number of moves to get
there, Σ_{k=i}^{j−1} 1/pk (and symmetrically for j < i).
Suppose we run value iteration. Recall that Vk (s) is the value of state s after k rounds of value iteration and
all the values are initialized to zero.
(d) (2 points) If the optimal value of being in city 1 is positive, i.e. V ∗ (1) > 0, what is the largest k for which
Vk (1) could still be zero? Be careful of off-by-one errors.
Answer: Using the assumption ri > 0 from the problem statement, the largest k is 0, because V1(s) = max{ri + 0, · · · } > 0 for every s.
(Intended) Answer: If we don’t assume ri > 0, then the largest k is N − 1. Proof: since V∗(1) > 0, at least one
of the ri ’s must be strictly positive; let city i be the closest such city. After one iteration, V1(i) > 0; after two
iterations, V2(i − 1) > 0; and after i iterations, Vi(1) > 0. In the meantime, since rj = 0 for all j < i, we have
Vk(1) = 0 for all k < i. In the worst case, i = N, so VN−1(1) = 0 is possible, but VN(1) > 0.
(e) (2 points) If all of the ri and pi are positive, what is the largest k for which Vk (s) could still be zero for
some state s? Be careful of off-by-one errors.
Answer: Since all ri > 0, the largest k is 0, because V1(s) = max{ri + 0, · · · } > 0 for every state s.
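
A small value-iteration sketch of the merchant chain illustrates the propagation argument in parts (d) and (e). The instance below (N = 4, a reward only in the last city, all pi = 1, no discount) is hypothetical, chosen so that Vk(1) stays at zero until k = N:

def value_iteration_step(V, r, p, gamma=1.0):
    # one round of value iteration for the merchant chain MDP
    N = len(V)
    newV = []
    for i in range(N):
        options = [r[i] + gamma * V[i]]                                      # Stay
        if i > 0:                                                            # West
            options.append(p[i] * gamma * V[i - 1] + (1 - p[i]) * gamma * V[i])
        if i < N - 1:                                                        # East
            options.append(p[i] * gamma * V[i + 1] + (1 - p[i]) * gamma * V[i])
        newV.append(max(options))
    return newV

r, p = [0, 0, 0, 1], [1, 1, 1, 1]     # only city 4 has positive reward
V = [0.0] * 4
for k in range(1, 5):
    V = value_iteration_step(V, r, p)
    print(k, V)   # V_k(city 1) is 0 for k <= 3 = N - 1 and positive at k = 4 = N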
Suppose we don’t know the ri ’s or the pi ’s, so we decide to do Q-learning.
(f) (3 points) Suppose we experience the following sequence of transitions (s, a, r, s'): (s=1, a=stay, r=4, s'=1),
(s=1, a=east, r=0, s'=2), (s=2, a=stay, r=6, s'=2), (s=2, a=west, r=0, s'=1), (s=1, a=stay, r=4, s'=1). What are
the resulting Q(s, a) values if the learning rate is 0.5, the discount is 1, and we start with all Q(s, a) = 0? Fill in
the table below; each row should hold the q-values after the transition specified in its first column. You may
leave unchanged values blank.

After (1, S, 4, 1), we update Q(1, S) ← 0.5[4 + 1 · 0] + 0.5(0) = 2.


After (1, E, 0, 2), we update Q(1, E) ← 0.5[0 + 1 · 0] + 0.5(0) = 0.
After (2, S, 6, 2), we update Q(2, S) ← 0.5[6 + 1 · 0] + 0.5(0) = 3.
After (2, W, 0, 1), we update Q(2, W ) ← 0.5[0 + 1 · 2] + 0.5(0) = 1.

After (1, S, 4, 1), we update Q(1, S) ← 0.5[4 + 1 · 2] + 0.5(2) = 4.
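
These updates can be replayed with a short Python sketch (encoding states as integers and actions as strings is just a convenience, not part of the exam):

from collections import defaultdict

Q = defaultdict(float)                 # all Q(s, a) start at 0
alpha, gamma = 0.5, 1.0
actions = ['stay', 'east', 'west']

episode = [(1, 'stay', 4, 1), (1, 'east', 0, 2), (2, 'stay', 6, 2),
           (2, 'west', 0, 1), (1, 'stay', 4, 1)]

for s, a, r, s_next in episode:
    sample = r + gamma * max(Q[(s_next, b)] for b in actions)
    Q[(s, a)] = alpha * sample + (1 - alpha) * Q[(s, a)]

# final values: Q(1,stay)=4, Q(1,east)=0, Q(2,stay)=3, Q(2,west)=1 (all others remain 0)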

Circle true or false; skipping here is worth 1 point per question.


(g) (2 points) (True/False) Q-learning will only learn the optimal q-values if actions are eventually selected
according to the optimal policy.
Answer: False. As long as the policy used explores all the states (even a random policy will work), Q-learning
will find the optimal q-values.
(h) (2 points) (True/False) In a deterministic MDP (i.e. one in which each state / action leads to a single
deterministic next state), the Q-learning update with a learning rate of α = 1 will correctly learn the optimal
q-values.
Answer: True. Remember that the learning rate is only there because we are trying to approximate an expectation
over next states with a single sample. In a deterministic MDP where s' is the single state that always follows when
we take action a in state s, the target is Q(s, a) = R(s, a, s') + γ max_{a'} Q(s', a'), which is exactly the update we
make when α = 1.

5. (15 points.) Probability and Utilities: Wheel of Fortune

You are playing a simplified game of Wheel of Fortune. The objective is to


correctly guess a three letter word. Let X, Y, and Z represent the first, second,
and third letters of the word, respectively. There are only 8 possible words: X
can take on the values ‘c’ or ‘l’, Y can be ‘a’ or ‘o’, and Z can be ‘b’ or ‘t’.

Before you guess the word, two of the three letters will be revealed to
you. In the first round of the game, you choose one of X, Y or Z to be
revealed. In the second round, you choose one of the remaining two letters to
be revealed. In the third round, you guess the word. If you guess correctly,
you win. The utility of winning is 1, while the utility of losing is 0.

You watch the game a lot and determine that the eight possible words
occur with the probabilities shown on the right. Your goal is to act in such
a way as to maximize your chances of winning (and thereby your expected
utility).

(a) (3 points) What is the distribution P(Y, Z)? Your answer should be in the form of a table.
Answer:

        Y=a    Y=o
  X=c   0.2    0.4
  X=l   0.2    0.2
(b) (2 points) Are the second and third letters (Y and Z) independent? Show a specific computation that
supports your claim.
Answer: No; for example, P(X=c) = 0.6 and P(Y=a) = 0.4, but P(X=c, Y=a) = 0.2, which is not equal to
P(X=c)·P(Y=a) = 0.24. (Other counterexamples exist too.)
(c) (2 points) Are the second and third letters (Y and Z) independent if you know the value of the first letter
(X)? Show a specific computation that supports your claim.
Answer: Yes. P(Y=a, Z=b | X=c) = P(X=c, Y=a, Z=b)/P(X=c) = 0.1/0.6 = 1/6.
P(Y=a | X=c) = (0.1 + 0.1)/0.6 = 1/3 and P(Z=b | X=c) = (0.1 + 0.2)/0.6 = 1/2.
Thus, P(Y=a, Z=b | X=c) = 1/6 = P(Y=a | X=c) · P(Z=b | X=c). To be certain, you would also have to check
all pairs of values (not required for full credit). Alternatively, you can show that P(Y | X, Z) = P(Y | X).
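
The X = c computations can be checked numerically. Only the probabilities of the four words starting with 'c' are recoverable from the solution text (the full table is in a figure omitted here), so this sketch restricts attention to X = c:

# joint probabilities for the words starting with 'c', as implied by the solution
P = {('c', 'a', 'b'): 0.1, ('c', 'a', 't'): 0.1,
     ('c', 'o', 'b'): 0.2, ('c', 'o', 't'): 0.2}

p_c = sum(P.values())                                                   # P(X=c) = 0.6
p_ab_given_c = P[('c', 'a', 'b')] / p_c                                 # 1/6
p_a_given_c = sum(v for (x, y, z), v in P.items() if y == 'a') / p_c    # 1/3
p_b_given_c = sum(v for (x, y, z), v in P.items() if z == 'b') / p_c    # 1/2

# P(Y=a, Z=b | X=c) equals P(Y=a | X=c) * P(Z=b | X=c), as the solution claims
assert abs(p_ab_given_c - p_a_given_c * p_b_given_c) < 1e-9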

Suppose that in the first round, you ask about X and are told that X = c. It is the second round and you can
now either ask the host to reveal Y or to reveal Z.
(d) (2 points) If you ask the host to reveal Y, what is the probability that you will win in the third round?
Answer: Since Y and Z are independent conditioned on X, no matter what Y comes out to be, P(Z=b | X=c, Y)
will be 0.5. Thus, you'll guess Z arbitrarily and win with probability 0.5.

(e) (1 point) What letter should you ask the host about in the second round to maximize your chance of
winning, Y or Z?
Answer: Z, since you’ll be able to win 2/3 of the time (see part f)

(f ) (3 points) What is your expected utility if you act optimally from the state where X=c?
Answer: Since Y and Z are conditionally independent given X, knowing Z won't give you any additional
information about Y. So you'll guess the most likely value of Y given X = c, which is 'o' since P(Y=o | X=c) = 2/3,
and you win 2/3 of the time.

(g) (2 points) Suppose that the host is allowed to pick any distribution over the three variables but has to
tell you what the distribution is before the game starts. What distribution should the host pick to minimize
your chances of winning? Justify your answer briefly.
Answer: Uniform: you need a distribution in which each letter takes either of its values with probability 1/2 and the
letters are independent of one another, so knowing the two revealed values gives no information about the hidden
one, and you can never guess it with probability better than 1/2.
