Lecture Decision Trees 2017
Lecture Decision Trees 2017
Decision trees represent a systematic analysis of a sequence of decisions. The tree is similar in
form to an event tree, however the head node, and possibly some other nodes, are decision nodes.
At these nodes the decision-maker chooses which option to take rather than having probabilities
assigned. Thus the probability of the preferred option is one and all others are zero. The other
nodes are referred to as chance nodes.
D
0.3
B
A 0.7 E
C
First Subsequent
Decision possible outcomes
Choices
or decisions
Node A in the diagram is the head node for the tree. It is also a decision node where the decision
maker needs to choose between B and C. After the decision maker makes the choice probability
of the preferred option is one and all others are zero
Node B in the diagram is a chance node. In this case the decision maker does not know which of
the child nodes (D and E in the example) will occur. The numbers on the diagram represent the
probabilities for the different outcomes.
Nodes C and E in the diagram are leaf nodes. These represent known outcomes.
Node D in the diagram is another decision node. It will have its own child nodes and the tree will
continue. This represents a multi-stage decision where the outcome of Decision A is known
before Decision D needs to be made.
Analysis
While construction of the tree proceeds from the head node to the leaf nodes, analysis of the tree
commences at the tips of the branches.
Commonly decisions will be made based on highest value. Value is usually measured as
Expected Monetary Value.
0.7 E $1M For a chance node the EMV is the sum of the products
of the probabilities and the EMV’s of the respective
branch nodes. In the diagram B will have an EMV of
Weak market 0.3 * $5M + 0.7 * $1M = $2.2M
Start a Business
B For a decision node the decision maker simply picks the
$2.2M
best option. In the diagram this would be to start a
A business. The EMV for node A thus becomes $2.2M
C $2M
Get a Job
Utility
The EMV approach is good when applied to a lot of small decisions. However it has problems
when applied to one off decisions involving a loss of a significant proportion of a person's or
company's resources. For example not many people are willing to bet their home, even if they
have a better than 50% chance of getting more than twice as much.
To deal with this we can use a utility curve. We use this to convert the dollar amounts on a
decision tree into utility values before we calculate the EMV. We then proceed as normal except
that we are calculating an expected mean utility. Some examples of utility diagrams are shown in
the following figure.
Utility
Risk Averter
100
Risk Neutral
Risk Accepter
Monetary Value
Example
A company is deciding whether to make a large or small resort on an island that it owns. The
payoff matrix is given below and the utility curve for the company is given by 51 + 1.4x - 0.01x2.
If we calculate the EMV for the decision tree we find that the Large resort is the preferred option.
However, if we calculate the expected mean utility we find that the small resort is the better
choice because the loss that a large resort would make under low demand is too big a problem for
the company.
Gift
U(gift) = 100 * p
An individual is asked whether he or she would prefer a gamble between two options (for
example winning $100,000 or losing $50,000) at a given probability, or a gift (for example
$30,000). The probability is adjusted until the individual finds both options equally apealling.
The utility of the gift is then calculated as 100 times the probability. For example if the
individual chooses a probability of p=0.6 for a gift of $30,000 then the utility of $30,000 is 60.
The experiment is repeated with gifts of different values. When enough data points have been
Economists also talk about a concept of utility. This is the same thing as discussed here.
Namely, the utility is the value of something (in this case money) to a particular person under
particular circumstances, as opposed to the price, which is determined by supply and demand.
The underlying concept is that for someone (who is a risk averter) that has a large sum of money
a little extra is not worth as much as it would be if the extra amount was all that they had.
Bayes Theorem
If we have many events X1...Xj...Xn which are possible outcomes of a decision with initial
estimates of probabilities P(X1)...P(Xj)....P(Xn), and we perform an experiment that leads to
event E occurring
n
P(E) = j=1
P(E Xj ) P(Xj )
P(E X j ).P(Xj )
(Xj E) =
P
n
i=1
P(E X i ).P(X i)
The construction of a pre-stressed concrete bridge has been delayed. The pre-stressing strand has
rusted. We need to make a decision to replace the strand or not due to a possible reduction in
properties due to the rusting.
States of Nature
Decision Alternatives Failure No Failure
___________________________________________________________________________
Replace Strand (a1) 0.00001 0.99999
Do not Replace Strand (a2) 0.10 0.9
If we carry out a decision analysis based on these figures and the EMV criterion we would
choose to replace the strand.
We have a third option a3 which involves postponing the decision to allow tests to be carried out.
(The tests cost $2000) Action a3 has two possible outcomes E1 - tests indicate no significant
reduction in strength E2 - tests indicate significant reduction in strength.
P(E1|X3) = 0.95
P(E1|X4) = 0.09
The problems that we have in this example are that we do not know the probabilities of the two
different test outcomes )P(E1) and P(E2) or the probabilities that the bridge will or will not fail in
the case of either test result. That is we do not know P(X3|E1), etc. (notice that the symbols in the
conditional probability are reversed compared to what we are given).
To solve the first problem we use P(X3) = 0.9 and P(X4) = 0.1 with the first equation from Bayes'
Theorem above to obtain:
0.95 0.9
| 0.99
0.864
0.09 0.1
| 0.01
0.864
Similarly, if E2 occurs,
P(E2) = 0.136
and
P(X3|E2) = 0.33
P(X4|E2) = 0.67
failure 770000
x2(0.00001)
a1
replace x3(0.90)
no failure 0
EMV= 75000
failure 750000
a2 x4(0.10)
don't replace
EMV = x1(0.99999) 22000
22008
772000
a3 a1
EMV=9500
tests x2(0.00001)
EMV =
9500 2000
(0.864) x3(0.99)
E1 a2 752000
x4(0.01)
EMV = $11 201
EMV =
22008 22000
x1(0.99999)
(0.136)
E2 a1 772000
x2(0.00001)
EMV =
504500
EMV=22008 x3(0.33) 2000
a2
x4(0.67) 752000
In summary
EMV(replace) = $20,008
EMV(don't replace) = $75,000
If E1 is the outcome of the test option a2 is the preferred option. If E2 is the outcome of the test
then it is better to replace the strand (option a1). This gives an EMV (a3) = (0.136*22,008) +
(0.864 x 9500) = $11,201.
This indicates that carrying out the tests is the best option.
References
Cross, J. (1997), Risk Management, The University of New South Wales.
Dandy, G. C. and Warner, R. F. (1989), Planning and Design of Engineering Systems,
Unwin Hyman, London.
1. (after Meredith et al., 1973) Consider the problem encountered by a contractor who is
performing some work with special equipment in a river flats area that has been subjected
in the past to high-water conditions and occasional destructive flooding. There is a 4-
month period during which there is no use for the equipment either on this job or on
others. The contractor can keep the equipment on the job in the river flats, or else move it
out, store it, and then move it back, at a total cost of $1800.
3. A bank classifies 70% of their credit card customers as having a good credit rating and the
other 30% as having a poor credit rating. Customers with good credit rating allow their
accounts to go overdue 10% of the time on average, while customers with a bad credit
rating allow their accounts to go overdue 60% of the time on average. What percentage
of overdue credit card accounts do customers with poor credit ratings hold?
If another group finishes first the group making the decision get nothing (but bills). If they begin
executing on a current computer and then purchase a faster computer then the code will have to
start executing from scratch.
Draw a desicion tree and determine what the scientists should do.
5. The Port of Wattalil Harbour needs some more space for its container terminal. There is a
bay in the harbour that can be used for the terminal, but it will need to be deepened. The
cost of this deepening will depend on the hardness of the sand at the bottom of the
harbour and the excavation method used. If dredging is used then the cost of excavating
the sand will be $200 000 if it is soft, and $700 000 if it is hard. If a dragline is used then
the cost of excavating the sand will be $300 000 if it is soft, and $400 000 if it is hard.
The hardness of the sand is unknown, but the engineer estimates that there is a 70%
chance that it will be soft.
It is possible for the engineer to do some SPT testing to see if the sand is hard or soft. If
he does this test and the sand is hard then there is a 90% probability that high SPT results
will be obtained and 10% chance that low SPT results will be obtained while if the sand is
soft then there is a 10% probability that high SPT results will be obtained and 90% chance
that low SPT results will be obtained.
What is the value of the test to the engineer?