-
Notifications
You must be signed in to change notification settings - Fork 546
/
decision-theory-exercises.tex
502 lines (439 loc) · 22.7 KB
/
decision-theory-exercises.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
%%%% 16.1: Combining Beliefs and Desires Under Uncertainty (5 exercises, 3 labelled)
%%%% ===============================================================================
\begin{exercise}[almanac-game]%
(Adapted from David Heckerman\nindex{Heckerman, D.}.) This exercise
concerns the \term{Almanac Game}\tindex{Almanac Game}, which is used
by decision analysts to calibrate numeric estimation. For each of
the questions that follow, give your best guess of the answer, that
is, a number that you think is as likely to be too high as it is to be
too low. Also give your guess at a 25th percentile estimate, that is,
a number that you think has a 25\% chance of being too high, and a
75\% chance of being too low. Do the same for the 75th
percentile. (Thus, you should give three estimates in all---low,
median, and high---for each question.)
\begin{enumerate}
\item Number of passengers who flew between New York and Los Angeles in 1989.
\item Population of Warsaw in 1992.
\item Year in which Coronado discovered the Mississippi River.
\item Number of votes received by Jimmy Carter in the 1976 presidential
election.
\item Age of the oldest living tree, as of 2002.
\item Height of the Hoover Dam in feet.
\item Number of eggs produced in Oregon in 1985.
\item Number of Buddhists in the world in 1992.
\item Number of deaths due to AIDS in the \indextext{United States} in 1981.
\item Number of U.S. patents granted in 1901.
\end{enumerate}
The correct answers appear after the last exercise of this chapter. From the
point of view of decision analysis, the interesting thing is not how close
your median guesses came to the real answers, but rather how often the real
answer came within your 25\% and 75\% bounds. If it was about half the time,
then your bounds are accurate. But if you're like most people, you will be
more sure of yourself than you should be, and fewer than half the answers will
fall within the bounds. With practice, you can calibrate yourself to give
realistic bounds, and thus be more useful in supplying information for
decision making. Try this second set of questions and see if there is any
improvement:
\begin{enumerate}
\item Year of birth of Zsa Zsa Gabor\nindex{Gabor, Z.~Z.}. %1917
\item Maximum distance from Mars to the sun in miles. %155M
\item Value in dollars of exports of wheat from the United States in 1992. %4500M
\item Tons handled by the port of Honolulu in 1991. % 11M
\item Annual salary in dollars of the governor of California in 1993. % 120000
\item Population of San Diego in 1990. % 1.1M
\item Year in which Roger Williams\nindex{Williams, R.} founded Providence, Rhode Island. % 1636
\item Height of Mt.~Kilimanjaro in feet. %19340
\item Length of the Brooklyn Bridge in feet. %1595
\item Number of deaths due to automobile accidents in the United States in 1992.
%41,710
\end{enumerate}
\end{exercise}
% id=16.0 section=16.1
\begin{uexercise}
Chris considers four used cars before buying the one with maximum
expected utility. Pat considers ten cars and does the same.
All other things being equal, which one is more likely to have the
better car? Which is more likely to be disappointed with their car's
quality? By how much (in terms of standard deviations of expected
quality)?
\end{uexercise}
% id=16.2 section=16.1
\begin{iexercise}
Chris considers five used cars before buying the one with maximum
expected utility. Pat considers eleven cars and does the same.
All other things being equal, which one is more likely to have the
better car? Which is more likely to be disappointed with their car's
quality? By how much (in terms of standard deviations of expected
quality)?
\end{iexercise}
% id=16.2 section=16.1
\begin{exercise}[St-Petersburg-exercise]
In 1713, Nicolas Bernoulli\nindex{Bernoulli, N.} stated a puzzle, now called the
St.~Petersburg paradox\index{Saint P@St. Petersburg paradox},\index{paradox} which
works as follows. You have the opportunity to play a game in which a
fair coin\index{coin flip} is tossed repeatedly until it comes up
heads. If the first heads appears on the \(n\)th toss, you win \(2^n\)
dollars.
\begin{enumerate}
\item Show that the expected monetary value of this game is infinite.
\item How much would you, personally, pay to play the game?
\item Nicolas's cousin Daniel Bernoulli resolved the apparent paradox in 1738 by suggesting
that the utility of money
is measured on a logarithmic scale (i.e.,
\(U(S_{n}) = a\log_2 n +b\), where \(S_n\) is the state of having {\DollarSign}\(n\)).
What is the expected utility of the game under this assumption?
\item What is the maximum amount that it would be rational to pay
to play the game, assuming that
one's initial wealth is {\DollarSign}\(k\,\)?
\end{enumerate}
\end{exercise}
% id=16.3 section=16.1
\begin{exercise} \prgex
Write a computer program to automate the process in
\exref{assessment-exercise}. Try your program out on several people of
different net worth and political outlook. Comment on the consistency of your
results, both for an individual and across individuals.
\end{exercise}
% id=16.6 section=16.1
\begin{iexercise}[surprise-candy-exercise]
The Surprise Candy Company makes
candy in two flavors: 75\% are strawberry flavor and 25\% are anchovy
flavor. Each new piece of candy starts out with a round shape; as it
moves along the production line, a machine randomly selects a certain
percentage to be trimmed into a square; then, each piece is wrapped in
a wrapper whose color is chosen randomly to be red or brown. 70\% of the strawberry
candies are round and 70\% have a red wrapper, while 90\% of the
anchovy candies are square and 90\% have a brown wrapper. All candies are sold
individually in sealed, identical, black boxes.
Now you, the customer, have just bought a Surprise candy at the store
but have not yet opened the box. Consider the
three Bayes nets in \figref{3candy-figure}.
%
\begin{figure}[htb]
%%\epsfxsize=0.9\maxfigwidth
\figboxnew{figures/3candy.eps}
\caption{Three proposed Bayes nets for the Surprise Candy
problem, \exref{surprise-candy-exercise}. }
\label{3candy-figure}
\end{figure}
%
\begin{enumerate}
\item Which network(s) can correctly represent \(\pv(Flavor,Wrapper,Shape)\)?
\item Which network is the best representation for this problem?
\item Does network (i) assert that \(\pv(Wrapper|Shape)\eq \pv(Wrapper)\)?
\item What is the probability that your candy has a red wrapper?
\item In the box is a round candy with a red wrapper. What is the probability that its flavor is strawberry?
\item A unwrapped strawberry candy is worth \(s\) on the open market and an
unwrapped anchovy candy is worth \(a\). Write an expression for the value of an unopened candy box.
\item A new law prohibits trading of unwrapped candies, but it is still legal to trade wrapped candies
(out of the box). Is an unopened candy box now worth more than less than, or the same as before?
\end{enumerate}
\end{iexercise}
% id=16.9 section=16.1
\begin{uexercise}[surprise-candy-exercise]
The Surprise Candy Company makes
candy in two flavors: 70\% are strawberry flavor and 30\% are anchovy
flavor. Each new piece of candy starts out with a round shape; as it
moves along the production line, a machine randomly selects a certain
percentage to be trimmed into a square; then, each piece is wrapped in
a wrapper whose color is chosen randomly to be red or brown. 80\% of the strawberry
candies are round and 80\% have a red wrapper, while 90\% of the
anchovy candies are square and 90\% have a brown wrapper. All candies are sold
individually in sealed, identical, black boxes.
Now you, the customer, have just bought a Surprise candy at the store
but have not yet opened the box. Consider the
three Bayes nets in \figref{3candy-figure}.
%
\begin{figure}[htb]
%%\epsfxsize=0.9\maxfigwidth
\figboxnew{figures/3candy.eps}
\caption{Three proposed Bayes nets for the Surprise Candy
problem, \exref{surprise-candy-exercise}. }
\label{3candy-figure}
\end{figure}
%
\begin{enumerate}
\item Which network(s) can correctly represent \(\pv(Flavor,Wrapper,Shape)\)?
\item Which network is the best representation for this problem?
\item Does network (i) assert that \(\pv(Wrapper|Shape)\eq \pv(Wrapper)\)?
\item What is the probability that your candy has a red wrapper?
\item In the box is a round candy with a red wrapper. What is the probability that its flavor is strawberry?
\item A unwrapped strawberry candy is worth \(s\) on the open market and an
unwrapped anchovy candy is worth \(a\). Write an expression for the value of an unopened candy box.
\item A new law prohibits trading of unwrapped candies, but it is still legal to trade wrapped candies
(out of the box). Is an unopened candy box now worth more than less than, or the same as before?
\end{enumerate}
\end{uexercise}
% id=16.9 section=16.1
%%%% 16.2: The Basis of Utility Theory (2 exercises, 0 labelled)
%%%% ===========================================================
\begin{exercise}
Prove that the judgments \(B \pref A\) and \(C \pref D\) in the Allais paradox (\pgref{allais-page})
violate the axiom of substitutability.
\end{exercise}
% id=16.4 section=16.2
\begin{exercise}
Consider the Allais paradox described on \pgref{allais-page}:
an agent who prefers \(B\) over \(A\) (taking the
sure thing), and \(C\) over \(D\) (taking the higher EMV) is not
acting rationally, according to utility theory. Do you think this indicates
a problem for the agent, a problem for the theory, or no problem at all? Explain.
\end{exercise}
% id=16.18 section=16.2
%%%% 16.3: Utility Functions (6 exercises, 2 labelled)
%%%% =================================================
\begin{uexercise}
Tickets to a \indextext{lottery} cost {\DollarSign}1. There are two
possible prizes: a {\DollarSign}10 payoff with probability 1/50, and a
{\DollarSign}1,000,000 payoff with probability 1/2,000,000. What is the expected
monetary value of a lottery ticket? When (if ever) is it rational to
buy a ticket? Be precise---show an equation involving utilities. You
may assume current wealth of {\DollarSign}\(k\) and that \(U(S_k)=0\). You may also
assume that \(U(S_{k+{10}}) = {10}\times U(S_{k+1})\), but you may not make
any assumptions about \(U(S_{k+1,{000},{000}})\). Sociological studies show
that people with lower income buy a disproportionate number of lottery
tickets. Do you think this is because they are worse decision makers
or because they have a different utility function? Consider the value of
contemplating the possibility of winning the lottery versus the value of
contemplating becoming an action hero while watching an adventure movie.
\end{uexercise}
% id=16.1 section=16.3
\begin{exercise}[assessment-exercise]%
Assess your own utility for different incremental amounts of money
by running a series of preference tests between some definite amount \(M_1\) and
a lottery \([p,M_2; (1-p), 0]\). Choose different values of \(M_1\) and \(M_2\),
and vary \(p\) until you are indifferent between the two choices. Plot the
resulting utility function.
\end{exercise}
% id=16.5 section=16.3
\begin{exercise}
How much is a micromort\index{micromort} worth to you? Devise a protocol to determine this.
Ask questions based both on paying to avoid risk and being paid to accept risk.
\end{exercise}
% id=16.7 section=16.3
\begin{exercise}[kmax-exercise]
Let continuous variables \(X_1,\ldots,X_k\) be independently
distributed according to the same probability density function
\(f(x)\). Prove that the density function for
\(\max\{X_1,\ldots,X_k\}\) is given by \(kf(x)(F(x))^{k-1}\), where
\(F\) is the cumulative distribution for \(f\).
\end{exercise}
% id=16.10 section=16.3
\begin{uexercise}
Economists often make use of an
exponential utility function for money: \(U(x) = -e^{-x/R}\), where \(R\) is
a positive constant representing an individual's risk tolerance.
Risk tolerance reflects how likely an individual is to accept a
lottery with a particular expected monetary value (EMV) versus some
certain payoff. As \(R\) (which is measured in the same units as \(x\))
becomes larger, the individual becomes less risk-averse.
\begin{enumerate}
\item Assume Mary has an exponential utility function with \(R = \$500\). Mary
is given the choice between receiving \$500 with certainty (probability
1) or participating in a lottery which has a 60\% probability of
winning \$5000 and a 40\% probability of winning nothing. Assuming
Marry acts rationally, which option would she choose? Show how you
derived your answer.
\item Consider the choice between receiving \$100 with certainty
(probability 1) or participating in a lottery which has a 50\%
probability of winning \$500 and a 50\% probability of winning nothing.
Approximate the value of R (to 3 significant digits) in an exponential
utility function that would cause an individual to be indifferent to
these two alternatives. (You might find it helpful to write a short
program to help you solve this problem.)
\end{enumerate}
\end{uexercise}
% id=16.17 section=16.3
\begin{iexercise}
Economists often make use of an
exponential utility function for money: \(U(x) = -e^{-x/R}\), where \(R\) is
a positive constant representing an individual's risk tolerance.
Risk tolerance reflects how likely an individual is to accept a
lottery with a particular expected monetary value (EMV) versus some
certain payoff. As \(R\) (which is measured in the same units as \(x\))
becomes larger, the individual becomes less risk-averse.
\begin{enumerate}
\item Assume Mary has an exponential utility function with \(R = \$400\). Mary
is given the choice between receiving \$400 with certainty (probability
1) or participating in a lottery which has a 60\% probability of
winning \$5000 and a 40\% probability of winning nothing. Assuming
Marry acts rationally, which option would she choose? Show how you
derived your answer.
\item Consider the choice between receiving \$100 with certainty
(probability 1) or participating in a lottery which has a 50\%
probability of winning \$500 and a 50\% probability of winning nothing.
Approximate the value of R (to 3 significant digits) in an exponential
utility function that would cause an individual to be indifferent to
these two alternatives. (You might find it helpful to write a short
program to help you solve this problem.)
\end{enumerate}
\end{iexercise}
% id=16.17 section=16.3
\begin{iexercise}
Alex is given the choice between two games. In Game 1, a fair coin
is flipped and if it comes up heads, Alex receives \$100. If the coin
comes up tails, Alex receives nothing. In Game 2, a fair coin is
flipped twice. Each time the coin comes up heads, Alex receives \$50,
and Alex receives nothing for each coin flip that comes up tails.
Assuming that Alex has a monotonically increasing utility function for
money in the range [\$0, \$100], show mathematically that if Alex
prefers Game 2 to Game 1, then Alex is risk averse
(at least with respect to this range of monetary amounts).
\end{iexercise}
% id=16.20 section=16.3
%%%% 16.4: Multiattribute Utility Functions (3 exercises, 1 labelled)
%%%% ================================================================
\begin{iexercise}
Show that if \(X_1\) and \(X_2\) are preferentially\index{preference independence} independent of \(X_3\),
and \(X_2\) and \(X_3\) are preferentially independent of \(X_1\), then
\(X_3\) and \(X_1\) are preferentially independent
of \(X_2\).
\end{iexercise}
% id=16.8 section=16.4
\begin{exercise}[airport-au-id-exercise]%
Repeat \exref{airport-id-exercise}, using the action-utility
representation shown in \figref{airport-au-id-figure}.
\end{exercise}
% id=16.12 section=16.4
\begin{exercise}
For either of the airport-siting diagrams from Exercises
\ref{airport-id-exercise} and \ref{airport-au-id-exercise},
to which conditional probability table entry is the utility most sensitive,
given the available evidence?
\end{exercise}
% id=16.13 section=16.4
%%%% 16.5: Decision Networks (2 exercises, 0 labelled)
%%%% =================================================
\begin{iexercise}\prgex
Modify and extend the Bayesian network code in the code repository to
provide for creation and evaluation of decision networks and the calculation of information value.
\end{iexercise}
% id=16.16 section=16.5
\begin{exercise}
Consider a student who has the choice to buy or not buy a textbook for
a course. We'll model this as a decision problem with one Boolean
decision node, \(B\), indicating whether the agent chooses to buy the
book, and two Boolean chance nodes, \(M\), indicating whether the
student has mastered the material in the book, and \(P\), indicating
whether the student passes the course. Of course, there is also a
utility node, \(U\). A certain student, Sam, has an additive utility
function: 0 for not buying the book and -\$100 for buying it; and \$2000
for passing the course and 0 for not passing. Sam's conditional probability
estimates are as follows:
\[\begin{array}{ll}
P(p|b,m) = 0.9 & P(m|b) = 0.9 \\
P(p|b, \lnot m) = 0.5 & P(m|\lnot b) = 0.7 \\
P(p|\lnot b, m) = 0.8 & \\
P(p|\lnot b, \lnot m) = 0.3 & \\
\end{array}\]
You might think that \(P\) would be independent of \(B\) given \(M\), But this course has an open-book final---so having the book helps.
\begin{enumerate}
\item Draw the decision network for this problem.
\item Compute the expected utility of buying the book and of not buying it.
\item What should Sam do?
\end{enumerate}
\end{exercise}
% id=16.19 section=16.5
%%%% 16.6: The Value of Information (3 exercises, 3 labelled)
%%%% ========================================================
\begin{exercise}[airport-id-exercise]%
\prgex
This exercise completes the analysis of the airport-siting problem\index{problem!airport-siting} in \figref{airport-id-figure}.
\begin{enumerate}
\item Provide reasonable variable domains, probabilities, and
utilities for the network, assuming that there are three possible
sites.
\item Solve the decision problem.
\item What happens if changes in technology mean that each aircraft generates half the
noise?
\item What if noise avoidance becomes three times more important?
\item Calculate the VPI for \(\J{AirTraffic}\), \(\J{Litigation}\), and \(\J{Construction}\)
in your model.
\end{enumerate}
\end{exercise}
% id=16.11 section=16.6
\begin{exercise}[car-vpi-exercise] (Adapted from Pearl~\citeyear{Pearl:1988}.)
A used-car buyer can decide to carry out various tests with various costs
(e.g., kick the tires, take the car to a qualified mechanic) and then,
depending on the outcome of the tests, decide which car to buy. We will assume
that the buyer is deciding whether to buy car \(c_1\), that there is time to
carry out at most one test, and that \(t_1\) is the test of \(c_1\) and costs
{\DollarSign}50.
A car can be in good shape (quality \(q^+\)) or bad shape (quality
\(q^-\)), and the tests might help indicate what shape the car is
in. Car \(c_1\) costs {\DollarSign}1,500, and its market value is
{\DollarSign}2,000 if it is in good shape; if not, {\DollarSign}700 in
repairs will be needed to make it in good shape. The buyer's estimate
is that \(c_1\) has a 70\% chance of being in good shape.
\begin{enumerate}
\item Draw the decision network that represents this problem.
\item Calculate the expected net gain from buying \(c_1\), given no test.
\item Tests can be described by the probability that the car will
pass or fail the test given that the car is in good or bad shape. We have the
following information:\\
\(P(\J{pass}(c_1,t_1) | q^+(c_1)) = {0.8}\)\\
\(P(\J{pass}(c_1,t_1) | q^-(c_1)) = {0.35}\)\\
Use Bayes' theorem to calculate the probability that the car will
pass (or fail) its test and hence the probability that it is in good
(or bad) shape given each possible test outcome.
\item Calculate the optimal decisions given either a pass or a fail,
and their expected utilities.
\item Calculate the value of information of the test, and derive an
optimal conditional plan for the buyer.
\end{enumerate}
\end{exercise}
% id=16.14 section=16.6
\begin{exercise}[nonnegative-VPI-exercise]%
Recall the definition of {\em value of information}\index{value of information} in \secref{VPI-section}.
\begin{enumerate}
\item Prove that the value of information is nonnegative and order independent.
\item Explain why it is that some people would prefer not to get some information---for example, not
wanting to know the sex of their baby when an ultrasound is done.
\item A function \(f\) on sets is \newterm[submodularity]{submodular}\ntindex{submodularity} if, for any element \(x\) and any
sets \(A\) and \(B\) such that \(A\subseteq B\), adding \(x\) to \(A\) gives a greater increase in \(f\) than adding \(x\) to \(B\):
\[
A\subseteq B \implies (f(A\union \{x\}) - f(A)) \geq (f(B\union \{x\}) - f(B))\ .
\]
Submodularity captures the intuitive notion of {\em diminishing returns}. Is the value of information, viewed as a function \(f\) on sets of possible observations, submodular? Prove this or find a counterexample.
\end{enumerate}
\end{exercise}
% id=16.15 section=16.6
% \begin{exercise}
% It has sometimes been suggested that \newterm{lexicographic
% preference}\ntindex{preference!lexicographic} is a form of rational behavior that is not captured by
% utility theory. Lexicographic preferences rank attributes in some
% order {\mathdelim}X_1,\, \ldots,\, X_n{\mathdelim}, and treat each attribute as infinitely more
% important than attributes later in the order. In choosing between two
% prizes, the value of attribute
% {\mathdelim}X_i{\mathdelim} only matters if the prizes have the same values for {\mathdelim}X_1,\, \ldots,\,
% X_{i-1}{\mathdelim}. In a lottery, an infinitesimal probability of a tiny
% improvement in a
% more important attribute is considered better than a dead certainty
% of a huge improvement in a less important attribute.
% For example, in the airport-siting problem, it might be
% proposed that preserving human life is of paramount importance, and
% therefore if one site is more dangerous than another, it should be
% ruled out immediately, without considering the other attributes. Only
% if two sites are equally safe should they be compared on
% other attributes such as cost.
% \begin{enumerate}
% \item Give a precise definition of lexicographic preference
% between deterministic outcomes.
% \item Give a precise definition of lexicographic preference
% between lotteries.
% \item Does lexicographic preference violate any of the axioms of utility
% theory? If so, which? ({\em Hint:} consider pair-wise preference
% comparisons of three different possibilities.)
% \item Suggest a set of attributes for which you might exhibit
% lexicographic preferences.
% \end{enumerate}
% \end{exercise}
%\begin{exercise}
%<<ex on burglary figure - add d and u nodes, vpi, vii=vpi]]
%\end{exercise}
%% Mehran's exercises
%% End Mehran's exercises
\bigskip
\noindent The answers to \exref{almanac-game} (where M stands for million):
First set: 3M, 1.6M, 1541, 41M, 4768, 221, 649M, 295M, 132, 25,546.
Second set: 1917, 155M, 4,500M, 11M, 120,000, 1.1M, 1636, 19,340, 1,595, 41,710.