Abstract
The goal of this chapter is to provide a general introduction to decision making under uncertainty. The mathematical foundations of the most popular models used in artificial intelligence are described, notably the Expected Utility model (EU), but also new decision making models, like Rank Dependent Utility (RDU), which significantly extend the descriptive power of EU. Decision making under uncertainty naturally involves risks when decisions are made. The notion of risk is formalized as well as the attitude of agents w.r.t. risk. For this purpose, probabilities are often exploited to model uncertainties. But there exist situations in which agents do not have sufficient knowledge or data available to determine these probability distributions. In this case, more general models of uncertainty are needed and this chapter describes some of them, notably belief functions. Finally, in most artificial intelligence problems, sequences of decisions need be made and, to get an optimal sequence, decisions must not be considered separately but as a whole. We thus study at the end of this chapter models of sequential decision making under uncertainty, notably the most widely used graphical models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Outside the EU framework, the behavior of an agent cannot be rational (w.r.t. Savage’s meaning) and, therefore, it is thought in artificial intelligence that such a behavior must be proscribed. In the 70’s and 80’s, decision theorists, notably Kahneman, Tversky and Quiggin, suggested that Savage’s rationality was not the only possible form of rationality and they proposed to depart from the Savagian framework and developed their own kinds of “rationality”. This paved the way to new decision models like, e.g., RDU, that recently attracted the attention of AI researchers.
- 2.
In his book, Savage did not name this operation. The term “splicing” was introduced in Gilboa (2009).
- 3.
Most authors name P2 as the “sure thing principle” but it was pointed out by Peter Wakker that, in Savage’s book, the sure thing principle refers to axioms P2, P3 and P7.
- 4.
Note that qualitative probabilities are slightly different from probabilities, see Kraft et al. (1959) for a proof of this assertion.
- 5.
Savage’s theorem is somewhat more general than the theorem mentioned here: acts need not have a finite support, it is sufficient that the set of consequences \(\mathscr {X}\) is finite. In this case, the summation needs be substituted by an integral w.r.t. the subjective probability measure.
- 6.
For an interpretation in terms of weights of agents’ coalitions or of criteria, see chapter “Multicriteria Decision Making” of this volume.
References
Allais M (1953) Le comportement de l’homme rationnel devant le risque: critique des postulats et axiomes de l’école américaine. Econometrica 21:503–546
Anand P (1993) The philosophy of intransitive preference. Econ J 103(417):337–346
Anscombe F, Aumann R (1963) A definition of subjective probability. Ann Math Stat 34:199–205
Argall BD, Chernova S, Veloso M, Browning B (2009) A survey of robot learning from demonstration. Robot Auton Syst 57:469–483
Arrow KJ (1965) Aspects of the theory of risk bearing, chapter The theory of risk aversion. Yrjo Jahnsson Fondation, pp 90–120
Bellman R (1957) Dynamic programming. Princeton University Press, Princeton
Bernoulli D (1738) Specimen theoriae novae de mensura sortis. Commentarii academiae scientiarum imperialis Petropolitanae 5:175–192
Bleichrodt H (1996) Applications of utility theory in the economic evaluation of health care. PhD thesis, Erasmus University, Rotterdam, the Netherlands
Boutilier C (1994) Towards a logic for qualitative decision theory. In: Proceedings of the international conference on principles of knowledge representation and reasoning (KR’94), pp 75–56
Boutilier C (2002) A POMDP formulation of preference elicitation problems. In: Proceedings of the national conference on artificial intelligence (AAAI’02), pp 239–246
Boutilier C, Regan K, Viappiani P (2010) Simultaneous elicitation of preference features and utility. In: Proceedings of the national conference on artificial intelligence (AAAI’10), pp 1160–1167
Brafman RI, Tennenholtz M (1996) On the foundation of qualitative decision theory. In: Proceedings of the national conference on artificial intelligence (AAAI’96), pp 1291–1296
Chajewska U, Koller D, Parr R (2000) Making rational decisions using adaptive utility elicitation. In: Proceedings of the national conference on artificial intelligence (AAAI’00), pp 363–369
Chateauneuf A (1999) Comonotonicity axioms and RDU theory for arbitrary consequences. J Math Econ 32:21–45
Chateauneuf A, Cohen M (1994) Risk-seeking with diminishing marginal utility in a non-expected utility model. J Risk Uncertain 9:77–91
Chateauneuf A, Cohen M, Meilijson I (2004) Four notions of mean-preserving increase in risk, risk attitudes and applications to the rank-dependent expected utility model. J Math Econ 40(6):547–571
Chew S, Karni E, Safra Z (1987) Risk aversion in the theory of expected utility with rank dependent preferences. J Econ Theory 42:370–381
Chew S, Wakker PP (1996) The comonotonic sure thing principle. J Risk Uncertain 12:5–27
Conati C, Gertner AS, VanLehn K, Drudzel MJ (1997) On-line student modeling for coached problem solving using Bayesian networks. In: Proceedings of the international conference on user modeling (UM’97)
Dasgupta P (2006) Distributed automatic target recognition using multiagent UAV swarms. In: Proceedings of the international conference on autonomous agents and multiagent systems (AAMAS’06), pp 479–481
de Salvo Braz R, Amir E, Roth D (2005) Lifted first-order probabilistic inference. In Proceedings of the international joint conference on artificial intelligence (IJCAI’05), pp 1319–1325
Dempster AP (1967) Upper and lower probabilities induced by a multivalued mapping. Ann Math Stat 38:325–339
Doucet A, Johansen A (2011) The Oxford handbook of nonlinear filtering, chapter a tutorial on particle filtering and smoothing: fifteen years later. Oxford University Press, Oxford, pp 656–704
Dubois D, Fargier H, Perny P (2003) Qualitative decision theory with preference relations and comparative uncertainty: an axiomatic approach. Artif Intell J 148(1):219–260
Dubois D, Fargier H, Perny P, Prade H (2002) Qualitative decision theory: from Savage’s axioms to nonmonotonic reasoning. Int J Assoc Comput Mach 49(4):455–495
Dubois D, Fargier H, Prade H (1997) Decision-making under ordinal preferences and uncertainty. In: Proceedings of the conference on Uncertainty in artificial intelligence (UAI’97), pp 157–164
Dubois D, Le Berre D, Prade H, Sabbadin R (1999) Using possibilistic logic for modeling qualitative decision: ATMS-based algorithms. Fund Inform 37(1–2):1–30
Dubois D, Prade H (1995) Possibility theory as a basis of qualitative decision theory. In: Proceedings of the international joint conference on artificial intelligence (IJCAI’95), pp 1924–1930
Dubois D, Prade H, Sabbadin R (1998) Qualitative decision theory with Sugeno integrals. In: Proceedings of the conference on uncertainty in artificial intelligence (UAI’98), pp 121–128
Ellsberg D (1961) Risk, ambiguity and the Savage axioms. Q J Econ 75:643–669
Fargier H, Lang J, Sabbadin R (1998) Towards qualitative approaches to multistage decision making. Int J Approx Reason 19:441–471
Fishburn PC (1970) Utility theory for decision making. Wiley, New York
Fishburn PC (1982) The foundations of expected utility. Kluwer
Fishburn PC, Roberts FS (1978) Mixture axioms in linear and multilinear utility theories. Theory Decis 9:161–171
Franklin R, Spiegelhalter D, Macartney F, Bull K (1991) Evaluation of an algorithm for neonates. Br Med J 302:935–939
Getoor L, Taskar B (2007) Introduction to statistical relational learning. MIT Press, Cambridge
Gilboa I (1987) Expected utility with purely subjective non-additive probabilities. J Math Econ 16:65–88
Gilboa I (2009) Theory of decision under uncertainty. Econometric society monographs. Cambridge University Press, Cambridge
Gonzales C, Perny P (2004) GAI networks for utility elicitation. In: Proceedings of the international conference on principles of knowledge representation and reasoning (KR’04), pp 224–234
Herstein IN, Milnor J (1953) An axiomatic approach to measurable utility. Econometrica 21:291–297
Horvitz E, Barry M (1995) Display of information for time-critical decision making. In: Proceedings of the conference on uncertainty in artificial intelligence (UAI’95), pp 296–305
Horvitz E, Breese J, Heckerman D, Hovel D, Rommelse K (1998) The Lumière project: Bayesian user modeling for inferring the goals and needs of software users. In: Proceedings of the conference on uncertainty in artificial intelligence (UAI’98), pp 256–265
Howard RA (1960) Dynamic programming and Markov processes. MIT Press, Cambridge
Howard RA, Matheson JE (1984) Influence diagrams. In: Howard R, Matheson J (eds) Readings on the principles and applications of decision analysis, vol 2. Strategic Decision Group, Menlo Park, pp 719–762
Hurwicz L (1951) Optimality criteria for decision making under ignorance, vol 370. Cowles Commission discussion paper, Statistics
Ingersoll J (1987) Theory of financial decision making. Rowman and Littlefeld
Jaffray J-Y (1988) Choice under risk and the security factor: an axiomatic model. Theory Decis 24(2):169–200
Jaffray J-Y (1989) Linear utility theory for belief functions. Oper Res Lett 8:107–112
Jensen F, Jensen FV, Dittmer SL (1994) From influence diagrams to junction trees. In: Proceedings of the conference on uncertainty in artificial intelligence (UAI’94)
Jensen FV, Kjærulff U, Kristiansen B, Langseth H, Skaanning C, Vomlel J, Vomlelova M (2001) The SACSO methodology for troubleshooting complex systems
Jensen NE (1967) An introduction to Bernoullian utility theory. I: utility functions. Swed J Econ 69:163–183
Kahneman D, Tversky A (1972) Subjective probability: a judgment of representativeness. Cogn Psychol 3:430–454
Kahneman D, Tversky A (1979) Prospect theory: an analysis of decision under risk. Econometrica 47:263–291
Keeney RL, Raiffa H (1993)Decisions with multiple objectives - preferences and value tradeoffs. Cambridge University Press, Cambridge. (Version originale en 1976 chez Wiley)
Khosravi H, Schulte O, Man T, Xu X, Bina B (2010) Structure learning for Markov logic networks with many descriptive attributes. In: Proceedings of the national conference on artificial intelligence (AAAI’10)
Knight F (1921) Risk, uncertainty and profit. Houghton Miffin
Kok S, Domingos P (2009) Learning Markov logic network structure via hypergraph lifting. In: Proceedings of the international conference on machine learning (ICML’09)
Kraft CH, Pratt JW, Seidenberg A (1959) Intuitive probability on finite sets. Ann Math Stat 30:408–419
Lehmann D (1996) Generalized qualitative probability: Savage revisited. In: Proceedings of the conference on uncertainty in artificial intelligence (UAI’96), pp 381–388
Lu T, Boutilier C (2011) Robust approximation and incremental elicitation in voting protocols. In Proceedings of the international joint conference on artificial intelligence (IJCAI’11), pp 287–293
Machina M (1982) Expected utility analysis without the independence axiom. Econometrica 50:277–323
Monahan GE (1982) A survey of partially observable Markov decision processes: theory, models and algorithms. Manag Sci 28:1–16
Pearl J (1988) Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufman Publishers Inc
Perny P, Rolland A (2006) Reference-dependent qualitative models for decision making under uncertainty. In: Proceedings of the European conference on artificial intelligence (ECAI’06), pp 422–426
Pratt J (1964) Risk aversion in the small and in the large. Econometrica 32:122–136
Puterman ML (1994) Markov decision processes: discrete stochastic dynamic programming. Wiley, New York
Quiggin J (1982) A theory of anticipated utility. J Econ Behav Organ 3:323–343
Quiggin J (1992) Increasing risk: another definition. In: Chikan A (ed) Progress in decision, utility and risk theory. Kluwer, Dordrecht
Quiggin J (1993) Generalized expected utility theory: the rank-dependent model. Springer, Berlin
Raiffa H (1968) Decision analysis: introductory lectures on choices under uncertainty. Addison-Wesley, Reading
Ramsey FP (1931) Truth and probability. In: Ramsey F (ed) The foundations of mathematics and other logical essays. Harcourt, Brace and Co, California
Regan K, Boutilier C (2011) Robust online optimization of reward-uncertain MDPs. In: Proceedings of the international joint conference on artificial intelligence (IJCAI’11), pp 2165–2171
Rotschild M, Stiglitz J (1970) Increasing risk I: a definition. J Econ Theory 2:225–243
Rotschild M, Stiglitz J (1971) Increasing risk II: its economic consequences. J Econ Theory 3:66–84
Sabbadin R (1998) Une Approche Ordinale de la Décision dans l’Incertain: Axiomatisation, Représentation Logique et Application à la Décision Séquentielle. Thèse de doctorat, Université Paul Sabatier, Toulouse, France
Sabbadin R (2001) Possibilistic Markov decision processes. Eng Appl Artif Intell 14:287–300
Savage LJ (1954) The foundations of statistics. Dover
Schmeidler D (1986) Integral representation without additivity. In: Proceedings of the American mathematical society (AMS), vol 97, pp 255–261
Shachter R (1986) Evaluating influence diagrams. Oper Res 34:871–882
Shafer G (1976) Mathematical theory of evidence. Princeton University Press, Princeton
Sondik E (1971) The optimal control of partially observable Markov processes. PhD thesis. Stanford University
Sordoni A, Briot J-P, Alvarez I, Vasconcelos E, Irving M, Melo G (2010) Design of a participatory decision making agent architecture based on argumentation and influence function: application to a serious game about biodiversity conservation. RAIRO Oper Res 44(4):269–284
Tan S, Pearl J (1994) Qualitative decision theory. In: Proceedings of the national conference on artificial intelligence (AAAI’94), pp 928–933
von Neumann J, Morgenstern O (1944) Theory of games and economic behaviour. Princetown University Press, Princetown
Wakker PP (1990) Under stochastic dominance Choquet expected utility and anticipated utility are identical. Theory Decis 29:119–132
Wakker PP (1994) Separating marginal utility and risk aversion. Theory Decis 36:1–44
Wald A (1950) Statistical decision functions. Wiley, New York
Wang T, Boutilier C (2003) Incremental utility elicitation with the minimax regret decision criterion. In: Proceedings of the international joint conference on artificial intelligence (IJCAI’03), pp 309–316
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Gonzales, C., Perny, P. (2020). Decision Under Uncertainty. In: Marquis, P., Papini, O., Prade, H. (eds) A Guided Tour of Artificial Intelligence Research. Springer, Cham. https://doi.org/10.1007/978-3-030-06164-7_17
Download citation
DOI: https://doi.org/10.1007/978-3-030-06164-7_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-06163-0
Online ISBN: 978-3-030-06164-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)