Introduction to Computational Game Theory CMPT 882. Simon Fraser University. Oliver Schulte. Decision Making Under Uncertainty

Similar documents
Exercise Set #6. Venus DL.2.8 CC.5.1

Maximin equilibrium. Mehmet ISMAIL. March, This version: June, 2014

(67686) Mathematical Foundations of AI June 18, Lecture 6

Experimental Computational Philosophy: shedding new lights on (old) philosophical debates

UNIVERSITY OF CALIFORNIA, SAN DIEGO DEPARTMENT OF ECONOMICS

Economics 470 Some Notes on Simple Alternatives to Majority Rule

Mehmet Ismail. Maximin equilibrium RM/14/037

Self-Organization and Cooperation in Social Systems

Arrow s Impossibility Theorem

1.6 Arrow s Impossibility Theorem

1 Strategic Form Games

The (Many) Models of Rawls and Harsanyi

Section Voting Methods. Copyright 2013, 2010, 2007, Pearson, Education, Inc.

Supporting Information Political Quid Pro Quo Agreements: An Experimental Study

Game Theory II: Maximin, Equilibrium, and Refinements

An example of public goods

Mathematics and Social Choice Theory. Topic 4 Voting methods with more than 2 alternatives. 4.1 Social choice procedures

A NOTE ON THE THEORY OF SOCIAL CHOICE

A representation theorem for minmax regret policies

Voting Criteria April

On the Axiomatization of Qualitative Decision Criteria. Faculty of Industrial Engineering and Mgmt.

Bargaining and Cooperation in Strategic Form Games

MATH4999 Capstone Projects in Mathematics and Economics Topic 3 Voting methods and social choice theory

While viewing this PBS Documentary video answer the following questions. 3. Is voting a Right or a Privilege? (Circle the answer)

TI /1 Tinbergen Institute Discussion Paper A Discussion of Maximin

Goods, Games, and Institutions : A Reply

1 Grim Trigger Practice 2. 2 Issue Linkage 3. 3 Institutions as Interaction Accelerators 5. 4 Perverse Incentives 6.

Title: Adverserial Search AIMA: Chapter 5 (Sections 5.1, 5.2 and 5.3)

1 Electoral Competition under Certainty

Lecture 12: Topics in Voting Theory

Indecision Theory: Explaining Selective Abstention in Multiple Elections

Notes for Session 7 Basic Voting Theory and Arrow s Theorem

Pitfalls of a Minimax Approach to Model Uncertainty

Mechanism Design with Public Goods: Committee Karate, Cooperative Games, and the Control of Social Decisions through Subcommittees

Property Rights and the Rule of Law

Algorithms, Games, and Networks February 7, Lecture 8

Game-Theoretic Remarks on Gibbard's Libertarian Social Choice Functions

Honors General Exam Part 1: Microeconomics (33 points) Harvard University

Do not turn over until you are told to do so by the Invigilator.

Prof. Bryan Caplan Econ 854

In Elections, Irrelevant Alternatives Provide Relevant Data

Learning and Belief Based Trade 1

Planning versus Free Choice in Scientific Research

Democratic Rules in Context

Notes for an inaugeral lecture on May 23, 2002, in the Social Sciences division of the University of Chicago, by Roger Myerson.

Unit 03. Ngo Quy Nham Foreign Trade University

Electing the President. Chapter 12 Mathematical Modeling

Computational Social Choice: Spring 2017

the social dilemma?» Emmanuel SOL, Sylvie THORON, Marc WILLINGER

Coalitional Game Theory

On Preferences for Fairness in Non-Cooperative Game Theory

Ellsberg s Paradox and the Value of Chances

How much benevolence is benevolent enough?

Edited by Paul Edwards

Voting rules: (Dixit and Skeath, ch 14) Recall parkland provision decision:

Chapter 9: Social Choice: The Impossible Dream Lesson Plan

Voting and preference aggregation

Voting and preference aggregation

Ellsberg s Paradox and the Value of Chances

ABSTRACT. HATUNOGLU, ERDOGAN EMRAH. A Game Theory Approach to Agricultural Support Policies. (Under the direction of Umut Dur.)

Game Theory and the Law: The Legal-Rules-Acceptability Theorem (A rationale for non-compliance with legal rules)

Figure 1. Payoff Matrix of Typical Prisoner s Dilemma This matrix represents the choices presented to the prisoners and the outcomes that come as the

Phil 115, June 13, 2007 The argument from the original position: set-up and intuitive presentation and the two principles over average utility

Approval Voting Theory with Multiple Levels of Approval

Simple methods for single winner elections

Chapter 10. The Manipulability of Voting Systems. For All Practical Purposes: Effective Teaching. Chapter Briefing

Law & Economics Lecture 1: Basic Notions & Concepts

The Arrow Impossibility Theorem: Where Do We Go From Here?

MATH 1340 Mathematics & Politics

The Origins of the Modern State

The Internet and the Tragedy of the Commons

CS 4407 Algorithms Greedy Algorithms and Minimum Spanning Trees

CSC304 Lecture 16. Voting 3: Axiomatic, Statistical, and Utilitarian Approaches to Voting. CSC304 - Nisarg Shah 1

Game Theory and Climate Change. David Mond Mathematics Institute University of Warwick

Lecture 11 Sociology 621 February 22, 2017 RATIONALITY, SOLIDARITY AND CLASS STRUGGLE

Applications of Mathematics

Arrow s Impossibility Theorem on Social Choice Systems

Some Fundamental Problems of Opinion Modeling with Implications to Committee Composition and Social Choice

Topics on the Border of Economics and Computation December 18, Lecture 8

Introduction to Game Theory. Lirong Xia

Economic philosophy of Amartya Sen Social choice as public reasoning and the capability approach. Reiko Gotoh

VOTING SYSTEMS AND ARROW S THEOREM

Game Theory for Political Scientists. James D. Morrow

Game Theory -RE-S-O-N-A-N-C-E--I-J-U-IY ~~ ~- 1. Nash Equilibrium. P GBabu

Approaches to Voting Systems

econstor Make Your Publications Visible.

Lecture 1 Microeconomics

Lecture 7 A Special Class of TU games: Voting Games

Rationality & Social Choice. Dougherty, POLS 8000

Introduction to Computational Social Choice. Yann Chevaleyre. LAMSADE, Université Paris-Dauphine

An Optimal Single-Winner Preferential Voting System Based on Game Theory

"Efficient and Durable Decision Rules with Incomplete Information", by Bengt Holmström and Roger B. Myerson

Econ 522 Review 3: Tort Law, Criminal Law, and the Legal Process

Example 8.2 The Economics of Terrorism: Externalities and Strategic Interaction

Non-Probabilistic Decision Strategies behind the Veil

I assume familiarity with multivariate calculus and intermediate microeconomics.

INTERNATIONAL ECONOMICS, FINANCE AND TRADE Vol. II - Strategic Interaction, Trade Policy, and National Welfare - Bharati Basu

Rationality of Voting and Voting Systems: Lecture II

Experimental Economics, Environment and Energy Lecture 3: Commons and public goods: tragedies and solutions. Paolo Crosetto

Voter Participation with Collusive Parties. David K. Levine and Andrea Mattozzi

Transcription:

Introduction to Computational Game Theory CMPT 882 Simon Fraser University Oliver Schulte Decision Making Under Uncertainty

Outline Choice Under Uncertainty: Formal Model Choice Principles o Expected Utility o Dominance o Maximin o Regret

Choice and Uncertainty Most of decision theory is concerned with choice under uncertainty. We begin with a model of uncertainty. An agent faces a choice with uncertainty if there are several different ways the world might turn out to be that are relevant to his payoffs. Formally, we represent an agent s uncertainty with a set S of possible states of the world. That s it! If the agent knew which state of the world was the actual one, he would know what outcome would result from a given act. Thus we will think of acts as functions that assign an outcome to each possible state of the world.

The Formal Model of Decision Under Uncertainty We start with three sets. 1. A set A of acts that the agent is choosing among. 2. A set O of possible outcomes. 3. A set S of possible states of the world. An act a in A is a function from states of the world to outcomes. So for each state s in S, an act a assigns an outcome a(s) in O. States of the World Acts s 1 s 2... s k a 1 o 11 o 12... o 1k a 2 o 21 o 22... o 2k............... a n o n1 o n2... o nk

Examples States of the World Acts get caught not get caught Don t pay pay fine pay nothing Skytrain fare Pay fare pay fare pay fare A : pay fare, not pay fare O : pay fine, pay fare, pay nothing S : get caught, not get caught States of the World Acts Accused is innocent Accused is guilty Secret trial Likely wrong conviction Likely true conviction Public trial Less likely wrong conviction Less likely true conviction A : secret trial, public trial O : likely true conviction, likely false conviction, less likely wrong conviction, less likely true conviction S : the accused is innocent/guilty

Decision-making under Risk Suppose we know that Skytrain checks fares on 1/10 rides. Mathematically, we can represent this as a distribution that assigns a probability to each state of the world. 1/10 9/10 States of the World Acts get caught not get caught Don t pay pay fine pay nothing Skytrain fare Pay fare pay fare pay fare The fact that the probabilities of the state of the world are known makes this a decision problem under risk, as opposed to uncertainty. (This terminology is traditional in decision theory but not much used in computer science.) How does knowing the probabilities affect your decision?

Decision-making under Risk: Expected Utility Suppose that an agent can measure the value of each outcome. In the Skytrain example, the value may well be determined by the price of the fare and the fine. 1/10 9/10 States of the World Acts get caught not get caught Don t pay -$50 0 Skytrain fare Pay fare -$3.50 -$3.50 Now we can assign to each option the average dollar cost as its expected value. EU(don t pay) = -1/10 x 50-9/10 x 0 EU(pay) = -3.50 Note the change in units: $3.50 is a dollar amount, whereas EU(pay) is an expected utility amount (sometimes called utiles ). In this example we have assumed that dollar amount = utility. It actually suffices to assume only that utility is a positive linear function of money (e.g. utility = double dollar amount). You will often read in economics papers we assume that money and utility are linearly related.

Notes on Expected Utility Expected utility is a score function for options: x! y if and only if EU(x)! EU(y). Thus given a probability distribution over states of the world, expected utility extends a utility function on outcomes to a utility function over options. It mean seem very natural to make decisions in this way. The famous von Neuman-Morgenstern representation theorem shows that an agent will choose as if they were maximizing an expected utility function if and only if the weak preference relation satisfies a set of axioms for choice under risk. The average payoff does not reflect the variance of the payoff. Given two options with the same expected monetary amount, most people prefer the one with less risk. A simple dramatic demonstration of this phenomenon is the Ellsberg paradox. Perhaps a rational choice theory should take variance into account as well.

Subjective Expected Utility Von Neumann-Morgenstern expected utility theory assumes that the probabilities of outcomes are known. What if they are not? One answer is that in this case the agent should assign probabilities according to their best guess, and then maximize expected utility. The famous representation theorem due to Leonard Savage (1954) is a justification of this approach. It states a set of axioms like the ones we saw for weak preference such that an agent satisfies the axioms if and only if the agent acts if they are maximizing a personal probability assessment. Since the probabilities in question are not given objectively but constitute an agent s case, this approach is called person or subjective expected utility.

Choice Under Uncertainty: Strict Dominance Consider an agent facing a decision under uncertainty with actions A, outcomes O and states of the world S. We assume that the agent has a rational preference relation ο over the outcomes O. Definition An act a strictly dominates another act a iff for all states of the world s S, it is the case that a(s) > a (s). To put it in a slogan, with a strictly dominant act you can only win. Example Consider the Prisoner s Dilemma from the point of view of the row player only. States of the World Acts Cooperate Defect Cooperate 2 0 Defect 3 1 No matter what the column player does, Row prefers Defect.

Choice Under Uncertainty: Weak Dominance 1. Consider an agent facing a decision under uncertainty with actions A, outcomes O and states of the world S. 2. We assume that the agent has a rational preference relation ο over the outcomes O. Definition An act a weakly dominates another act a iff 1. for each state of the world s S, it is the case that a(s) ο a (s) and 2. for some state of the world s S, it is the case that a(s) ν a (s). To put it in a slogan, with a weakly dominant act you can t lose and you might win.

Examples Somebody sends you some books to check out, with no obligation to buy. States of the World Acts like book don t like book check out book return book got good book no book, no money spent no book, no money spent no book, no money spent Checking out the book weakly dominates returning it immediately.

Exercise 1. Jennifer is not satisfied with her grade on an assignment. She considers two options, accepting the grade, and taking up the instructor s office hour with a discussion of her grade. Model Jennifer s situation with a decision matrix with two acts and two states of the world. Assume that Jennifer prefers a better grade, and does not mind spending an hour discussing with her instructor. Show that going to discuss the grade weakly dominates not discussing it.

The Maximin Principle The maximin principle directs us to choose the act whose worst-case performance is the best. Examples States of the World Options Heads Tails Bet 1 100 60 Bet 2 70 100 Maximin says to choose Bet 2. States of the World Options take bird in the hand chase two birds in the bush catch two birds in the don t catch two birds in bush the bush 1 1 2 0 maximin says to take the bird in the hand In CS terms, maximin is a worst-case criterion.

Minimax Regret The regret criterion can be intuitively looked at as a hindsight criterion: if you make a choice and then find out the true state of the world, how much better could you have done? Formally, suppose we have a utility function over the outcomes O. Recall that a(s) is the outcome of act a in state s. The regret for option a in state s is max a u(a (s)) u(a(s)). The max regret is max s max a u(a (s)) u(a(s)). The minimax regret criterion recommends keeping the worst-case regret as small as possible. argmin a max s max a u(a (s)) u(a(s)). Example States of the World Options Heads Tails Bet 1 100 60 Bet 2 70 100 Max regret of Bet 1 = 40 Max regret of Bet 2 = 30

Exercise Blue White Green X 0 50 100 Y 100 0 50 Z 50 100 0 1. Compute the max regrets of Options X,Y,Z. 2. What happens if we eliminate Option Z? 3. What happens if we eliminate Option X?

Review and Closing Comments The formal model of uncertainty uses states of the world to represent the fact that acts may have several nondeterministic consequences. If the probabilities of the different states are known, the most common choice criterion is expected utility. However: o Expected utility is insensitive to the variance of the payoff distribution. o In many experiments, behavioral decision theorists have shown that people systematically violate expected utility theory. Time permitting, we can reproduce some of these experiments in class. I guarantee some of you will fall into irrational behaviour---even Savage did! If the probabilities of outcomes are unknown, most economists and some statisticians think that an agent should choose according to their subjective probabilities. Computer scientists and engineers often prefer to look at objective guarantees like maximin and regret which give worst-case bounds on performance. Dominance also does not assume any probabilities and is uncontroversial. However, in most single-agent problems there is no dominated act. For reasons we can discuss later, dominance plays a more useful role in game theory. One example is the Prisoner s Dilemma.