Introduction to Game Theory. Lirong Xia

Similar documents
Computational Social Processes. Lirong Xia

Supporting Information Political Quid Pro Quo Agreements: An Experimental Study

Voting. Suppose that the outcome is determined by the mean of all voter s positions.

Strategic voting. with thanks to:

Game Theory for Political Scientists. James D. Morrow

Introduction to Game Theory

David R. M. Thompson, Omer Lev, Kevin Leyton-Brown & Jeffrey S. Rosenschein COMSOC 2012 Kraków, Poland

Dynamic Games Lesson 4: Economic Aplica4ons. Universidad Carlos III

Introduction to Game Theory

GAME THEORY. Analysis of Conflict ROGER B. MYERSON. HARVARD UNIVERSITY PRESS Cambridge, Massachusetts London, England

Computational social choice Combinatorial voting. Lirong Xia

International Cooperation, Parties and. Ideology - Very preliminary and incomplete

1 Strategic Form Games

CSC304 Lecture 16. Voting 3: Axiomatic, Statistical, and Utilitarian Approaches to Voting. CSC304 - Nisarg Shah 1

Introduction to Political Economy Problem Set 3

Social Rankings in Human-Computer Committees

Preferential votes and minority representation in open list proportional representation systems

Sequential Voting with Externalities: Herding in Social Networks

Politics is the subset of human behavior that involves the use of power or influence.

Stackelberg Voting Games

I assume familiarity with multivariate calculus and intermediate microeconomics.

Bargaining and Cooperation in Strategic Form Games

Notes for Session 7 Basic Voting Theory and Arrow s Theorem

Voting rules: (Dixit and Skeath, ch 14) Recall parkland provision decision:

MIDTERM EXAM 1: Political Economy Winter 2017

Exercise Set #6. Venus DL.2.8 CC.5.1

Game Theory II: Maximin, Equilibrium, and Refinements

Coalitional Game Theory

The mathematics of voting, power, and sharing Part 1

Self-Organization and Cooperation in Social Systems

Goods, Games, and Institutions : A Reply

The Origins of the Modern State

MATH4999 Capstone Projects in Mathematics and Economics Topic 3 Voting methods and social choice theory

A Study of Approval voting on Large Poisson Games

Enriqueta Aragones Harvard University and Universitat Pompeu Fabra Andrew Postlewaite University of Pennsylvania. March 9, 2000

Mathematics and Social Choice Theory. Topic 4 Voting methods with more than 2 alternatives. 4.1 Social choice procedures

Chapter 9: Social Choice: The Impossible Dream

Some Game-Theoretic Aspects of Voting

arxiv: v1 [cs.gt] 11 Jul 2018

Classical papers: Osborbe and Slivinski (1996) and Besley and Coate (1997)

Voting Systems. High School Circle I. June 4, 2017

An example of public goods

Math for Liberal Studies

1 Grim Trigger Practice 2. 2 Issue Linkage 3. 3 Institutions as Interaction Accelerators 5. 4 Perverse Incentives 6.

1 Electoral Competition under Certainty

Sincere Versus Sophisticated Voting When Legislators Vote Sequentially

Election Theory. How voters and parties behave strategically in democratic systems. Mark Crowley

Property Rights and the Rule of Law

Strategy in Law and Business Problem Set 1 February 14, Find the Nash equilibria for the following Games:

INTERNATIONAL ECONOMICS, FINANCE AND TRADE Vol. II - Strategic Interaction, Trade Policy, and National Welfare - Bharati Basu

HOTELLING-DOWNS MODEL OF ELECTORAL COMPETITION AND THE OPTION TO QUIT

Topics on the Border of Economics and Computation December 18, Lecture 8

Discussion Paper No FUNDAMENTALS OF SOCIAL CHOICE THEORY by Roger B. Myerson * September 1996

Social welfare functions

Sequential vs. Simultaneous Voting: Experimental Evidence

The Manipulability of Voting Systems. Check off these skills when you feel that you have mastered them.

Computational Social Choice: Spring 2017

1.6 Arrow s Impossibility Theorem

Chapter 9: Social Choice: The Impossible Dream Lesson Plan

Social Choice & Mechanism Design

Manipulating Two Stage Voting Rules

c M. J. Wooldridge, used by permission/updated by Simon Parsons, Spring

Game Theory and the Law: The Legal-Rules-Acceptability Theorem (A rationale for non-compliance with legal rules)

Corruption in Committees: An Experimental Study of Information Aggregation through Voting 1

Strategic Models of Politics

In deciding upon a winner, there is always one main goal: to reflect the preferences of the people in the most fair way possible.

14.770: Introduction to Political Economy Lecture 11: Economic Policy under Representative Democracy

Sincere versus sophisticated voting when legislators vote sequentially

What is Computational Social Choice?

Computational Social Choice: Spring 2007

Refinements of Nash equilibria. Jorge M. Streb. Universidade de Brasilia 7 June 2016

Introduction to Computational Game Theory CMPT 882. Simon Fraser University. Oliver Schulte. Decision Making Under Uncertainty

Candidate Citizen Models

Strategic voting in a social context: considerate equilibria

PROBLEM SET #2: VOTING RULES

A Higher Calling: Career Concerns and the Number of Political Parties

Introduction to Combinatorial Voting

Social Choice Theory. Denis Bouyssou CNRS LAMSADE

MIDTERM EXAM: Political Economy Winter 2013

University of Toronto Department of Economics. Party formation in single-issue politics [revised]

Social Choice: The Impossible Dream. Check off these skills when you feel that you have mastered them.

CSC304 Lecture 14. Begin Computational Social Choice: Voting 1: Introduction, Axioms, Rules. CSC304 - Nisarg Shah 1

Published in Canadian Journal of Economics 27 (1995), Copyright c 1995 by Canadian Economics Association

Homework 7 Answers PS 30 November 2013

On Preferences for Fairness in Non-Cooperative Game Theory

Elections with Only 2 Alternatives

PS 124A Midterm, Fall 2013

Strategic Voting and Strategic Candidacy

Committee proposals and restrictive rules

Section Voting Methods. Copyright 2013, 2010, 2007, Pearson, Education, Inc.

Uninformed search. Lirong Xia

Evaluating and Comparing Voting Rules behind the Veil of Ignorance: a Brief and Selective Survey and an Analysis of Two-Parameter Scoring Rules

Practice TEST: Chapter 14

Influence in Social Networks

Manipulating Two Stage Voting Rules

Constraint satisfaction problems. Lirong Xia

Introduction to Computational Social Choice. Yann Chevaleyre. LAMSADE, Université Paris-Dauphine

Electoral Uncertainty and the Stability of Coalition Governments

Wage Rigidity and Spatial Misallocation: Evidence from Italy and Germany

Strategic party formation on a circle and Duverger s Law

Transcription:

Introduction to Game Theory Lirong Xia Fall, 2016

Homework 1 2

Announcements ØWe will use LMS for submission and grading ØPlease just submit one copy ØPlease acknowledge your team mates 3

Ø Show the math and formal proof No math/steps, no points (esp. in midterm) Especially Problem 1, 4, 5 Ø Problem 1 Must use u(1m) etc. Must hold for all utility function Ø Problem 2 must show your calculation For Schulze, if you have already found one strict winner, no need to check other alternatives Kemeny outputs a single winner, unless otherwise mentioned Ø Problem 3.2 b winning itself is not a paradox Remarks people can change the outcome by not voting is not a paradox 4

Last class Ø Mallows model Ø MLE and MAP Ø P = {a>b>c, 2@c>b>a} Ø Likelihood Ø Prior distribution Pr(a>b>c)=Pr(a>c>b)=0.3 all other linear orders have prior 0.1 Ø Posterior distribution proportional to Likelihood*prior 5

Last class Ø Plackett-Luce model Example alternatives {a,b,c} parameter space {(4,3,3), (3,4,3), (3,3,4)} Ø MLE and MAP Ø P = {a>b>c, 2@c>b>a} Ø Likelihood Ø Prior distribution Pr(4,3,3)=0.8 all others have prior 0.1 Ø Posterior distribution proportional to Likelihood*prior 6

Review: manipulation (ties are broken alphabetically) > > YOU > > Plurality rule Bob > > Carol > >

What if everyone is incentivized to lie? > > YOU > > Plurality rule Bob Carol > >

Today s schedule: game theory Ø What? Agents may have incentives to lie Ø Why? Hard to predict the outcome when agents lie Ø How? A general framework for games Solution concept: Nash equilibrium Modeling preferences and behavior: utility theory Special games Normal form games: mixed Nash equilibrium Extensive form games: subgame-perfect equilibrium 9

A game R 1 * s 1 Strategy Profile D Mechanism R 2 * s 2 Outcome R n * s n Players: N={1,,n} Strategies (actions): - S j for agent j, s j S j - (s 1,,s n ) is called a strategy profile. Outcomes: O Preferences: total preorders (full rankings with ties) over O often represented by a utility function u i : Π j S j R Mechanism f : Π j S j O 10

A game of plurality elections YOU > > Plurality rule Bob > > Carol > > Players: { YOU, Bob, Carol } Outcomes: O = {,, } Strategies: S j = Rankings(O) Preferences: See above Mechanism: the plurality rule 11

A game of two prisoners Column player Cooperate Defect Row player Cooperate (-1, -1) (-3, 0) Defect ( 0, -3) (-2, -2) Ø Players: Ø Strategies: { Cooperate, Defect } Ø Outcomes: {(-2, -2), (-3, 0), ( 0, -3), (-1, -1)} Ø Preferences: self-interested 0 > -1 > -2 > -3 : ( 0, -3) > (-1, -1) > (-2, -2) > (-3, 0) : (-3, 0) > (-1, -1) > (-2, -2) > ( 0, -3) Ø Mechanism: the table 12

Ø Suppose Solving the game every player wants to make the outcome as preferable (to her) as possible by controlling her own strategy (but not the other players ) Ø What is the outcome? No one knows for sure A stable situation seems reasonable Ø A Nash Equilibrium (NE) is a strategy profile (s 1,,s n ) such that For every player j and every s j ' S j, f (s j, s -j ) j f (s j ', s -j ) or u j (s j, s -j ) u j (s j ', s -j ) s -j = (s 1,,s j-1, s j+1,,s n ) no single player can be better off by deviating 13

Prisoner s dilemma Column player Cooperate Defect Row player Cooperate (-1, -1) (-3, 0) Defect ( 0, -3) (-2, -2) 14

A beautiful mind Ø If everyone competes for the blond, we block each other and no one gets her. So then we all go for her friends. But they give us the cold shoulder, because no one likes to be second choice. Again, no winner. But what if none of us go for the blond. We don t get in each other s way, we don t insult the other girls. That s the only way we win. That s the only way we all get [a girl.] 15

A beautiful mind: the bar game Hansen Column player Blond Another girl Nash Row player Blond ( 0, 0 ) ( 5, 1 ) Another girl ( 1, 5 ) ( 2, 2 ) Ø Players: { Nash, Hansen } Ø Strategies: { Blond, another girl } Ø Outcomes: {(0, 0), (5, 1), (1, 5), (2, 2)} Ø Preferences: self-interested Ø Mechanism: the table 16

Does an NE always exists? Ø Not always Column player L R Row player U ( -1, 1 ) ( 1, -1 ) D ( 1, -1 ) ( -1, 1 ) Ø But an NE exists when every player has a dominant strategy s j is a dominant strategy for player j, if for every s j ' S j, 1. for every s -j, f (s j, s -j ) j f (s j ', s -j ) 2. the preference is strict for some s -j 17

Dominant-strategy NE ØFor player j, strategy s j dominates strategy s j, if 1. for every s -j, u j (s j, s -j ) u j (s j ', s -j ) 2. the preference is strict for some s -j ØRecall that an NE exists when every player has a dominant strategy s j, if s j dominates other strategies of the same agent ØA dominant-strategy NE (DSNE) is an NE where every player takes a dominant strategy may not exists, but if exists, then must be unique 18

Prisoner s dilemma Column player Cooperate Defect Row player Cooperate (-1, -1) (-3, 0) Defect ( 0, -3) (-2, -2) Defect is the dominant strategy for both players 19

The Game of Chicken Ø Two drivers for a single-lane bridge from opposite directions and each can either (S)traight or (A)way. If both choose S, then crash. If one chooses A and the other chooses S, the latter wins. If both choose A, both are survived Column player A S Row player A ( 0, 0 ) ( 0, 1 ) S ( 1, 0 ) ( -10, -10 ) NE 20

Rock Paper Scissors ØActions: {R, P, S} ØTwo-player zero sum game No pure NE Column player R P S Row player R ( 0, 0 ) ( -1, 1 ) ( 1, -1 ) P ( 1, -1 ) ( 0, 0 ) ( 1, -1 ) S ( 1, -1 ) ( 1, -1 ) ( 0, 0 ) 21

Rock Paper Scissors: Lirong vs. young Daughter Ø Actions Lirong: {R, P, S} Daughter: {mini R, mini P} Ø Two-player zero sum game Daughter No pure NE mini R mini P Lirong R ( 0, 0 ) ( -1, 1 ) P ( 1, -1 ) ( 0, 0 ) S ( 1, -1 ) ( 1, -1 ) 22

Computing NE: Iterated Elimination ØEliminate dominated strategies sequentially Column player Row player L M R U ( 1, 0 ) ( 1, 2 ) ( 0, 1 ) D ( 0, 3 ) ( 0, 1 ) ( 2, 0 ) 23

Iterated Elimination: Lirong vs. young Daughter Ø Actions Lirong: {R, P, S} Daughter: {mini R, mini P} Ø Two-player zero sum game Daughter No pure NE mini R mini P R ( 0, 0 ) ( -1, 1 ) Lirong P ( 1, -1 ) ( 0, 0 ) S ( -1, 1 ) ( 1, -1 ) 24

Normal form games Ø Given pure strategies: S j for agent j Normal form games Ø Players: N={1,,n} Ø Strategies: lotteries (distributions) over S j L j Lot(S j ) is called a mixed strategy (L 1,, L n ) is a mixed-strategy profile Ø Outcomes: Π j Lot(S j ) Ø Mechanism: f (L 1,,L n ) = p p(s 1,,s n ) = Π j L j (s j ) Ø Preferences: Soon Row player Column player L R U ( 0, 1 ) ( 1, 0 ) D ( 1, 0 ) ( 0, 1 ) 25

Preferences over lotteries ØOption 1 vs. Option 2 Option 1: $0@50%+$30@50% Option 2: $5 for sure ØOption 3 vs. Option 4 Option 3: $0@50%+$30M@50% Option 4: $5M for sure 26

Lotteries ØThere are m objects. Obj={o 1,,o m } ØLot(Obj): all lotteries (distributions) over Obj ØIn general, an agent s preferences can be modeled by a preorder (ranking with ties) over Lot(Obj) But there are infinitely many outcomes 27

Utility theory Utility function: u: Obj R ØFor any p Lot(Obj) u(p) = Σ o Obj p(o)u(o) Øu represents a total preorder over Lot(Obj) p 1 >p 2 if and only if u(p 1 )>u(p 2 ) 28

Example utility Money Money 0 5 30 5M 30M Utility 1 3 10 100 150 Øu(Option 1) = u(0) 50% + u(30) 50%=5.5 Øu(Option 2) = u(5) 100%=3 Øu(Option 3) = u(0) 50% + u(30m) 50%=75.5 Øu(Option 4) = u(5m) 100%=100 29

Normal form games ØGiven pure strategies: S j for agent j ØPlayers: N={1,,n} ØStrategies: lotteries (distributions) over S j L j Lot(S j ) is called a mixed strategy (L 1,, L n ) is a mixed-strategy profile ØOutcomes: Π j Lot(S j ) ØMechanism: f (L 1,,L n ) = p, such that p(s 1,,s n ) = Π j L j (s j ) ØPreferences: represented by utility functions u 1,,u n 30

Mixed-strategy NE Ø Mixed-strategy Nash Equilibrium is a mixed strategy profile (L 1,, L n ) s.t. for every j and every L j ' Lot(S j ) u j (L j, L -j ) u j (L j ', L -j ) Ø Any normal form game has at least one mixedstrategy NE [Nash 1950] Ø Any L j with L j (s j )=1 for some s j S j is called a pure strategy Ø Pure Nash Equilibrium a special mixed-strategy NE (L 1,, L n ) where all strategies are pure strategy 31

Example: mixed-strategy NE Column player H T Row player H ( -1, 1 ) ( 1, -1 ) T ( 1, -1 ) ( -1, 1 ) Ø(H@0.5+T@0.5, H@0.5+T@0.5) } Row player s strategy } Column player s strategy 32

Best responses Ø For any agent j, given any other agents strategies L -j, the set of best responses is BR(L -j ) = argmax sj u j (s j, L -j ) It is a set of pure strategies Ø A strategy profile L is an NE if and only if for all agent j, L j only takes positive probabilities on BR(L -j ) 33

Computing NEs by guessing best responses Ø Step 1. Guess the best response sets BR j for all players Ø Step 2. Check if there are ways to assign probabilities to BR j to make them actual best responses 34

Example Column player H T Row player H ( -1, 1 ) ( 1, -1 ) T ( 1, -1 ) ( -1, 1 ) Ø Hypothetical BR Row ={H,T}, BR Col ={H,T} Pr Row (H)=p, Pr Col (H)=q Row player: 1-q-q=q-(1-q) Column player: 1-q-q=q-(1-q) p=q=0.5 Ø Hypothetical BR Row ={H,T}, BR Col ={H} Pr Row (H)=p Row player: -1 = 1 Column player: p-(1-p)>=-p+(1-p) No solution 35

Rock Paper Scissors: Lirong vs. young Daughter Daughter mini R mini P R ( 0, 0 ) ( -1, 1 ) Lirong P ( 1, -1 ) ( 0, 0 ) S (-1, 1 ) ( 1, -1 ) Ø Hypothetical BR L ={P,S}, BR D : {mini R, mini P} Pr L (P)=p, Pr D (mini R) = q Lirong: q = (1-q)-q Daughter: -1p+(1-p) = -1(1-p) p=2/3, q=1/3 36

Extensive-form games Nash B A Hansen Hansen B A B A Nash (5,1) (1,5) (2,2) B A (0,0) (-1,5) leaves: utilities (Nash,Hansen) Ø Players move sequentially Ø Outcomes: leaves Ø Preferences are represented by utilities Ø A strategy of player j is a combination of all actions at her nodes Ø All players know the game tree (complete information) Ø At player j s node, she knows all previous moves (perfect information) 37

Convert to normal-form Nash B A Hansen Hansen B A B A Hansen (B,B) (B,A) (A,B) (A,A) (B,B) (0,0) (0,0) (5,1) (5,1) Nash (5,1) (1,5) (2,2) B A (0,0) (-1,5) Nash (B,A) (-1,5) (-1,5) (5,1) (5,1) (A,B) (1,5) (2,2) (1,5) (2,2) (A,A) (1,5) (2,2) (1,5) (2,2) Nash: (Up node action, Down node action) Hansen: (Left node action, Right node action) 38

Subgame perfect equilibrium Nash B A Hansen Hansen B A B A Nash (5,1) (1,5) (2,2) B A (0,0) (-1,5) ØUsually too many NE Ø(pure) SPNE a refinement (special NE) also an NE of any subgame (subtree) 39

Backward induction Nash (5,1) B A Hansen (5,1) Hansen (1,5) B A B A Nash (0,0) (5,1) (1,5) (2,2) B A (0,0) (-1,5) ØDetermine the strategies bottom-up ØUnique if no ties in the process ØAll SPNE can be obtained, if the game is finite complete information perfect information 40

A different angle ØHow good is SPNE as a solution concept? At least one In many cases unique is a refinement of NE (always exists) 41

Wrap up Preferences Solution concept How many Computation General game total preorders NE 0-many Normal form game utilities mixed-strategy NE pure NE mixed: 1-many pure: 0-many Extensive form game utilities Subgame perfect NE 1 (no ties) many (ties) Backward induction 42

The reading questions Ø What is the problem? agents may have incentive to lie Ø Why we want to study this problem? How general it is? The outcome is hard to predict when agents lie It is very general and important Ø How was problem addressed? by modeling the situation as a game and focus on solution concepts, e.g. Nash Equilibrium Ø Appreciate the work: what makes the work nontrivial? It is by far the most sensible solution concept. Existence of (mixed-strategy) NE for normal form games Ø Critical thinking: anything you are not very satisfied with? Hard to justify NE in real-life How to obtain the utility function? 43

Looking forward ØSo far we have been using game theory for prediction ØHow to design the mechanism? when every agent is self-interested as a whole, works as we want ØThe next class: mechanism design 44

NE of the plurality election game YOU > > Plurality rule Bob > > Carol > > Players: { YOU, Bob, Carol}, n=3 Outcomes: O = {,, } Strategies: S j = Rankings(O) Preferences: Rankings(O) Mechanism: the plurality rule 45