A Short Tutorial on Game Theory.ppt
《A Short Tutorial on Game Theory.ppt》由会员分享,可在线阅读,更多相关《A Short Tutorial on Game Theory.ppt(49页珍藏版)》请在麦多课文档分享上搜索。
1、A Short Tutorial on Game Theory,EE228a, Fall 2002 Dept. of EECS, U.C. Berkeley,EE228a, Fall 2002,2,Outline,Introduction Complete-Information Strategic Games Static Games Repeated Games Stackelberg Games Cooperative Games Bargaining Problem Coalitions,EE228a, Fall 2002,3,Outline,Introduction What is
2、game theory about? Relevance to networking research Elements of a game Non-Cooperative Games Static Complete-Information Games Repeated Complete-Information Games Stackelberg Games Cooperative Games Nashs Bargaining Solution Coalition: the Shapley Value,EE228a, Fall 2002,Introduction,4,What Is Game
3、Theory About?,To understand how decision-makers interact A brief history 1920s: study on strict competitions 1944: Von Neumann and Morgensterns book Theory of Games and Economic Behavior After 1950s: widely used in economics, politics, biology Competition between firms Auction design Role of punishm
4、ent in law enforcement International policies Evolution of species,EE228a, Fall 2002,Introduction,5,Relevance to Networking Research,Economic issues becomes increasingly important Interactions between human users congestion control resource allocation Independent service providers Bandwidth trading
5、Peering agreements Tool for system design Distributed algorithms Multi-objective optimization Incentive compatible protocols,EE228a, Fall 2002,Introduction,6,Elements of a Game: Strategies,Decision-makers choice(s) in any given situation Fully known to the decision-maker Examples Price set by a firm
6、 Bids in an auction Routing decision by a routing algorithm Strategy space: set of all possible actions Finite vs infinite strategy space Pure vs mixed strategies Pure: deterministic actions Mixed: randomized actions,EE228a, Fall 2002,Introduction,7,Elements of a Game: Preference and Payoff,Preferen
7、ce Transitive ordering among strategiesif a b, b c, then a c Payoff An order-preserving mapping from preference to R+ Example: in flow control, U(x)=log(1+x) px,EE228a, Fall 2002,Introduction,8,Rational Choice,Two axiomatic assumptions on games In any given situation a decision-maker always chooses
8、the action which is the best according to his/her preferences (a.k.a. rational play). Rational play is common knowledge among all players in the game.,EE228a, Fall 2002,Introduction,9,Example: Prisoners Dilemma,EE228a, Fall 2002,Introduction,10,Different Types of Games,Static vs multi-stage Static:
9、game is played only once Prisoners dilemma Multi-stage: game is played in multiple rounds Multi-round auctions, chess games Complete vs incomplete information Complete info.: players know each others payoffs Prisoners dilemma Incomplete info.: other players payoffs are not known Sealed auctions,EE22
10、8a, Fall 2002,Introduction,11,Representations of a Game,Normal- vs extensive-form representation Normal-form like the one used in previous example Extensive-form,EE228a, Fall 2002,12,Outline,Introduction Complete-Information Strategic Games Static Games Repeated Games Stackelberg Games Cooperative G
11、ames Nashs Bargaining Problem Coalitions: the Shapley Value,EE228a, Fall 2002,13,Static Games,Model Players know each others payoffs But do not know which strategies they would choose Players simultaneously choose their strategies Game is over and players receive payoffs based on the combination of
12、strategies just chosen Question of Interest: What outcome would be produced by such a game?,EE228a, Fall 2002,14,Example: Cournots Model of Duopoly,Model (from Gibbons) Two firms producing the same kind of product in quantities of q1 and q2, respectively Market clearing price p=A q1 q2 Cost of produ
13、ction is C for both firms Profit for firm iJi = (A q1 q2) qi C qi= (A C q1 q2) qidefine B A C Objective: choose qi to maximize profitqi*= argmaxqi (B q1 q2) qi,EE228a, Fall 2002,15,A Simple Example: Solution,Firm is best choice, given its competitors q,EE228a, Fall 2002,16,Solution to Static Games,N
14、ash Equilibrium (J. F. Nash, 1950) Mathematically, a strategy profile (s1* , , si*, sn* ) is a Nash Equilibrium if for each player iUi(s1* , , s*i-1, si*, s*i+1, sn* ) Ui(s1* , , s*i-1, si, s*i+1,sn* ), for each feasible strategy si Plain English: a situation in which no player has incentive to devi
15、ate Its fixed-point solution to the following system of equations si=argmaxs Ui(s1, , si-1, s, si+1,sn ), i Other solution concepts (see references),EE228a, Fall 2002,17,An Example on Mixed Strategies,Pure-Strategy Nash Equilibrium may not exist,Player A,Player B,Head (H),Tail (T),H,T,1, 1,1, 1,1, 1
16、,1, 1,Cause: each player tries to outguess his opponent!,EE228a, Fall 2002,18,Example: Best Reply,Mixed Strategies Randomized actions to avoid being outguessed Players strategies and expected payoffs Players plays H w.p. p and play T w.p. 1 p Expected payoff of Player Apa pb + (1 pa) (1 pb) pa (1 pb
17、) pb (1 pa) = (1 2 pb) + pa (4pb 2) So if pb 1/2, pa*=1 (i.e. play H); if pb 1/2, pa*=0 (i.e. play T); if pb=1/2, then playing either H or T is equally good,EE228a, Fall 2002,19,Example: Nash Equilibrium,pb,pa,0,1,1,EE228a, Fall 2002,20,Existence of Nash Equilibrium,Finite strategy space (J. F. Nash
18、, 1950) A n-player game has at least one Nash equilibrium, possibly involving mixed strategy. Infinite strategy space (R.B. Rosen, 1965) A pure-strategy Nash Equilibrium exists in a n-player concave game. If the payoff functions satisfy diagonally strict concavity condition, then the equilibrium is
19、unique.(s1 s2) rjJj(s1) + (s2 s1) rjJj(s2) 0,EE228a, Fall 2002,21,Distributed Computation of Nash Equilibrium,Nash equilibrium as result of “learning” Players iteratively adjust their strategies based on locally available information Equilibrium is reached if there is a steady state Two commonly use
20、d schemes,Gauss-Siedel,Jacobian,EE228a, Fall 2002,22,Convergence of Distributed Algorithms,Algorithms may not converge for some cases,EE228a, Fall 2002,23,Suggested Readings,J.F. Nash. “Equilibrium Points in N-Person Games.” Proc. of National Academy of Sciences, vol. 36, 1950. A “must-read” classic
21、 paper R.B. Rosen. “Existence and Uniqueness of Equilibrium Points for Concave N-Person Games.” Econometrica, vol. 33, 1965. Has many useful techniques A. Orda et al. “Competitive Routing in Multi-User Communication Networks.” IEEE/ACM Transactions on Networking, vol. 1, 1993. Applies game theory to
22、 routing And many more,EE228a, Fall 2002,24,Multi-Stage Games,General model Game is played in multiple rounds Finite or infinitely many times Different games could be played in different rounds Different set of actions or even players Different solution concepts from those in static games Analogy: o
23、ptimization vs dynamic programming Two special classes Infinitely repeated games Stackelberg games,EE228a, Fall 2002,25,Infinitely Repeated Games,Model A single-stage game is repeated infinitely many times Accumulated payoff for a player,J=t1+dt2+d n-1tn+=Si d i-1ti,Main theme: play socially more ef
24、ficient moves Everyone promises to play a socially efficient move in each stage Punishment is used to deter “cheating” Example: justice system,EE228a, Fall 2002,26,Cournots Game Revisited. I,Cournots Model At equilibrium each firm produces B/3, making a profit of B2/9 Not an “ideal” arrangement for
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- ASHORTTUTORIALONGAMETHEORYPPT
