读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈

时间 2020-07-21

原文原文链接

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈

贝叶斯博弈(Bayesian Games)

本文是Game Theory An Introduction (by Steven Tadelis) 的学习笔记。html

不完整信息的静态博弈(Incomplete information static games)

不完整信息博弈意味着玩家之间缺少共识(common knowledge)，具体指的是其它对手的行动集、结果集和收益函数等信息。对不完整信息博弈的处理方法来自于Harsanyi。他引进了两个概念来解决这个问题。 type space: 将对手隐藏的信息(行动集、结果集和收益函数等)转化为多个types，每一个type中的信息都是可知的。 belief: 因为不知道对手的具体type是什么，所以使用分布几率表示对手选择某个type的可能性。这样就能够经过几率统计来计算可能的收益。函数

静态不完整信息贝叶斯博弈(static Bayesian game of incomplete information)的normal-form描述 $$ \left \langle N, { A_i }{i=1}^n, { \Theta_i }{i=1}^n, { v_i(\cdot; \theta_i), \theta_i \in \Theta_i }{i=1}^n, { \phi_i }{i=1}^n \right \rangle \ where \ N = { 1,2,\cdots, n} \text{ : is the set of players} \ A_i \text{ : the action set of player i} \ \Theta_i \text{ : the type space of player i} \ v_i : A \times \Theta_i \to \mathbb{R} \text{ : type dependent pay of function of player i} \ \phi \text{ : the belief of player i with respect to the uncertainty over the other players' types} \ \phi(\theta_{-i} | \theta_i) \text{ : the posterior conditional distribution on } \theta_{-i} $$post
静态不完整信息贝叶斯博弈处理流程：学习
1. 天然选择一个类型组合(profile of types)$\theta_1, \theta_2, \cdots, \theta_n$。
2. 每一个玩家知道本身$\theta_i$，使用先前的$\phi_i$来造成对对手type的分布几率。
3. 玩家选择行动。
4. 根据玩家们的行动$a = (a_i, a_2, \cdots, a_n)$，能够或者收益$v_i(a; \theta)$.
条件几率(conditional probability) 当事件S发生时，事件H发生的条件几率为： $$ \Pr{H|S} = \frac{\phi(S \land H)}{\phi(S)} $$ui
静态不完整信息贝叶斯博弈 - 纯策略 $$ \left \langle N, { A_i }{i=1}^n, { \Theta_i }{i=1}^n, { v_i(\cdot; \theta_i), \theta_i \in \Theta_i }{i=1}^n, { \phi_i }{i=1}^n \right \rangle \ $$ 玩家i的一个纯策略$s_i(\theta_i) \to a_i$spa
静态不完整信息贝叶斯博弈 - 混合策略玩家i的一个混合策略是一个在纯策略之上的几率分布。orm
静态不完整信息贝叶斯博弈 - 纯策略贝叶斯纳什均衡(pure-strategy Bayesian Nash equilibrium) 一个纯策略贝叶斯纳什均衡$s^* = (s_1^, \cdots, s_n^)$，若是对于每一个玩家i，每一个玩家的类型$\theta_i \in \Theta_i$，每一个行动$a_i \in A_i$，知足： $$ \sum_{\theta_{-i} \in \Theta_{-i}} \phi_i(\theta_{-i}|\theta_i) v_i(s_i^(\theta_i), s_{-i}^(\theta_{-i});\theta_i) \geq \sum_{\theta_{-i} \in \Theta_{-i}} \phi_i(\theta_{-i}|\theta_i) v_i(a_i, s_{-i}^(\theta_{-i});\theta_i) \ where \ v_i(a_i, s_{-i}^(\theta_{-i});\theta_i) \text{ : only on type } \theta_i \text{, the player i's payoff function} $$ 其含义：对于每一个玩家，其行动$s_i^*(\theta_i)$的分布几率收益总和老是最大的。htm

关于这章（甚至整本书），重要的是学会如何使用这些理论，书中提供了很好的示例。但这里就不介绍了。blog

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈 贝叶斯博弈

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈 贝叶斯博弈

贝叶斯博弈(Bayesian Games)

不完整信息的静态博弈(Incomplete information static games)

参照

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈