
Decomposition Techniques for Markov Zero-Sum Games with Nested Information.

IEEE Conference on Decision and Control (2013)

Abstract
Markov zero-sum games arise in applications such as network interdiction, where an informed defender protects a network against attacks. This problem has received significant attention in recent years due to its relevance to military problems and network security. In this paper, we focus on finite games in which the attacker has imperfect knowledge of the network state, and we formulate the problem as a Markov game with nested information. By exploiting the nested information structure, we decompose the multistage game into a sequence of one-stage subgames and develop an algorithm that computes the value of the game and its saddle-point strategies. The decomposition method computes the value using backward induction, as in stochastic dynamic programming, and then identifies saddle-point strategies that achieve this value. Using the Markov structure of the game, we show that the value can be computed efficiently in terms of a single value function of an information state at each stage. The resulting single-stage optimization problems are much smaller than the original multistage game. We illustrate our results with an example of multistage network interdiction in which the attacker may not be able to observe the outcomes of the attacks.
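
The backward-induction idea in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a fully observed state instead of the paper's information state, restricts each player to two actions so that every one-stage subgame is a 2x2 matrix game with a closed-form value, and uses made-up payoffs and transitions. The names `solve_2x2`, `backward_induction`, `reward`, and `trans` are hypothetical.

```python
from fractions import Fraction


def solve_2x2(A):
    """Value and maximizer's mixed strategy of a 2x2 zero-sum matrix game.

    Rows belong to the maximizer, columns to the minimizer.
    """
    a, b, c, d = (Fraction(x) for row in A for x in row)
    lower = max(min(a, b), min(c, d))  # maximin over pure rows
    upper = min(max(a, c), max(b, d))  # minimax over pure columns
    if lower == upper:                 # a pure saddle point exists
        row = 0 if min(a, b) == lower else 1
        return lower, (Fraction(1 - row), Fraction(row))
    denom = a + d - b - c              # nonzero when there is no pure saddle point
    p = (d - c) / denom                # probability of playing row 0
    return (a * d - b * c) / denom, (p, 1 - p)


def backward_induction(reward, trans, horizon):
    """Finite-horizon zero-sum Markov game with an observed state (an assumption).

    reward[s][i][j]: stage payoff to the maximizer in state s.
    trans[s][i][j]:  list of next-state probabilities.
    Returns V[t][s] (game values) and policy[t][s] (maximizer's mixed strategy).
    """
    n = len(reward)
    V = [[Fraction(0)] * n for _ in range(horizon + 1)]  # terminal value is 0
    policy = [[None] * n for _ in range(horizon)]
    for t in range(horizon - 1, -1, -1):
        for s in range(n):
            # One-stage subgame: stage payoff plus expected value-to-go.
            Q = [[reward[s][i][j]
                  + sum(Fraction(p) * V[t + 1][s2]
                        for s2, p in enumerate(trans[s][i][j]))
                  for j in range(2)]
                 for i in range(2)]
            V[t][s], policy[t][s] = solve_2x2(Q)
    return V, policy


# Illustrative two-state example (all numbers are invented for this sketch).
reward = [[[2, 0], [0, 1]],
          [[3, 1], [4, 2]]]
trans = [
    [[[1, 0], [0, 1]], [[0, 1], [1, 0]]],  # state 0: stay if actions match
    [[[0, 1], [0, 1]], [[0, 1], [0, 1]]],  # state 1 is absorbing
]
values, policy = backward_induction(reward, trans, horizon=2)
print([str(v) for v in values[0]])  # stage-0 values per state
```

Each stage solves only a small matrix game per state, which is the sense in which the single-stage problems are smaller than the multistage game. Handling the nested information structure would replace the state `s` with an information state such as a belief over network states, which this sketch does not attempt.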
Keywords
Markov processes, dynamic programming, game theory, stochastic programming, Markov structure, Markov zero-sum games, backward induction, decomposition method, decomposition techniques, finite games, game value, information state, military problems, multistage game, multistage network interdiction, nested information structure, network attacks, network security, network state, one-stage subgames, saddle point strategies, single stage optimization problems, single value function, stochastic dynamic programming