Fictitious Play in Extensive-Form Games of Imperfect Information
Fictitious Play in Extensive-Form Games of Imperfect Information
We study the long-term behavior of the fictitious play process in repeated extensive-form games of imperfect information with perfect recall. Each player maintains incorrect beliefs that the moves at all information sets, except the one at which the player is about to make a move, are made according to fixed random strategies, independently across all information sets. Accordingly, each player makes his moves at any of his information sets to maximize his expected payoff assuming that, at any other information set, the moves are made according to the empirical frequencies of the past moves. We extend the well-known Monderer-Shapley result [1] on the convergence of the empirical frequencies to the set of Nash equilibria to a certain class of extensive-form games with identical interests. We then strengthen this result by the use of inertia and fading memory, and prove the convergence of the realized play-paths to an essentially pure Nash equilibrium in all extensive-form games of imperfect information with identical interests.
Jason Castiglione、Gürdal Arslan
计算技术、计算机技术
Jason Castiglione,Gürdal Arslan.Fictitious Play in Extensive-Form Games of Imperfect Information[EB/OL].(2025-04-25)[2025-05-21].https://arxiv.org/abs/2504.18075.点此复制
评论