|国家预印本平台
首页|Higher-Order Uncoupled Learning Dynamics and Nash Equilibrium

Higher-Order Uncoupled Learning Dynamics and Nash Equilibrium

Higher-Order Uncoupled Learning Dynamics and Nash Equilibrium

来源:Arxiv_logoArxiv
英文摘要

We study learnability of mixed-strategy Nash Equilibrium (NE) in general finite games using higher-order replicator dynamics as well as classes of higher-order uncoupled heterogeneous dynamics. In higher-order uncoupled learning dynamics, players have no access to utilities of opponents (uncoupled) but are allowed to use auxiliary states to further process information (higher-order). We establish a link between uncoupled learning and feedback stabilization with decentralized control. Using this association, we show that for any finite game with an isolated completely mixed-strategy NE, there exist higher-order uncoupled learning dynamics that lead (locally) to that NE. We further establish the lack of universality of learning dynamics by linking learning to the control theoretic concept of simultaneous stabilization. We construct two games such that any higher-order dynamics that learn the completely mixed-strategy NE of one of these games can never learn the completely mixed-strategy NE of the other. Next, motivated by imposing natural restrictions on allowable learning dynamics, we introduce the Asymptotic Best Response (ABR) property. Dynamics with the ABR property asymptotically learn a best response in environments that are asymptotically stationary. We show that the ABR property relates to an internal stability condition on higher-order learning dynamics. We provide conditions under which NE are compatible with the ABR property. Finally, we address learnability of mixed-strategy NE in the bandit setting using a bandit version of higher-order replicator dynamics.

Sarah A. Toonsi、Jeff S. Shamma

数学

Sarah A. Toonsi,Jeff S. Shamma.Higher-Order Uncoupled Learning Dynamics and Nash Equilibrium[EB/OL].(2025-06-12)[2025-07-02].https://arxiv.org/abs/2506.10874.点此复制

评论