Meta-reinforcement learning with minimum attention
Meta-reinforcement learning with minimum attention
Minimum attention applies the least action principle in the changes of control concerning state and time, first proposed by Brockett. The involved regularization is highly relevant in emulating biological control, such as motor learning. We apply minimum attention in reinforcement learning (RL) as part of the rewards and investigate its connection to meta-learning and stabilization. Specifically, model-based meta-learning with minimum attention is explored in high-dimensional nonlinear dynamics. Ensemble-based model learning and gradient-based meta-policy learning are alternately performed. Empirically, we show that the minimum attention does show outperforming competence in comparison to the state-of-the-art algorithms in model-free and model-based RL, i.e., fast adaptation in few shots and variance reduction from the perturbations of the model and environment. Furthermore, the minimum attention demonstrates the improvement in energy efficiency.
Pilhwa Lee、Shashank Gupta
计算技术、计算机技术
Pilhwa Lee,Shashank Gupta.Meta-reinforcement learning with minimum attention[EB/OL].(2025-05-22)[2025-06-09].https://arxiv.org/abs/2505.16741.点此复制
评论