Quantum reinforcement learning in dynamic environments
Quantum reinforcement learning in dynamic environments
Combining quantum computing techniques in the form of amplitude amplification with classical reinforcement learning has led to the so-called "hybrid agent for quantum-accessible reinforcement learning", which achieves a quadratic speedup in sample complexity for certain learning problems. So far, this hybrid agent has only been applied to stationary learning problems, that is, learning problems without any time dependency within components of the Markov decision process. In this work, we investigate the applicability of the hybrid agent to dynamic RL environments. To this end, we enhance the hybrid agent by introducing a dissipation mechanism and, with the resulting learning agent, perform an empirical comparison with a classical RL agent in an RL environment with a time-dependent reward function. Our findings suggest that the modified hybrid agent can adapt its behavior to changes in the environment quickly, leading to a higher average success probability compared to its classical counterpart.
Oliver Sefrin、Manuel Radons、Lars Simon、Sabine Wölk
计算技术、计算机技术
Oliver Sefrin,Manuel Radons,Lars Simon,Sabine Wölk.Quantum reinforcement learning in dynamic environments[EB/OL].(2025-07-02)[2025-07-16].https://arxiv.org/abs/2507.01691.点此复制
评论