|国家预印本平台
首页|Hardware Co-Designed Optimal Control for Programmable Atomic Quantum Processors via Reinforcement Learning

Hardware Co-Designed Optimal Control for Programmable Atomic Quantum Processors via Reinforcement Learning

Hardware Co-Designed Optimal Control for Programmable Atomic Quantum Processors via Reinforcement Learning

来源:Arxiv_logoArxiv
英文摘要

Developing scalable, fault-tolerant atomic quantum processors requires precise control over large arrays of optical beams. This remains a major challenge due to inherent imperfections in classical control hardware, such as inter-channel crosstalk and beam leakage. In this work, we introduce a hardware co-designed intelligent quantum control framework to address these limitations. We construct a mathematical model of the photonic control hardware, integrate it into the quantum optimal control (QOC) framework, and apply reinforcement learning (RL) techniques to discover optimal control strategies. We demonstrate that the proposed framework enables robust, high-fidelity parallel single-qubit gate operations under realistic control conditions, where each atom is individually addressed by an optical beam. Specifically, we implement and benchmark three optimization strategies: a classical hybrid Self-Adaptive Differential Evolution-Adam (SADE-Adam) optimizer, a conventional RL approach based on Proximal Policy Optimization (PPO), and a novel end-to-end differentiable RL method. Using SADE-Adam as a baseline, we find that while PPO performance degrades as system complexity increases, the end-to-end differentiable RL consistently achieves gate fidelities above 99.9$\%$, exhibits faster convergence, and maintains robustness under varied channel crosstalk strength and randomized dynamic control imperfections.

Qian Ding、Dirk Englund

物理学计算技术、计算机技术

Qian Ding,Dirk Englund.Hardware Co-Designed Optimal Control for Programmable Atomic Quantum Processors via Reinforcement Learning[EB/OL].(2025-04-15)[2025-06-04].https://arxiv.org/abs/2504.11737.点此复制

评论