首页|Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management

Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management

来源：

英文摘要

The time allocation problem in multi-function cognitive radar systems focuses on the trade-off between scanning for newly emerging targets and tracking the previously detected targets. We formulate this as a multi-objective optimization problem and employ deep reinforcement learning to find Pareto-optimal solutions and compare deep deterministic policy gradient (DDPG) and soft actor-critic (SAC) algorithms. Our results demonstrate the effectiveness of both algorithms in adapting to various scenarios, with SAC showing improved stability and sample efficiency compared to DDPG. We further employ the NSGA-II algorithm to estimate an upper bound on the Pareto front of the considered problem. This work contributes to the development of more efficient and adaptive cognitive radar systems capable of balancing multiple competing objectives in dynamic environments.

作者：Ziyang Lu、Subodh Kalia、M. Cenk Gursoy、Chilukuri K. Mohan、Pramod K. Varshney

作者单位：

学科分类：航空航天技术航空航天计算技术、计算机技术

推荐引用：Ziyang Lu,Subodh Kalia,M. Cenk Gursoy,Chilukuri K. Mohan,Pramod K. Varshney.Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management[EB/OL].(2025-06-25)[2025-07-17].https://arxiv.org/abs/2506.20853.点此复制

Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management

Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management

评论