Learning to Communicate in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence
Popular methods in cooperative Multi-Agent Reinforcement Learning for partially observable environments typically have agents act independently during execution, which may limit the coordinated effect of the trained policies. Effective communication, such as sharing information about known or suspected ongoing threats, can instead lead to improved decision-making in the cyber battle space. We propose a game design in which defender agents learn to communicate and defend against imminent cyber threats by playing training games in the Cyber Operations Research Gym, using the Differentiable Inter Agent Learning algorithm adapted to the cyber operational environment. The tactical policies learned by these autonomous agents are akin to those human experts apply during incident response to avert cyber threats. In addition, the agents learn minimal-cost communication messages at the same time as their tactical defence policies.
Faizan Contractor, Li Li, Ranwa Al Mallah
Automation technology and automation equipment; computing technology and computer technology
Faizan Contractor, Li Li, Ranwa Al Mallah. Learning to Communicate in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence [EB/OL]. (2025-07-19) [2025-08-18]. https://arxiv.org/abs/2507.14658.