首页|Attackers Strike Back? Not Anymore -- An Ensemble of RL Defenders Awakens for APT Detection

Attackers Strike Back? Not Anymore -- An Ensemble of RL Defenders Awakens for APT Detection

来源：

英文摘要

Advanced Persistent Threats (APTs) represent a growing menace to modern digital infrastructure. Unlike traditional cyberattacks, APTs are stealthy, adaptive, and long-lasting, often bypassing signature-based detection systems. This paper introduces a novel framework for APT detection that unites deep learning, reinforcement learning (RL), and active learning into a cohesive, adaptive defense system. Our system combines auto-encoders for latent behavioral encoding with a multi-agent ensemble of RL-based defenders, each trained to distinguish between benign and malicious process behaviors. We identify a critical challenge in existing detection systems: their static nature and inability to adapt to evolving attack strategies. To this end, our architecture includes multiple RL agents (Q-Learning, PPO, DQN, adversarial defenders), each analyzing latent vectors generated by an auto-encoder. When any agent is uncertain about its decision, the system triggers an active learning loop to simulate expert feedback, thus refining decision boundaries. An ensemble voting mechanism, weighted by each agent's performance, ensures robust final predictions.

作者：Sidahmed Benabderrahmane、Talal Rahwan

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Sidahmed Benabderrahmane,Talal Rahwan.Attackers Strike Back? Not Anymore -- An Ensemble of RL Defenders Awakens for APT Detection[EB/OL].(2025-08-26)[2025-09-03].https://arxiv.org/abs/2508.19072.点此复制

Attackers Strike Back? Not Anymore -- An Ensemble of RL Defenders Awakens for APT Detection

Attackers Strike Back? Not Anymore -- An Ensemble of RL Defenders Awakens for APT Detection

评论