首页|COMPASS: Cooperative Multi-Agent Persistent Monitoring using Spatio-Temporal Attention Network

COMPASS: Cooperative Multi-Agent Persistent Monitoring using Spatio-Temporal Attention Network

来源：

英文摘要

Persistent monitoring of dynamic targets is essential in real-world applications such as disaster response, environmental sensing, and wildlife conservation, where mobile agents must continuously gather information under uncertainty. We propose COMPASS, a multi-agent reinforcement learning (MARL) framework that enables decentralized agents to persistently monitor multiple moving targets efficiently. We model the environment as a graph, where nodes represent spatial locations and edges capture topological proximity, allowing agents to reason over structured layouts and revisit informative regions as needed. Each agent independently selects actions based on a shared spatio-temporal attention network that we design to integrate historical observations and spatial context. We model target dynamics using Gaussian Processes (GPs), which support principled belief updates and enable uncertainty-aware planning. We train COMPASS using centralized value estimation and decentralized policy execution under an adaptive reward setting. Our extensive experiments demonstrate that COMPASS consistently outperforms strong baselines in uncertainty reduction, target coverage, and coordination efficiency across dynamic multi-target scenarios.

作者：Xingjian Zhang、Yizhuo Wang、Guillaume Sartoretti

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Xingjian Zhang,Yizhuo Wang,Guillaume Sartoretti.COMPASS: Cooperative Multi-Agent Persistent Monitoring using Spatio-Temporal Attention Network[EB/OL].(2025-07-22)[2025-08-10].https://arxiv.org/abs/2507.16306.点此复制

COMPASS: Cooperative Multi-Agent Persistent Monitoring using Spatio-Temporal Attention Network

COMPASS: Cooperative Multi-Agent Persistent Monitoring using Spatio-Temporal Attention Network

评论