EEvAct: Early Event-Based Action Recognition with High-Rate Two-Stream Spiking Neural Networks
EEvAct: Early Event-Based Action Recognition with High-Rate Two-Stream Spiking Neural Networks
Recognizing human activities early is crucial for the safety and responsiveness of human-robot and human-machine interfaces. Due to their high temporal resolution and low latency, event-based vision sensors are a perfect match for this early recognition demand. However, most existing processing approaches accumulate events to low-rate frames or space-time voxels which limits the early prediction capabilities. In contrast, spiking neural networks (SNNs) can process the events at a high-rate for early predictions, but most works still fall short on final accuracy. In this work, we introduce a high-rate two-stream SNN which closes this gap by outperforming previous work by 2% in final accuracy on the large-scale THU EACT-50 dataset. We benchmark the SNNs within a novel early event-based recognition framework by reporting Top-1 and Top-5 recognition scores for growing observation time. Finally, we exemplify the impact of these methods on a real-world task of early action triggering for human motion capture in sports.
Michael Neumeier、Jules Lecomte、Nils Kazinski、Soubarna Banik、Bing Li、Axel von Arnim
计算技术、计算机技术
Michael Neumeier,Jules Lecomte,Nils Kazinski,Soubarna Banik,Bing Li,Axel von Arnim.EEvAct: Early Event-Based Action Recognition with High-Rate Two-Stream Spiking Neural Networks[EB/OL].(2025-07-10)[2025-07-20].https://arxiv.org/abs/2507.07734.点此复制
评论