|国家预印本平台
首页|SPACT18: Spiking Human Action Recognition Benchmark Dataset with Complementary RGB and Thermal Modalities

SPACT18: Spiking Human Action Recognition Benchmark Dataset with Complementary RGB and Thermal Modalities

SPACT18: Spiking Human Action Recognition Benchmark Dataset with Complementary RGB and Thermal Modalities

来源:Arxiv_logoArxiv
英文摘要

Spike cameras, bio-inspired vision sensors, asynchronously fire spikes by accumulating light intensities at each pixel, offering ultra-high energy efficiency and exceptional temporal resolution. Unlike event cameras, which record changes in light intensity to capture motion, spike cameras provide even finer spatiotemporal resolution and a more precise representation of continuous changes. In this paper, we introduce the first video action recognition (VAR) dataset using spike camera, alongside synchronized RGB and thermal modalities, to enable comprehensive benchmarking for Spiking Neural Networks (SNNs). By preserving the inherent sparsity and temporal precision of spiking data, our three datasets offer a unique platform for exploring multimodal video understanding and serve as a valuable resource for directly comparing spiking, thermal, and RGB modalities. This work contributes a novel dataset that will drive research in energy-efficient, ultra-low-power video understanding, specifically for action recognition tasks using spike-based data.

Yasser Ashraf、Ahmed Sharshar、Velibor Bojkovic、Bin Gu

光电子技术

Yasser Ashraf,Ahmed Sharshar,Velibor Bojkovic,Bin Gu.SPACT18: Spiking Human Action Recognition Benchmark Dataset with Complementary RGB and Thermal Modalities[EB/OL].(2025-07-22)[2025-08-10].https://arxiv.org/abs/2507.16151.点此复制

评论