|国家预印本平台
首页|Multispectral Detection Transformer with Infrared-Centric Sensor Fusion

Multispectral Detection Transformer with Infrared-Centric Sensor Fusion

Multispectral Detection Transformer with Infrared-Centric Sensor Fusion

来源:Arxiv_logoArxiv
英文摘要

Multispectral object detection aims to leverage complementary information from visible (RGB) and infrared (IR) modalities to enable robust performance under diverse environmental conditions. In this letter, we propose IC-Fusion, a multispectral object detector that effectively fuses visible and infrared features through a lightweight and modalityaware design. Motivated by wavelet analysis and empirical observations, we find that IR images contain structurally rich high-frequency information critical for object localization, while RGB images provide complementary semantic context. To exploit this, we adopt a compact RGB backbone and design a novel fusion module comprising a Multi-Scale Feature Distillation (MSFD) block to enhance RGB features and a three-stage fusion block with Cross-Modal Channel Shuffle Gate (CCSG) and Cross-Modal Large Kernel Gate (CLKG) to facilitate effective cross-modal interaction. Experiments on the FLIR and LLVIP benchmarks demonstrate the effectiveness and efficiency of our IR-centric fusion strategy. Our code is available at https://github.com/smin-hwang/IC-Fusion.

Seongmin Hwang、Daeyoung Han、Moongu Jeon

光电子技术半导体技术电子技术概论电子技术应用

Seongmin Hwang,Daeyoung Han,Moongu Jeon.Multispectral Detection Transformer with Infrared-Centric Sensor Fusion[EB/OL].(2025-05-21)[2025-06-15].https://arxiv.org/abs/2505.15137.点此复制

评论