|国家预印本平台
首页|EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation

EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation

EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation

来源:Arxiv_logoArxiv
英文摘要

Convolutional neural networks have primarily led 3D medical image segmentation but may be limited by small receptive fields. Transformer models excel in capturing global relationships through self-attention but are challenged by high computational costs at high resolutions. Recently, Mamba, a state space model, has emerged as an effective approach for sequential modeling. Inspired by its success, we introduce a novel Mamba-based 3D medical image segmentation model called EM-Net. It not only efficiently captures attentive interaction between regions by integrating and selecting channels, but also effectively utilizes frequency domain to harmonize the learning of features across varying scales, while accelerating training speed. Comprehensive experiments on two challenging multi-organ datasets with other state-of-the-art (SOTA) algorithms show that our method exhibits better segmentation accuracy while requiring nearly half the parameter size of SOTA models and 2x faster training speed.

Jiajun Zeng、Dong Ni、Ruobing Huang、Ao Chang

医学研究方法计算技术、计算机技术

Jiajun Zeng,Dong Ni,Ruobing Huang,Ao Chang.EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation[EB/OL].(2024-09-26)[2025-08-02].https://arxiv.org/abs/2409.17675.点此复制

评论