|国家预印本平台
首页|Enhancing DR Classification with Swin Transformer and Shifted Window Attention

Enhancing DR Classification with Swin Transformer and Shifted Window Attention

Enhancing DR Classification with Swin Transformer and Shifted Window Attention

来源:Arxiv_logoArxiv
英文摘要

Diabetic retinopathy (DR) is a leading cause of blindness worldwide, underscoring the importance of early detection for effective treatment. However, automated DR classification remains challenging due to variations in image quality, class imbalance, and pixel-level similarities that hinder model training. To address these issues, we propose a robust preprocessing pipeline incorporating image cropping, Contrast-Limited Adaptive Histogram Equalization (CLAHE), and targeted data augmentation to improve model generalization and resilience. Our approach leverages the Swin Transformer, which utilizes hierarchical token processing and shifted window attention to efficiently capture fine-grained features while maintaining linear computational complexity. We validate our method on the Aptos and IDRiD datasets for multi-class DR classification, achieving accuracy rates of 89.65% and 97.40%, respectively. These results demonstrate the effectiveness of our model, particularly in detecting early-stage DR, highlighting its potential for improving automated retinal screening in clinical settings.

Meher Boulaabi、Takwa Ben A?cha Gader、Afef Kacem Echi、Zied Bouraoui

眼科学医学研究方法计算技术、计算机技术

Meher Boulaabi,Takwa Ben A?cha Gader,Afef Kacem Echi,Zied Bouraoui.Enhancing DR Classification with Swin Transformer and Shifted Window Attention[EB/OL].(2025-04-20)[2025-05-23].https://arxiv.org/abs/2504.15317.点此复制

评论