UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation
UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation
The integration of AI-assisted biomedical image analysis into clinical practice demands AI-generated findings that are not only accurate but also interpretable to clinicians. However, existing biomedical AI models generally lack the ability to simultaneously generate diagnostic findings and localize corresponding biomedical objects. This limitation makes it challenging for clinicians to correlate AI-generated findings with visual evidence (e.g., tiny lesions) in images and interpret the results of AI models. To address this challenge, we introduce UniBiomed, the first universal foundation model for grounded biomedical image interpretation, which is capable of generating accurate diagnostic findings and simultaneously segmenting the corresponding biomedical targets. UniBiomed is based on a novel integration of Multi-modal Large Language Model and Segment Anything Model, which can effectively unify diverse biomedical tasks in universal training for advancing grounded interpretation. To develop UniBiomed, we curate a large-scale dataset comprising over 27 million triplets of images, region annotations, and text descriptions across ten biomedical imaging modalities. Extensive validation on 70 internal and 14 external datasets demonstrated the state-of-the-art performance of UniBiomed in diverse biomedical tasks, including image segmentation, disease recognition, region-aware diagnosis, vision question answering, and report generation. In summary, UniBiomed is a powerful and versatile biomedical foundation model, unlocking the untapped grounded interpretation capability for optimizing AI-assisted biomedical image analysis.
Luyang Luo、Yuxiang Nie、Sunan He、Jiaxin Zhuang、Neeraj Mahboobani、Varut Vardhanabhuti、Ronald Cheong Kin Chan、Yifan Peng、Pranav Rajpurkar、Hao Chen、Linshan Wu
医学研究方法医学现状、医学发展基础医学计算技术、计算机技术
Luyang Luo,Yuxiang Nie,Sunan He,Jiaxin Zhuang,Neeraj Mahboobani,Varut Vardhanabhuti,Ronald Cheong Kin Chan,Yifan Peng,Pranav Rajpurkar,Hao Chen,Linshan Wu.UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation[EB/OL].(2025-04-30)[2025-06-14].https://arxiv.org/abs/2504.21336.点此复制
评论