Towards Artificial Intelligence Research Assistant for Expert-Involved Learning
Large Language Models (LLMs) and Large Multi-Modal Models (LMMs) have emerged as transformative tools in scientific research, yet their reliability and specific contributions to biomedical applications remain insufficiently characterized. In this study, we present ARtificial Intelligence research assistant for Expert-involved Learning (ARIEL), a multimodal dataset designed to benchmark and enhance two critical capabilities of LLMs and LMMs in biomedical research: summarizing extensive scientific texts and interpreting complex biomedical figures. To facilitate rigorous assessment, we create two open-source sets comprising biomedical articles and figures with designed questions. We systematically benchmark both open- and closed-source foundation models, incorporating human evaluations conducted by doctoral-level experts. Furthermore, we improve model performance through targeted prompt engineering and fine-tuning strategies for summarizing research papers, and apply test-time computational scaling to enhance the reasoning capabilities of LMMs, achieving superior accuracy compared to human-expert corrections. We also explore the potential of using LMM Agents to generate scientific hypotheses from diverse multimodal inputs. Overall, our results delineate clear strengths and highlight significant limitations of current foundation models, providing actionable insights and guiding future advancements in deploying large-scale language and multi-modal models within biomedical research.
Biqing Zhu, Pan Lu, Yuge Wang, Keyi Li, Rihao Qu, Jiapeng Chen, Yufeng Liu, Aviv Yaish, Xinyue Cui, Chuhan Li, Yuhang Chen, Kexing Li, Arman Cohan, Minsheng Hao, Hua Xu, Mark Gerstein, Hongyu Zhao, James Zou, Tianyu Liu, Simeng Han, Xiao Luo, Hanchen Wang
Biological science research methods and techniques; medical research methods
Biqing Zhu, Pan Lu, Yuge Wang, Keyi Li, Rihao Qu, Jiapeng Chen, Yufeng Liu, Aviv Yaish, Xinyue Cui, Chuhan Li, Yuhang Chen, Kexing Li, Arman Cohan, Minsheng Hao, Hua Xu, Mark Gerstein, Hongyu Zhao, James Zou, Tianyu Liu, Simeng Han, Xiao Luo, Hanchen Wang. Towards Artificial Intelligence Research Assistant for Expert-Involved Learning [EB/OL]. (2025-05-03) [2025-05-25]. https://arxiv.org/abs/2505.04638