|国家预印本平台
首页|MIR: Methodology Inspiration Retrieval for Scientific Research Problems

MIR: Methodology Inspiration Retrieval for Scientific Research Problems

MIR: Methodology Inspiration Retrieval for Scientific Research Problems

来源:Arxiv_logoArxiv
英文摘要

There has been a surge of interest in harnessing the reasoning capabilities of Large Language Models (LLMs) to accelerate scientific discovery. While existing approaches rely on grounding the discovery process within the relevant literature, effectiveness varies significantly with the quality and nature of the retrieved literature. We address the challenge of retrieving prior work whose concepts can inspire solutions for a given research problem, a task we define as Methodology Inspiration Retrieval (MIR). We construct a novel dataset tailored for training and evaluating retrievers on MIR, and establish baselines. To address MIR, we build the Methodology Adjacency Graph (MAG); capturing methodological lineage through citation relationships. We leverage MAG to embed an "intuitive prior" into dense retrievers for identifying patterns of methodological inspiration beyond superficial semantic similarity. This achieves significant gains of +5.4 in Recall@3 and +7.8 in Mean Average Precision (mAP) over strong baselines. Further, we adapt LLM-based re-ranking strategies to MIR, yielding additional improvements of +4.5 in Recall@3 and +4.8 in mAP. Through extensive ablation studies and qualitative analyses, we exhibit the promise of MIR in enhancing automated scientific discovery and outline avenues for advancing inspiration-driven retrieval.

Aniketh Garikaparthi、Manasi Patwardhan、Aditya Sanjiv Kanade、Aman Hassan、Lovekesh Vig、Arman Cohan

自然科学研究方法

Aniketh Garikaparthi,Manasi Patwardhan,Aditya Sanjiv Kanade,Aman Hassan,Lovekesh Vig,Arman Cohan.MIR: Methodology Inspiration Retrieval for Scientific Research Problems[EB/OL].(2025-05-30)[2025-06-30].https://arxiv.org/abs/2506.00249.点此复制

评论