|国家预印本平台
首页|BAGELS: Benchmarking the Automated Generation and Extraction of Limitations from Scholarly Text

BAGELS: Benchmarking the Automated Generation and Extraction of Limitations from Scholarly Text

BAGELS: Benchmarking the Automated Generation and Extraction of Limitations from Scholarly Text

来源:Arxiv_logoArxiv
英文摘要

In scientific research, limitations refer to the shortcomings, constraints, or weaknesses within a study. Transparent reporting of such limitations can enhance the quality and reproducibility of research and improve public trust in science. However, authors often a) underreport them in the paper text and b) use hedging strategies to satisfy editorial requirements at the cost of readers' clarity and confidence. This underreporting behavior, along with an explosion in the number of publications, has created a pressing need to automatically extract or generate such limitations from scholarly papers. In this direction, we present a complete architecture for the computational analysis of research limitations. Specifically, we create a dataset of limitations in ACL, NeurIPS, and PeerJ papers by extracting them from papers' text and integrating them with external reviews; we propose methods to automatically generate them using a novel Retrieval Augmented Generation (RAG) technique; we create a fine-grained evaluation framework for generated limitations; and we provide a meta-evaluation for the proposed evaluation techniques.

Ibrahim Al Azher、Miftahul Jannat Mokarrama、Zhishuai Guo、Sagnik Ray Choudhury、Hamed Alhoori

自然科学研究方法计算技术、计算机技术

Ibrahim Al Azher,Miftahul Jannat Mokarrama,Zhishuai Guo,Sagnik Ray Choudhury,Hamed Alhoori.BAGELS: Benchmarking the Automated Generation and Extraction of Limitations from Scholarly Text[EB/OL].(2025-05-22)[2025-07-02].https://arxiv.org/abs/2505.18207.点此复制

评论