When Deep Learning Meets Information Retrieval-based Bug Localization: A Survey
When Deep Learning Meets Information Retrieval-based Bug Localization: A Survey
Bug localization is a crucial aspect of software maintenance, running through the entire software lifecycle. Information retrieval-based bug localization (IRBL) identifies buggy code based on bug reports, expediting the bug resolution process for developers. Recent years have witnessed significant achievements in IRBL, propelled by the widespread adoption of deep learning (DL). To provide a comprehensive overview of the current state of the art and delve into key issues, we conduct a survey encompassing 61 IRBL studies leveraging DL. We summarize best practices in each phase of the IRBL workflow, undertake a meta-analysis of prior studies, and suggest future research directions. This exploration aims to guide further advancements in the field, fostering a deeper understanding and refining practices for effective bug localization. Our study suggests that the integration of DL in IRBL enhances the model's capacity to extract semantic and syntactic information from both bug reports and source code, addressing issues such as lexical gaps, neglect of code structure information, and cold-start problems. Future research avenues for IRBL encompass exploring diversity in programming languages, adopting fine-grained granularity, and focusing on real-world applications. Most importantly, although some studies have started using large language models for IRBL, there is still a need for more in-depth exploration and thorough investigation in this area.
Feifei Niu、Chuanyi Li、Kui Liu、Xin Xia、David Lo
计算技术、计算机技术
Feifei Niu,Chuanyi Li,Kui Liu,Xin Xia,David Lo.When Deep Learning Meets Information Retrieval-based Bug Localization: A Survey[EB/OL].(2025-04-30)[2025-06-30].https://arxiv.org/abs/2505.00144.点此复制
评论