What Happened in This Pipeline? Diffing Build Logs with CiDiff
What Happened in This Pipeline? Diffing Build Logs with CiDiff
Continuous integration (CI) is widely used by developers to ensure the quality and reliability of their software projects. However, diagnosing a CI regression is a tedious process that involves the manual analysis of lengthy build logs. In this paper, we explore how textual differencing can support the debugging of CI regressions. As off-the-shelf diff algorithms produce suboptimal results, in this work we introduce a new diff algorithm specifically tailored to build logs called CiDiff. We evaluate CiDiff against several baselines on a novel dataset of 17 906 CI regressions, performing an accuracy study, a quantitative study and a user-study. Notably, our algorithm reduces the number of lines to inspect by about 60 % in the median case, with reasonable overhead compared to the state-of-practice LCS-diff. Finally, our algorithm is preferred by the majority of participants in 70 % of the regression cases, whereas LCS-diff is preferred in only 5 % of the cases.
Nicolas Hubner、Jean-Rémy Falleri、Raluca Uricaru、Thomas Degueule、Thomas Durieux
LaBRILaBRILaBRI, CNRS, Bordeaux INP, UBLaBRISPIRALS
计算技术、计算机技术
Nicolas Hubner,Jean-Rémy Falleri,Raluca Uricaru,Thomas Degueule,Thomas Durieux.What Happened in This Pipeline? Diffing Build Logs with CiDiff[EB/OL].(2025-04-25)[2025-06-08].https://arxiv.org/abs/2504.18182.点此复制
评论