|国家预印本平台
首页|Exploring the Jupyter Ecosystem: An Empirical Study of Bugs and Vulnerabilities

Exploring the Jupyter Ecosystem: An Empirical Study of Bugs and Vulnerabilities

Exploring the Jupyter Ecosystem: An Empirical Study of Bugs and Vulnerabilities

来源:Arxiv_logoArxiv
英文摘要

Background. Jupyter notebooks are one of the main tools used by data scientists. Notebooks include features (configuration scripts, markdown, images, etc.) that make them challenging to analyze compared to traditional software. As a result, existing software engineering models, tools, and studies do not capture the uniqueness of Notebook's behavior. Aims. This paper aims to provide a large-scale empirical study of bugs and vulnerabilities in the Notebook ecosystem. Method. We collected and analyzed a large dataset of Notebooks from two major platforms. Our methodology involved quantitative analyses of notebook characteristics (such as complexity metrics, contributor activity, and documentation) to identify factors correlated with bugs. Additionally, we conducted a qualitative study using grounded theory to categorize notebook bugs, resulting in a comprehensive bug taxonomy. Finally, we analyzed security-related commits and vulnerability reports to assess risks associated with Notebook deployment frameworks. Results. Our findings highlight that configuration issues are among the most common bugs in notebook documents, followed by incorrect API usage. Finally, we explore common vulnerabilities associated with popular deployment frameworks to better understand risks associated with Notebook development. Conclusions. This work highlights that notebooks are less well-supported than traditional software, resulting in more complex code, misconfiguration, and poor maintenance.

Wenyuan Jiang、Diany Pressato、Harsh Darji、Thibaud Lutellier

计算技术、计算机技术

Wenyuan Jiang,Diany Pressato,Harsh Darji,Thibaud Lutellier.Exploring the Jupyter Ecosystem: An Empirical Study of Bugs and Vulnerabilities[EB/OL].(2025-07-24)[2025-08-18].https://arxiv.org/abs/2507.18833.点此复制

评论