|国家预印本平台
首页|Enhancing Computational Notebooks with Code+Data Space Versioning

Enhancing Computational Notebooks with Code+Data Space Versioning

Enhancing Computational Notebooks with Code+Data Space Versioning

来源:Arxiv_logoArxiv
英文摘要

There is a gap between how people explore data and how Jupyter-like computational notebooks are designed. People explore data nonlinearly, using execution undos, branching, and/or complete reverts, whereas notebooks are designed for sequential exploration. Recent works like ForkIt are still insufficient to support these multiple modes of nonlinear exploration in a unified way. In this work, we address the challenge by introducing two-dimensional code+data space versioning for computational notebooks and verifying its effectiveness using our prototype system, Kishuboard, which integrates with Jupyter. By adjusting code and data knobs, users of Kishuboard can intuitively manage the state of computational notebooks in a flexible way, thereby achieving both execution rollbacks and checkouts across complex multi-branch exploration history. Moreover, this two-dimensional versioning mechanism can easily be presented along with a friendly one-dimensional history. Human subject studies indicate that Kishuboard significantly enhances user productivity in various data science tasks.

Hanxi Fang、Supawit Chockchowwat、Hari Sundaram、Yongjoo Park

10.1145/3706598.3714141

计算技术、计算机技术

Hanxi Fang,Supawit Chockchowwat,Hari Sundaram,Yongjoo Park.Enhancing Computational Notebooks with Code+Data Space Versioning[EB/OL].(2025-04-02)[2025-05-01].https://arxiv.org/abs/2504.01367.点此复制

评论