首页|Piecewise Linear Approximation in Learned Index Structures: Theoretical and Empirical Analysis

Piecewise Linear Approximation in Learned Index Structures: Theoretical and Empirical Analysis

来源：

英文摘要

A growing trend in the database and system communities is to augment conventional index structures, such as B+-trees, with machine learning (ML) models. Among these, error-bounded Piecewise Linear Approximation ($Îµ$-PLA) has emerged as a popular choice due to its simplicity and effectiveness. Despite its central role in many learned indexes, the design and analysis of $Îµ$-PLA fitting algorithms remain underexplored. In this paper, we revisit $Îµ$-PLA from both theoretical and empirical perspectives, with a focus on its application in learned index structures. We first establish a fundamentally improved lower bound of $Î©(Îº\cdot Îµ^2)$ on the expected segment coverage for existing $Îµ$-PLA fitting algorithms, where $Îº$ is a data-dependent constant. We then present a comprehensive benchmark of state-of-the-art $Îµ$-PLA algorithms when used in different learned data structures. Our results highlight key trade-offs among model accuracy, model size, and query performance, providing actionable guidelines for the principled design of future learned data structures.

作者：Jiayong Qin、Xianyu Zhu、Qiyu Liu、Guangyi Zhang、Zhigang Cai、Jianwei Liao、Sha Hu、Jingshu Peng、Yingxia Shao、Lei Chen

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Jiayong Qin,Xianyu Zhu,Qiyu Liu,Guangyi Zhang,Zhigang Cai,Jianwei Liao,Sha Hu,Jingshu Peng,Yingxia Shao,Lei Chen.Piecewise Linear Approximation in Learned Index Structures: Theoretical and Empirical Analysis[EB/OL].(2025-06-25)[2025-07-21].https://arxiv.org/abs/2506.20139.点此复制

Piecewise Linear Approximation in Learned Index Structures: Theoretical and Empirical Analysis

Piecewise Linear Approximation in Learned Index Structures: Theoretical and Empirical Analysis

评论