首页|Pure Exploration with Infinite Answers

Pure Exploration with Infinite Answers

来源：

英文摘要

We study pure exploration problems where the set of correct answers is possibly infinite, e.g., the regression of any continuous function of the means of the bandit. We derive an instance-dependent lower bound for these problems. By analyzing it, we discuss why existing methods (i.e., Sticky Track-and-Stop) for finite answer problems fail at being asymptotically optimal in this more general setting. Finally, we present a framework, Sticky-Sequence Track-and-Stop, which generalizes both Track-and-Stop and Sticky Track-and-Stop, and that enjoys asymptotic optimality. Due to its generality, our analysis also highlights special cases where existing methods enjoy optimality.

作者：Riccardo Poiani、Martino Bernasconi、Andrea Celli

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Riccardo Poiani,Martino Bernasconi,Andrea Celli.Pure Exploration with Infinite Answers[EB/OL].(2025-05-28)[2025-06-18].https://arxiv.org/abs/2505.22473.点此复制

Pure Exploration with Infinite Answers

Pure Exploration with Infinite Answers

评论