首页|Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

来源：

英文摘要

Gaussian process upper confidence bound (GP-UCB) is a theoretically established algorithm for Bayesian optimization (BO), where we assume the objective function $f$ follows a GP. One notable drawback of GP-UCB is that the theoretical confidence parameter $Î²$ increases along with the iterations and is too large. To alleviate this drawback, this paper analyzes the randomized variant of GP-UCB called improved randomized GP-UCB (IRGP-UCB), which uses the confidence parameter generated from the shifted exponential distribution. We analyze the expected regret and conditional expected regret, where the expectation and the probability are taken respectively with $f$ and noise and with the randomness of the BO algorithm. In both regret analyses, IRGP-UCB achieves a sub-linear regret upper bound without increasing the confidence parameter if the input domain is finite. Furthermore, we show that randomization plays a key role in avoiding an increase in confidence parameter by showing that GP-UCB using a constant confidence parameter can incur linearly growing expected cumulative regret. Finally, we show numerical experiments using synthetic and benchmark functions and real-world emulators.

作者：Shion Takeno、Yu Inatsu、Masayuki Karasuyama

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Shion Takeno,Yu Inatsu,Masayuki Karasuyama.Regret Analysis for Randomized Gaussian Process Upper Confidence Bound[EB/OL].(2025-07-16)[2025-08-02].https://arxiv.org/abs/2409.00979.点此复制

Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

评论