Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems
Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems
Follow-the-Regularized-Leader (FTRL) policies have achieved Best-of-Both-Worlds (BOBW) results in various settings through hybrid regularizers, whereas analogous results for Follow-the-Perturbed-Leader (FTPL) remain limited due to inherent analytical challenges. To advance the analytical foundations of FTPL, we revisit classical FTRL-FTPL duality for unbounded perturbations and establish BOBW results for FTPL under a broad family of asymmetric unbounded Fréchet-type perturbations, including hybrid perturbations combining Gumbel-type and Fréchet-type tails. These results not only extend the BOBW results of FTPL but also offer new insights into designing alternative FTPL policies competitive with hybrid regularization approaches. Motivated by earlier observations in two-armed bandits, we further investigate the connection between the $1/2$-Tsallis entropy and a Fréchet-type perturbation. Our numerical observations suggest that it corresponds to a symmetric Fréchet-type perturbation, and based on this, we establish the first BOBW guarantee for symmetric unbounded perturbations in the two-armed setting. In contrast, in general multi-armed bandits, we find an instance in which symmetric Fréchet-type perturbations violate the key condition for standard BOBW analysis, which is a problem not observed with asymmetric or nonnegative Fréchet-type perturbations. Although this example does not rule out alternative analyses achieving BOBW results, it suggests the limitations of directly applying the relationship observed in two-armed cases to the general case and thus emphasizes the need for further investigation to fully understand the behavior of FTPL in broader settings.
Jongyeong Lee、Junya Honda、Shinji Ito、Min-hwan Oh
计算技术、计算机技术
Jongyeong Lee,Junya Honda,Shinji Ito,Min-hwan Oh.Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems[EB/OL].(2025-08-26)[2025-09-10].https://arxiv.org/abs/2508.18604.点此复制
评论