首页|Best Group Identification in Multi-Objective Bandits

Best Group Identification in Multi-Objective Bandits

来源：

英文摘要

We introduce the Best Group Identification problem in a multi-objective multi-armed bandit setting, where an agent interacts with groups of arms with vector-valued rewards. The performance of a group is determined by an efficiency vector which represents the group's best attainable rewards across different dimensions. The objective is to identify the set of optimal groups in the fixed-confidence setting. We investigate two key formulations: group Pareto set identification, where efficiency vectors of optimal groups are Pareto optimal and linear best group identification, where each reward dimension has a known weight and the optimal group maximizes the weighted sum of its efficiency vector's entries. For both settings, we propose elimination-based algorithms, establish upper bounds on their sample complexity, and derive lower bounds that apply to any correct algorithm. Through numerical experiments, we demonstrate the strong empirical performance of the proposed algorithms.

作者：Mohammad Shahverdikondori、Mohammad Reza Badri、Negar Kiyavash

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Mohammad Shahverdikondori,Mohammad Reza Badri,Negar Kiyavash.Best Group Identification in Multi-Objective Bandits[EB/OL].(2025-05-23)[2025-06-25].https://arxiv.org/abs/2505.17869.点此复制

Best Group Identification in Multi-Objective Bandits

Best Group Identification in Multi-Objective Bandits

评论