Minimax and Bayes Optimal Best-arm Identification: Adaptive Experimental Design for Treatment Choice

Source: arXiv
Abstract

This study investigates adaptive experimental design for treatment choice, also known as fixed-budget best-arm identification. We consider an adaptive procedure consisting of a treatment-allocation phase followed by a treatment-choice phase, and we design an adaptive experiment for this setup to efficiently identify the best treatment arm, defined as the one with the highest expected outcome. In our designed experiment, the treatment-allocation phase consists of two stages. The first stage is a pilot phase, where we allocate the treatment arms in equal proportions to eliminate clearly suboptimal arms and estimate outcome variances. In the second stage, we allocate treatment arms in proportion to the variances estimated in the first stage. After the treatment-allocation phase, the procedure enters the treatment-choice phase, where we choose the treatment arm with the highest sample mean as our estimate of the best treatment arm. We prove that this single design is simultaneously asymptotically minimax and Bayes optimal for the simple regret, with upper bounds that match our lower bounds up to exact constants. Therefore, our designed experiment achieves the sharp efficiency limits without requiring separate tuning for minimax and Bayesian objectives.
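The two-stage procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the Gaussian outcome model, the function name `two_stage_experiment`, the parameters `pilot_fraction` and `gap_threshold`, the standard-error elimination rule, and the restriction of the final choice to surviving arms are all assumptions made for illustration. Only the overall shape (equal-proportion pilot, elimination, variance-proportional allocation, choice by highest sample mean) follows the abstract.

```python
import random
import statistics


def two_stage_experiment(arms, budget, pilot_fraction=0.2, gap_threshold=2.0, seed=0):
    """Illustrative two-stage allocation followed by treatment choice.

    arms: list of (mean, std) pairs for hypothetical Gaussian outcomes.
    pilot_fraction and gap_threshold are assumed tuning values,
    not quantities taken from the paper.
    """
    rng = random.Random(seed)
    k = len(arms)
    samples = [[] for _ in range(k)]

    def draw(a):
        mean, std = arms[a]
        samples[a].append(rng.gauss(mean, std))

    # Stage 1 (pilot): allocate the arms in equal proportions.
    pilot_per_arm = max(2, int(pilot_fraction * budget) // k)
    for a in range(k):
        for _ in range(pilot_per_arm):
            draw(a)

    means = [statistics.fmean(s) for s in samples]
    variances = [statistics.variance(s) for s in samples]

    # Assumed elimination rule: drop an arm whose sample mean trails the
    # leader by more than gap_threshold standard errors of the difference.
    leader = max(range(k), key=lambda a: means[a])
    survivors = []
    for a in range(k):
        se = ((variances[a] + variances[leader]) / pilot_per_arm) ** 0.5
        if means[leader] - means[a] <= gap_threshold * se:
            survivors.append(a)

    # Stage 2: spend the remaining budget on surviving arms in proportion
    # to the variances estimated in the pilot.
    remaining = budget - k * pilot_per_arm
    total_var = sum(variances[a] for a in survivors)
    for a in survivors:
        share = variances[a] / total_var if total_var > 0 else 1 / len(survivors)
        for _ in range(int(remaining * share)):
            draw(a)

    # Treatment choice: highest sample mean (here, among surviving arms).
    final_means = {a: statistics.fmean(samples[a]) for a in survivors}
    best = max(final_means, key=final_means.get)
    counts = [len(s) for s in samples]
    return best, counts


# Hypothetical example: three arms, the third clearly best but noisiest.
best_arm, pulls = two_stage_experiment([(0.0, 1.0), (0.2, 1.0), (3.0, 2.0)], budget=3000)
```

In this sketch the noisier surviving arms receive more second-stage samples, which is the qualitative behaviour the abstract describes; the optimality guarantees stated there apply to the paper's design, not to this simplified version.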

Masahiro Kato

Mathematics

Masahiro Kato. Minimax and Bayes Optimal Best-arm Identification: Adaptive Experimental Design for Treatment Choice [EB/OL]. (2025-07-06) [2025-07-16]. https://arxiv.org/abs/2506.24007.
