Claudio Mayrink Verdun Alex Oesterling Himabindu Lakkaraju Flavio P. Calmon
作者信息
引用本文复制引用
Claudio Mayrink Verdun,Alex Oesterling,Himabindu Lakkaraju,Flavio P. Calmon.Soft Best-of-n Sampling for Model Alignment[EB/OL].(2025-05-06)[2025-12-13].https://arxiv.org/abs/2505.03156.
评论