A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future
Reward Models (RMs) have demonstrated impressive potential for enhancing Large Language Models (LLMs), as RMs can serve as proxies for human preferences, providing signals that guide LLMs' behavior across various tasks. In this paper, we provide a comprehensive overview of relevant research, exploring RMs from the perspectives of preference collection, reward modeling, and usage. Next, we introduce the applications of RMs and discuss benchmarks for their evaluation. Furthermore, we conduct an in-depth analysis of the challenges in the field and examine potential research directions. This paper aims to give beginners a comprehensive introduction to RMs and to facilitate future studies. The resources are publicly available on GitHub: https://github.com/JLZhong23/awesome-reward-models.
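As a concrete illustration of how an RM provides such preference signals (a minimal sketch of the common Bradley-Terry pairwise objective, not a method specific to this survey; it assumes PyTorch and a model that already maps a prompt-response pair to a scalar reward):

```python
import torch
import torch.nn.functional as F

def pairwise_reward_loss(chosen_rewards: torch.Tensor,
                         rejected_rewards: torch.Tensor) -> torch.Tensor:
    """Bradley-Terry pairwise loss: push the reward of the preferred
    (chosen) response above the reward of the rejected response."""
    # -log sigmoid(r_chosen - r_rejected), averaged over the batch
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Toy usage: scalar rewards for a batch of three preference pairs
chosen = torch.tensor([1.2, 0.3, 2.0])    # rewards for human-preferred responses
rejected = torch.tensor([0.5, 0.7, 1.1])  # rewards for rejected responses
loss = pairwise_reward_loss(chosen, rejected)
print(loss.item())
```

The resulting scalar rewards can then be used as the optimization signal for the LLM, e.g. in RLHF-style fine-tuning or for ranking candidate responses.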
Jialun Zhong, Wei Shen, Yanzeng Li, Songyang Gao, Hua Lu, Yicheng Chen, Yang Zhang, Wei Zhou, Jinjie Gu, Lei Zou
Subject: Computing Technology; Computer Technology
Jialun Zhong, Wei Shen, Yanzeng Li, Songyang Gao, Hua Lu, Yicheng Chen, Yang Zhang, Wei Zhou, Jinjie Gu, Lei Zou. A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future [EB/OL]. (2025-04-12) [2025-04-26]. https://arxiv.org/abs/2504.12328.