|国家预印本平台
| 注册
首页|Reinforcement Learning from Human Feedback

Reinforcement Learning from Human Feedback

Nathan Lambert

Arxiv_logoArxiv

Reinforcement Learning from Human Feedback

Nathan Lambert

作者信息

引用本文复制引用

Nathan Lambert.Reinforcement Learning from Human Feedback[EB/OL].(2025-11-02)[2025-12-13].https://arxiv.org/abs/2504.12501.

学科分类

计算技术、计算机技术

评论

首发时间 2025-11-02
下载量:0
|
点击量:6
段落导航相关论文