LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making
Abilities
Thomas Schmied J?rg Bornschein Jordi Grau-Moya Markus Wulfmeier Razvan Pascanu
作者信息
引用本文复制引用
Thomas Schmied,J?rg Bornschein,Jordi Grau-Moya,Markus Wulfmeier,Razvan Pascanu.LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making
Abilities[EB/OL].(2025-04-22)[2025-12-13].https://arxiv.org/abs/2504.16078.
评论