|国家预印本平台
| 注册
首页|UFT: Unifying Supervised and Reinforcement Fine-Tuning

UFT: Unifying Supervised and Reinforcement Fine-Tuning

Asuman Ozdaglar Mingyang Liu Gabriele Farina

Arxiv_logoArxiv

UFT: Unifying Supervised and Reinforcement Fine-Tuning

Asuman Ozdaglar Mingyang Liu Gabriele Farina

作者信息

引用本文复制引用

Asuman Ozdaglar,Mingyang Liu,Gabriele Farina.UFT: Unifying Supervised and Reinforcement Fine-Tuning[EB/OL].(2025-10-19)[2025-12-13].https://arxiv.org/abs/2505.16984.

学科分类

计算技术、计算机技术

评论

首发时间 2025-10-19
下载量:0
|
点击量:1
段落导航相关论文