Policy Gradient with Tree Search: Avoiding Local Optimas through
Lookahead
Uri Koren Navdeep Kumar Uri Gadot Giorgia Ramponi Kfir Yehuda Levy Shie Mannor
作者信息
引用本文复制引用
Uri Koren,Navdeep Kumar,Uri Gadot,Giorgia Ramponi,Kfir Yehuda Levy,Shie Mannor.Policy Gradient with Tree Search: Avoiding Local Optimas through
Lookahead[EB/OL].(2025-06-08)[2025-12-13].https://arxiv.org/abs/2506.07054.
评论