Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach

Abstract

Offline reinforcement learning (RL) enables policy optimization using static datasets, avoiding the risks and costs of extensive real-world exploration. However, it struggles with suboptimal offline behaviors and inaccurate value estimation due to the lack of environmental interaction. We present Video-Enhanced Offline RL (VeoRL), a model-based method that constructs an interactive world model from diverse, unlabeled video data readily available online. Leveraging model-based behavior guidance, our approach transfers commonsense knowledge of control policy and physical dynamics from natural videos to the RL agent within the target domain. VeoRL achieves substantial performance gains (over 100% in some cases) across visual control tasks in robotic manipulation, autonomous driving, and open-world video games.

Authors: Minting Pan, Yitao Zheng, Jiajian Li, Yunbo Wang, Xiaokang Yang

Subject classification: Computing technology, computer technology

Recommended citation: Minting Pan, Yitao Zheng, Jiajian Li, Yunbo Wang, Xiaokang Yang. Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach [EB/OL]. (2025-05-09) [2025-06-06]. https://arxiv.org/abs/2505.06482.
