|国家预印本平台
首页|Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs

Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs

Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs

来源:Arxiv_logoArxiv
英文摘要

Household robots have been a longstanding research topic, but they still lack human-like intelligence, particularly in manipulating open-set objects and navigating large environments efficiently and accurately. To push this boundary, we consider a generalized multi-object collection problem in large scene graphs, where the robot needs to pick up and place multiple objects across multiple locations in a long mission of multiple human commands. This problem is extremely challenging since it requires long-horizon planning in a vast action-state space under high uncertainties. To this end, we propose a novel interleaved LLM and motion planning algorithm Inter-LLM. By designing a multimodal action cost similarity function, our algorithm can both reflect the history and look into the future to optimize plans, striking a good balance of quality and efficiency. Simulation experiments demonstrate that compared with latest works, our algorithm improves the overall mission performance by 30% in terms of fulfilling human commands, maximizing mission success rates, and minimizing mission costs.

Ruochu Yang、Yu Zhou、Fumin Zhang、Mengxue Hou

计算技术、计算机技术

Ruochu Yang,Yu Zhou,Fumin Zhang,Mengxue Hou.Interleaved LLM and Motion Planning for Generalized Multi-Object Collection in Large Scene Graphs[EB/OL].(2025-07-21)[2025-08-10].https://arxiv.org/abs/2507.15782.点此复制

评论