VL-Explore: Zero-shot Vision-Language Exploration and Target Discovery by Mobile Robots

Source: arXiv
Abstract

Vision-language navigation (VLN) has emerged as a promising paradigm, enabling mobile robots to perform zero-shot inference and execute tasks without specific pre-programming. However, current systems often separate map exploration and path planning, with exploration relying on inefficient algorithms due to limited (partially observed) environmental information. In this paper, we present a novel navigation pipeline named "VL-Explore" for simultaneous exploration and target discovery in unknown environments, leveraging the capabilities of a vision-language model named CLIP. Our approach requires only monocular vision and operates without any prior map or knowledge about the target. For comprehensive evaluations, we designed a functional prototype of a UGV (unmanned ground vehicle) system named "Open Rover", a customized platform for general-purpose VLN tasks. We integrated and deployed the VL-Explore pipeline on Open Rover to evaluate its throughput, obstacle avoidance capability, and trajectory performance across various real-world scenarios. Experimental results demonstrate that VL-Explore consistently outperforms traditional map-traversal algorithms and achieves performance comparable to path-planning methods that depend on prior map and target knowledge. Notably, VL-Explore offers real-time active navigation without requiring pre-captured candidate images or pre-built node graphs, addressing key limitations of existing VLN pipelines.

Yuxuan Zhang, Adnan Abdullah, Sanjeev J. Koppal, Md Jahidul Islam

Automation technology and equipment; computing and computer technology

Yuxuan Zhang, Adnan Abdullah, Sanjeev J. Koppal, Md Jahidul Islam. VL-Explore: Zero-shot Vision-Language Exploration and Target Discovery by Mobile Robots [EB/OL]. (2025-07-22) [2025-08-23]. https://arxiv.org/abs/2502.08791
