|国家预印本平台
首页|Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning

Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning

Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning

来源:Arxiv_logoArxiv
英文摘要

Building models responsive to input prompts represents a transformative shift in machine learning. This paradigm holds significant potential for robotics problems, such as targeted manipulation amidst clutter. In this work, we present a novel approach to combine promptable foundation models with reinforcement learning (RL), enabling robots to perform dexterous manipulation tasks in a prompt-responsive manner. Existing methods struggle to link high-level commands with fine-grained dexterous control. We address this gap with a memory-augmented student-teacher learning framework. We use the Segment-Anything 2 (SAM 2) model as a perception backbone to infer an object of interest from user prompts. While detections are imperfect, their temporal sequence provides rich information for implicit state estimation by memory-augmented models. Our approach successfully learns prompt-responsive policies, demonstrated in picking objects from cluttered scenes. Videos and code are available at https://memory-student-teacher.github.io

Malte Mosbach、Sven Behnke

计算技术、计算机技术

Malte Mosbach,Sven Behnke.Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning[EB/OL].(2025-05-04)[2025-07-16].https://arxiv.org/abs/2505.02232.点此复制

评论