|国家预印本平台
首页|Guided by Guardrails: Control Barrier Functions as Safety Instructors for Robotic Learning

Guided by Guardrails: Control Barrier Functions as Safety Instructors for Robotic Learning

Guided by Guardrails: Control Barrier Functions as Safety Instructors for Robotic Learning

来源:Arxiv_logoArxiv
英文摘要

Safety stands as the primary obstacle preventing the widespread adoption of learning-based robotic systems in our daily lives. While reinforcement learning (RL) shows promise as an effective robot learning paradigm, conventional RL frameworks often model safety by using single scalar negative rewards with immediate episode termination, failing to capture the temporal consequences of unsafe actions (e.g., sustained collision damage). In this work, we introduce a novel approach that simulates these temporal effects by applying continuous negative rewards without episode termination. Our experiments reveal that standard RL methods struggle with this model, as the accumulated negative values in unsafe zones create learning barriers. To address this challenge, we demonstrate how Control Barrier Functions (CBFs), with their proven safety guarantees, effectively help robots avoid catastrophic regions while enhancing learning outcomes. We present three CBF-based approaches, each integrating traditional RL methods with Control Barrier Functions, guiding the agent to learn safe behavior. Our empirical analysis, conducted in both simulated environments and real-world settings using a four-wheel differential drive robot, explores the possibilities of employing these approaches for safe robotic learning.

Maeva Guerrier、Karthik Soma、Hassan Fouad、Giovanni Beltrame

安全科学自动化技术、自动化技术设备

Maeva Guerrier,Karthik Soma,Hassan Fouad,Giovanni Beltrame.Guided by Guardrails: Control Barrier Functions as Safety Instructors for Robotic Learning[EB/OL].(2025-05-24)[2025-06-24].https://arxiv.org/abs/2505.18858.点此复制

评论