首页|Learning Verifiable Control Policies Using Relaxed Verification

Learning Verifiable Control Policies Using Relaxed Verification

来源：

英文摘要

To provide safety guarantees for learning-based control systems, recent work has developed formal verification methods to apply after training ends. However, if the trained policy does not meet the specifications, or there is conservatism in the verification algorithm, establishing these guarantees may not be possible. Instead, this work proposes to perform verification throughout training to ultimately aim for policies whose properties can be evaluated throughout runtime with lightweight, relaxed verification algorithms. The approach is to use differentiable reachability analysis and incorporate new components into the loss function. Numerical experiments on a quadrotor model and unicycle model highlight the ability of this approach to lead to learned control policies that satisfy desired reach-avoid and invariance specifications.

作者：Puja Chaudhury、Alexander Estornell、Michael Everett

作者单位：

学科分类：自动化基础理论自动化技术、自动化技术设备

推荐引用：Puja Chaudhury,Alexander Estornell,Michael Everett.Learning Verifiable Control Policies Using Relaxed Verification[EB/OL].(2025-04-23)[2025-06-06].https://arxiv.org/abs/2504.16879.点此复制

Learning Verifiable Control Policies Using Relaxed Verification

Learning Verifiable Control Policies Using Relaxed Verification

评论