首页|Neural Certificates for Safe Control Policies

Neural Certificates for Safe Control Policies

来源：

英文摘要

This paper develops an approach to learn a policy of a dynamical system that is guaranteed to be both provably safe and goal-reaching. Here, the safety means that a policy must not drive the state of the system to any unsafe region, while the goal-reaching requires the trajectory of the controlled system asymptotically converges to a goal region (a generalization of stability). We obtain the safe and goal-reaching policy by jointly learning two additional certificate functions: a barrier function that guarantees the safety and a developed Lyapunov-like function to fulfill the goal-reaching requirement, both of which are represented by neural networks. We show the effectiveness of the method to learn both safe and goal-reaching policies on various systems, including pendulums, cart-poles, and UAVs.

作者：Zhuoran Yang、Shaoshuai Mou、Wanxin Jin、Zhaoran Wang

作者单位：

学科分类：自动化技术、自动化技术设备计算技术、计算机技术航空航天技术

推荐引用：Zhuoran Yang,Shaoshuai Mou,Wanxin Jin,Zhaoran Wang.Neural Certificates for Safe Control Policies[EB/OL].(2020-06-15)[2025-08-02].https://arxiv.org/abs/2006.08465.点此复制

Neural Certificates for Safe Control Policies

Neural Certificates for Safe Control Policies

评论