Neural Certificates for Safe Control Policies
Neural Certificates for Safe Control Policies
This paper develops an approach to learn a policy of a dynamical system that is guaranteed to be both provably safe and goal-reaching. Here, the safety means that a policy must not drive the state of the system to any unsafe region, while the goal-reaching requires the trajectory of the controlled system asymptotically converges to a goal region (a generalization of stability). We obtain the safe and goal-reaching policy by jointly learning two additional certificate functions: a barrier function that guarantees the safety and a developed Lyapunov-like function to fulfill the goal-reaching requirement, both of which are represented by neural networks. We show the effectiveness of the method to learn both safe and goal-reaching policies on various systems, including pendulums, cart-poles, and UAVs.
Zhuoran Yang、Shaoshuai Mou、Wanxin Jin、Zhaoran Wang
自动化技术、自动化技术设备计算技术、计算机技术航空航天技术
Zhuoran Yang,Shaoshuai Mou,Wanxin Jin,Zhaoran Wang.Neural Certificates for Safe Control Policies[EB/OL].(2020-06-15)[2025-08-02].https://arxiv.org/abs/2006.08465.点此复制
评论