Controlled Reach-avoid Set Computation for Discrete-time Polynomial Systems via Convex Optimization
Controlled Reach-avoid Set Computation for Discrete-time Polynomial Systems via Convex Optimization
This paper addresses the computation of controlled reach-avoid sets (CRASs) for discrete-time polynomial systems subject to control inputs. A CRAS is a set encompassing initial states from which there exist control inputs driving the system into a target set while avoiding unsafe sets. However, efficiently computing CRASs remains an open problem, especially for discrete-time systems. In this paper, we propose a novel framework for computing CRASs which takes advantage of a probabilistic perspective. This framework transforms the fundamentally nonlinear problem of computing CRASs into a computationally tractable convex optimization problem. By regarding control inputs as disturbances obeying certain probability distributions, a CRAS can be equivalently treated as a 0-reach-avoid set in the probabilistic sense, which consists of initial states from which the probability of eventually entering the target set while remaining within the safe set is greater than zero. Thus, we can employ the convex optimization method of computing 0-reach-avoid sets to estimate CRASs. Furthermore, inspired by the $\epsilon$-greedy strategy widely used in reinforcement learning, we propose an approach that iteratively updates the aforementioned probability distributions imposed on control inputs to compute larger CRASs. We demonstrate the effectiveness of the proposed method on extensive examples.
Taoran Wu、Yiling Xue、Dejin Ren、Arvind Easwaran、Martin Fr?nzle、Bai Xue
自动化基础理论
Taoran Wu,Yiling Xue,Dejin Ren,Arvind Easwaran,Martin Fr?nzle,Bai Xue.Controlled Reach-avoid Set Computation for Discrete-time Polynomial Systems via Convex Optimization[EB/OL].(2025-06-07)[2025-06-17].https://arxiv.org/abs/2506.06679.点此复制
评论