Computing stabilizing feedback gains for stochastic linear systems via policy iteration method
In recent years, stabilizing unknown dynamical systems has become a critical problem in control systems engineering. Addressing it for linear time-invariant (LTI) systems is an essential first step towards solving similar problems for more complex systems. In this paper, we develop a model-free reinforcement learning algorithm to compute stabilizing feedback gains for stochastic LTI systems with unknown system matrices. The algorithm proceeds by solving a series of discounted stochastic linear quadratic (SLQ) optimal control problems via policy iteration (PI). The corresponding discount factor is decreased gradually according to an explicit rule, which is derived from an equivalent condition for verifying stabilizability. We prove that this method returns a stabilizer after finitely many steps. Finally, a numerical example is provided to illustrate the effectiveness of the proposed method.
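The idea of lowering the discount while running policy iteration can be illustrated with a small sketch. It is not the paper's algorithm. It is model-based (it uses the system matrices, whereas the paper is model-free), and it assumes continuous-time Itô dynamics dx = (Ax + Bu)dt + (Cx + Du)dW with a discounted cost weighted by e^{-2αt}; the abstract does not state this form. The decrement rule for α is a simple sufficient Lyapunov condition chosen for illustration, not the explicit rule derived in the paper, and all function names and the example matrices are hypothetical.

```python
import numpy as np

def lyap_operator(A, C):
    """Matrix of the map P -> A^T P + P A + C^T P C acting on vec(P)."""
    I = np.eye(A.shape[0])
    return np.kron(A.T, I) + np.kron(I, A.T) + np.kron(C.T, C.T)

def is_ms_stable(A, C):
    """Mean-square stability of dx = A x dt + C x dW (spectral test)."""
    return np.max(np.linalg.eigvals(lyap_operator(A, C)).real) < 0

def evaluate(A, B, C, D, Q, R, K, alpha):
    """Policy evaluation: cost matrix P of gain K under discount rate alpha."""
    n = A.shape[0]
    AK = A + B @ K - alpha * np.eye(n)   # discounting shifts the drift
    CK = C + D @ K
    W = Q + K.T @ R @ K
    p = np.linalg.solve(lyap_operator(AK, CK), -W.reshape(-1))
    P = p.reshape(n, n)
    return 0.5 * (P + P.T)               # symmetrize against round-off

def improve(B, C, D, R, P):
    """Policy improvement step for the SLQ problem."""
    return -np.linalg.solve(R + D.T @ P @ D, B.T @ P + D.T @ P @ C)

def find_stabilizer(A, B, C, D, Q, R, pi_steps=10, shrink=0.9, max_outer=1000):
    n, m = B.shape
    K = np.zeros((m, n))
    # Assumed initialization: raise alpha until K = 0 stabilizes the
    # discounted system.
    alpha = 1.0
    while not is_ms_stable(A - alpha * np.eye(n), C):
        alpha *= 2.0
    for _ in range(max_outer):
        for _ in range(pi_steps):        # PI on the discounted problem
            P = evaluate(A, B, C, D, Q, R, K, alpha)
            K = improve(B, C, D, R, P)
        P = evaluate(A, B, C, D, Q, R, K, alpha)
        # Assumed rule (not the paper's): with this step, P remains a
        # Lyapunov certificate for K at the smaller discount rate.
        step = shrink * np.linalg.eigvalsh(Q)[0] / (2.0 * np.linalg.eigvalsh(P)[-1])
        if alpha - step <= 0.0:
            return K                     # K stabilizes the undiscounted system
        alpha -= step
    raise RuntimeError("no stabilizer found within max_outer iterations")

# Hypothetical example: the open-loop system is unstable.
A = np.array([[0.5, 1.0], [0.0, 0.3]])
B = np.array([[0.0], [1.0]])
C = 0.1 * np.eye(2)
D = np.array([[0.0], [0.1]])
Q, R = np.eye(2), np.eye(1)

K = find_stabilizer(A, B, C, D, Q, R)
print("open loop stable:  ", is_ms_stable(A, C))
print("closed loop stable:", is_ms_stable(A + B @ K, C + D @ K))
```

The zero gain is admissible at the start only because the initial discount rate is large; each reduction of α is small enough that the current gain stays stabilizing for the next discounted problem, which is the role the abstract assigns to its explicit rule.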
Xinpei Zhang, Guangyan Jia
Fundamental theory of automation; Automation technology and equipment; Computing and computer technology
Xinpei Zhang, Guangyan Jia. Computing stabilizing feedback gains for stochastic linear systems via policy iteration method [EB/OL]. (2025-08-07) [2025-08-18]. https://arxiv.org/abs/2508.05214.