Mutual Information Optimal Control of Discrete-Time Linear Systems
Mutual Information Optimal Control of Discrete-Time Linear Systems
In this paper, we formulate a mutual information optimal control problem (MIOCP) for discrete-time linear systems. This problem can be regarded as an extension of a maximum entropy optimal control problem (MEOCP). Differently from the MEOCP where the prior is fixed to the uniform distribution, the MIOCP optimizes the policy and prior simultaneously. As analytical results, under the policy and prior classes consisting of Gaussian distributions, we derive the optimal policy and prior of the MIOCP with the prior and policy fixed, respectively. Using the results, we propose an alternating minimization algorithm for the MIOCP. Through numerical experiments, we discuss how our proposed algorithm works.
Shoju Enami、Kenji Kashima
自动化基础理论计算技术、计算机技术
Shoju Enami,Kenji Kashima.Mutual Information Optimal Control of Discrete-Time Linear Systems[EB/OL].(2025-07-07)[2025-08-02].https://arxiv.org/abs/2507.04712.点此复制
评论