A first-order condition for discrete-time distribution steering
We study a class of distribution-steering problems from a variational point of view. Under some differentiability assumptions, we derive necessary conditions for optimal Markov policies in the spirit of the Lagrange multiplier approach. We also provide a heuristic gradient-based method derived from the variational principle.
Alberto Domínguez Corella, David González-Sánchez
Fundamental theory of automation; computing technology, computer technology
Alberto Domínguez Corella, David González-Sánchez. A first-order condition for discrete-time distribution steering [EB/OL]. (2025-08-28) [2025-09-06]. https://arxiv.org/abs/2508.21026.