Dynamic programming and dimensionality in convex stochastic optimization and control
Dynamic programming and dimensionality in convex stochastic optimization and control
This paper studies stochastic optimization problems and associated Bellman equations in formats that allow for reduced dimensionality of the cost-to-go functions. In particular, we study stochastic control problems in the ``decision-hazard-decision'' form where at each stage, the system state is controlled both by predictable as well as adapted controls. Such an information structure may result in a lower dimensional system state than what is required in more traditional ``decision-hazard'' or ``hazard-decision'' formulations. The dimension is critical for the complexity of numerical dynamic programming algorithms and, in particular, for cutting plane schemes such as the stochastic dual dynamic programming algorithm. Our main result characterizes optimal solutions and optimum values in terms of solutions to generalized Bellman equations. Existence of solutions to the Bellman equations is established under general conditions that do not require compactness. We allow for general randomness but show that, in the Markovian case, the dimensionality of the Bellman equations reduces with respect to randomness just like in more traditional control formulations.
Teemu Pennanen、Ari-Pekka Perkki?
自动化基础理论计算技术、计算机技术
Teemu Pennanen,Ari-Pekka Perkki?.Dynamic programming and dimensionality in convex stochastic optimization and control[EB/OL].(2025-05-19)[2025-06-05].https://arxiv.org/abs/2505.12787.点此复制
评论