首页|Yet Another Distributional Bellman Equation

Yet Another Distributional Bellman Equation

来源：

英文摘要

We consider non-standard Markov Decision Processes (MDPs) where the target function is not only a simple expectation of the accumulated reward. Instead, we consider rather general functionals of the joint distribution of terminal state and accumulated reward which have to be optimized. For finite state and compact action space, we show how to solve these problems by defining a lifted MDP whose state space is the space of distributions over the true states of the process. We derive a Bellman equation in this setting, which can be considered as a distributional Bellman equation. Well-known cases like the standard MDP and quantile MDPs are shown to be special examples of our framework. We also apply our model to a variant of an optimal transport problem.

作者：Nicole B?uerle、Tamara G?ll、Anna Ja?kiewicz

作者单位：

学科分类：自动化基础理论计算技术、计算机技术

推荐引用：Nicole B?uerle,Tamara G?ll,Anna Ja?kiewicz.Yet Another Distributional Bellman Equation[EB/OL].(2025-05-27)[2025-06-15].https://arxiv.org/abs/2505.21098.点此复制

Yet Another Distributional Bellman Equation

Yet Another Distributional Bellman Equation

评论