|国家预印本平台
首页|Yet Another Distributional Bellman Equation

Yet Another Distributional Bellman Equation

Yet Another Distributional Bellman Equation

来源:Arxiv_logoArxiv
英文摘要

We consider non-standard Markov Decision Processes (MDPs) where the target function is not only a simple expectation of the accumulated reward. Instead, we consider rather general functionals of the joint distribution of terminal state and accumulated reward which have to be optimized. For finite state and compact action space, we show how to solve these problems by defining a lifted MDP whose state space is the space of distributions over the true states of the process. We derive a Bellman equation in this setting, which can be considered as a distributional Bellman equation. Well-known cases like the standard MDP and quantile MDPs are shown to be special examples of our framework. We also apply our model to a variant of an optimal transport problem.

Nicole B?uerle、Tamara G?ll、Anna Ja?kiewicz

自动化基础理论计算技术、计算机技术

Nicole B?uerle,Tamara G?ll,Anna Ja?kiewicz.Yet Another Distributional Bellman Equation[EB/OL].(2025-05-27)[2025-06-15].https://arxiv.org/abs/2505.21098.点此复制

评论