Modelling bounded rational decision-making through Wasserstein constraints
Modelling bounded rational decision-making through information-constrained processing provides a principled approach for representing departures from rationality within a reinforcement learning framework, while still treating decision-making as an optimization process. However, existing approaches are generally based on entropy, Kullback-Leibler (KL) divergence, or mutual information. In this work, we highlight issues with these approaches when dealing with ordinal action spaces. Specifically, entropy assumes uniform prior beliefs, missing the impact of a priori biases on decision-making. KL divergence addresses this but has no notion of "nearness" of actions; it also has several well-known, potentially undesirable properties, such as a lack of symmetry, and it requires the distributions to have the same support (e.g. positive probability for all actions). Mutual information is often difficult to estimate. Here, we propose an alternative approach for modelling bounded rational RL agents utilising Wasserstein distances, which overcomes the aforementioned issues. Crucially, this approach accounts for the nearness of ordinal actions, modelling "stickiness" in agent decisions and the unlikeliness of rapidly switching to far-away actions, while also supporting low-probability actions and zero-support prior distributions, and it is simple to calculate directly.
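The contrast between KL divergence and Wasserstein distance on ordinal actions can be illustrated with a small sketch. This is not the authors' implementation: the function names, the five-action space, the unit spacing between adjacent actions, and the one-hot example distributions are all assumptions chosen for illustration. It uses the standard fact that, in one dimension, the 1-Wasserstein distance equals the summed absolute difference between the two cumulative distributions.

```python
import numpy as np


def wasserstein_1d(p, q):
    """1-Wasserstein distance between two distributions over ordinal
    actions 0..n-1, assuming unit spacing between adjacent actions."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    # In 1D, W1 is the sum of absolute differences between the CDFs.
    return float(np.abs(np.cumsum(p) - np.cumsum(q)).sum())


def kl_divergence(p, q):
    """KL(p || q); infinite when p puts mass where q has none."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    mask = p > 0
    if np.any(q[mask] == 0):
        return float("inf")
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))


# Hypothetical example: a prior concentrated on action 0 (zero support
# elsewhere) and two candidate policies, one nearby and one far away.
prior = np.array([1.0, 0.0, 0.0, 0.0, 0.0])
near = np.array([0.0, 1.0, 0.0, 0.0, 0.0])
far = np.array([0.0, 0.0, 0.0, 0.0, 1.0])

print(wasserstein_1d(near, prior))  # 1.0: one step from the prior
print(wasserstein_1d(far, prior))   # 4.0: four steps from the prior
print(kl_divergence(near, prior))   # inf: supports differ
print(kl_divergence(far, prior))    # inf: no notion of nearness
```

Under these assumptions, the Wasserstein distance grows with how far the policy moves from the prior along the ordinal scale and stays finite even though the prior assigns zero probability to the chosen action, whereas KL divergence is infinite in both cases and cannot distinguish the near policy from the far one.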
Benjamin Patrick Evans, Leo Ardon, Sumitra Ganesh
Computing technology, computer technology
Benjamin Patrick Evans, Leo Ardon, Sumitra Ganesh. Modelling bounded rational decision-making through Wasserstein constraints [EB/OL]. (2025-04-01) [2025-05-01]. https://arxiv.org/abs/2504.03743.