Deliberation gated by opportunity cost adapts to context with urgency

Source: bioRxiv
Abstract

Finding the right amount of deliberation, between insufficient and excessive, is a hard decision-making problem that depends on the value we place on our time. Average reward, putatively encoded by tonic dopamine, serves in existing reinforcement learning theory as the stationary opportunity cost of time, and of deliberation in particular. However, this cost often varies with environmental context, which can change over time. Here, we introduce an opportunity cost of deliberation estimated adaptively on multiple timescales to account for non-stationary contextual factors. We use it in a simple decision-making heuristic based on average-reward reinforcement learning (AR-RL) that we call Performance-Gated Deliberation (PGD). We propose PGD as a strategy used by animals wherein deliberation cost is implemented directly as urgency, a previously characterized neural signal that effectively controls the speed of the decision-making process. We show that PGD outperforms AR-RL solutions in explaining the behaviour and urgency of non-human primates in a context-varying random walk prediction task, and that it is consistent with relative performance and urgency in a context-varying random dot motion task. We make readily testable predictions for both neural activity and behaviour and call for an integrated research program in cognitive and systems neuroscience around the value of time.
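As a rough illustration of what estimating an opportunity cost on multiple timescales could look like, the sketch below tracks reward per time step with one exponential moving average per timescale, and prices a stretch of deliberation as rate times duration, the usual average-reward reading of the cost of time. The function names, the two timescales, and the choice of exponential averaging are assumptions made for illustration only; the abstract does not specify the estimator used in PGD.

```python
from typing import List, Sequence


def update_rates(rates: Sequence[float], reward: float,
                 timescales: Sequence[float]) -> List[float]:
    """Advance each reward-rate estimate by one time step.

    Each estimate is an exponential moving average with step size 1/tau,
    so a small tau follows recent context and a large tau follows the
    long-run average. (Hypothetical estimator, not taken from the paper.)
    """
    return [r + (reward - r) / tau for r, tau in zip(rates, timescales)]


def opportunity_cost(rate: float, duration: float) -> float:
    """Reward forgone by spending `duration` time steps at reward rate `rate`."""
    return rate * duration


def run(rewards: Sequence[float], timescales: Sequence[float]) -> List[float]:
    """Feed a reward sequence through the estimators, starting from zero."""
    rates = [0.0 for _ in timescales]
    for reward in rewards:
        rates = update_rates(rates, reward, timescales)
    return rates


# Assumed timescales, in time steps: a fast (context) and a slow (long-run) estimate.
TIMESCALES = (10.0, 1000.0)

# A rich context (reward 1 on every step) followed by a poor one (reward 0).
rich_then_poor = [1.0] * 100 + [0.0] * 50
fast_rate, slow_rate = run(rich_then_poor, TIMESCALES)
```

After the switch to the poor context, the fast estimate drops quickly while the slow one still reflects the earlier rich period, so the same deliberation time is priced differently depending on which timescale is consulted.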

Cisek Paul, Touzel Maximilian Puelma, Lajoie Guillaume

Department of Neuroscience, Université de Montréal; Mila, Quebec AI Institute || Department of Computer Science and Operations Research, Université de Montréal; Mila, Quebec AI Institute || Department of Mathematics and Statistics, Université de Montréal || Canada CIFAR AI Chair

10.1101/2021.07.31.452742

Biological science theory and methods; biological science research methods and techniques; biophysics

primate decision-making; reinforcement learning; urgency; opportunity cost

Cisek Paul, Touzel Maximilian Puelma, Lajoie Guillaume. Deliberation gated by opportunity cost adapts to context with urgency[EB/OL]. (2025-03-28)[2025-05-23]. https://www.biorxiv.org/content/10.1101/2021.07.31.452742.
