首页|Optimal Targeting in Dynamic Systems

Optimal Targeting in Dynamic Systems

来源：

英文摘要

Modern treatment targeting methods often rely on estimating the conditional average treatment effect (CATE) using machine learning tools. While effective in identifying who benefits from treatment on the individual level, these approaches typically overlook system-level dynamics that may arise when treatments induce strain on shared capacity. We study the problem of targeting in Markovian systems, where treatment decisions must be made one at a time as units arrive, and early decisions can impact later outcomes through delayed or limited access to resources. We show that optimal policies in such settings compare CATE-like quantities to state-specific thresholds, where each threshold reflects the expected cumulative impact on the system of treating an additional individual in the given state. We propose an algorithm that augments standard CATE estimation with off-policy evaluation techniques to estimate these thresholds from observational data. Theoretical results establish consistency and convergence guarantees, and empirical studies demonstrate that our method improves long-run outcomes considerably relative to individual-level CATE targeting rules.

作者：Yuchen Hu、Shuangning Li、Stefan Wager

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Yuchen Hu,Shuangning Li,Stefan Wager.Optimal Targeting in Dynamic Systems[EB/OL].(2025-06-30)[2025-07-16].https://arxiv.org/abs/2507.00312.点此复制

Optimal Targeting in Dynamic Systems

Optimal Targeting in Dynamic Systems

评论