GCF: Generalized Causal Forest for Heterogeneous Treatment Effect Estimation in Online Marketplace
GCF: Generalized Causal Forest for Heterogeneous Treatment Effect Estimation in Online Marketplace
Uplift modeling is a rapidly growing approach that utilizes causal inference and machine learning methods to directly estimate the heterogeneous treatment effects, which has been widely applied to various online marketplaces to assist large-scale decision-making in recent years. The existing popular models, like causal forest (CF), are limited to either discrete treatments or posing parametric assumptions on the outcome-treatment relationship that may suffer model misspecification. However, continuous treatments (e.g., price, duration) often arise in marketplaces. To alleviate these restrictions, we use a kernel-based doubly robust estimator to recover the non-parametric dose-response functions that can flexibly model continuous treatment effects. Moreover, we propose a generic distance-based splitting criterion to capture the heterogeneity for the continuous treatments. We call the proposed algorithm generalized causal forest (GCF) as it generalizes the use case of CF to a much broader setting. We show the effectiveness of GCF by deriving the asymptotic property of the estimator and comparing it to popular uplift modeling methods on both synthetic and real-world datasets. We implement GCF on Spark and successfully deploy it into a large-scale online pricing system at a leading ride-sharing company. Online A/B testing results further validate the superiority of GCF.
Jiecheng Guo、Hongtu Zhu、Xiaoqing Yang、Zhonggen Sun、Shu Wan、Mengfan Xu、Chen Zheng
计算技术、计算机技术
Jiecheng Guo,Hongtu Zhu,Xiaoqing Yang,Zhonggen Sun,Shu Wan,Mengfan Xu,Chen Zheng.GCF: Generalized Causal Forest for Heterogeneous Treatment Effect Estimation in Online Marketplace[EB/OL].(2022-03-21)[2025-08-02].https://arxiv.org/abs/2203.10975.点此复制
评论