|国家预印本平台
首页|Multiply Robust Conformal Risk Control with Coarsened Data

Multiply Robust Conformal Risk Control with Coarsened Data

Multiply Robust Conformal Risk Control with Coarsened Data

来源:Arxiv_logoArxiv
英文摘要

Conformal Prediction (CP) has recently received a tremendous amount of interest, leading to a wide range of new theoretical and methodological results for predictive inference with formal theoretical guarantees. However, the vast majority of CP methods assume that all units in the training data have fully observed data on both the outcome and covariates of primary interest, an assumption that rarely holds in practice. In reality, training data are often missing the outcome, a subset of covariates, or both on some units. In addition, time-to-event outcomes in the training set may be censored due to dropout or administrative end-of-follow-up. Accurately accounting for such coarsened data in the training sample while fulfilling the primary objective of well-calibrated conformal predictive inference, requires robustness and efficiency considerations. In this paper, we consider the general problem of obtaining distribution-free valid prediction regions for an outcome given coarsened training data. Leveraging modern semiparametric theory, we achieve our goal by deriving the efficient influence function of the quantile of the outcome we aim to predict, under a given semiparametric model for the coarsened data, carefully combined with a novel conformal risk control procedure. Our principled use of semiparametric theory has the key advantage of facilitating flexible machine learning methods such as random forests to learn the underlying nuisance functions of the semiparametric model. A straightforward application of the proposed general framework produces prediction intervals with stronger coverage properties under covariate shift, as well as the construction of multiply robust prediction sets in monotone missingness scenarios. We further illustrate the performance of our methods through various simulation studies.

Manit Paul、Arun Kumar Kuchibhotla、Eric J. Tchetgen Tchetgen

数学计算技术、计算机技术

Manit Paul,Arun Kumar Kuchibhotla,Eric J. Tchetgen Tchetgen.Multiply Robust Conformal Risk Control with Coarsened Data[EB/OL].(2025-08-21)[2025-09-02].https://arxiv.org/abs/2508.15489.点此复制

评论