|国家预印本平台
首页|Pseudo-Labeling for Kernel Ridge Regression under Covariate Shift

Pseudo-Labeling for Kernel Ridge Regression under Covariate Shift

Pseudo-Labeling for Kernel Ridge Regression under Covariate Shift

来源:Arxiv_logoArxiv
英文摘要

We develop and analyze a principled approach to kernel ridge regression under covariate shift. The goal is to learn a regression function with small mean squared error over a target distribution, based on unlabeled data from there and labeled data that may have a different feature distribution. We propose to split the labeled data into two subsets, and conduct kernel ridge regression on them separately to obtain a collection of candidate models and an imputation model. We use the latter to fill the missing labels and then select the best candidate accordingly. Our non-asymptotic excess risk bounds demonstrate that our estimator adapts effectively to both the structure of the target distribution and the covariate shift. This adaptation is quantified through a notion of effective sample size that reflects the value of labeled source data for the target regression task. Our estimator achieves the minimax optimal error rate up to a polylogarithmic factor, and we find that using pseudo-labels for model selection does not significantly hinder performance.

Kaizheng Wang

计算技术、计算机技术

Kaizheng Wang.Pseudo-Labeling for Kernel Ridge Regression under Covariate Shift[EB/OL].(2025-07-24)[2025-08-10].https://arxiv.org/abs/2302.10160.点此复制

评论