PC Adjusted Testing for Low Dimensional Parameters
In this paper, we investigate the impact of high-dimensional Principal Component (PC) adjustments on inferring the effects of variables on outcomes, with a focus on applications in genetic association studies where PC adjustment is commonly used to account for population stratification. We consider high-dimensional linear regression in the regime where the number of covariates grows proportionally to the number of samples. In this setting, we provide an asymptotically precise understanding of when PC adjustments yield valid tests with controlled Type I error rates. Our results demonstrate that, under both fixed and diverging signal strengths, PC regression often fails to control the Type I error at the desired nominal level. Furthermore, we establish necessary and sufficient conditions for Type I error inflation based on covariate distributions. These theoretical findings are further supported by a series of numerical experiments.
Rounak Dey, Rajarshi Mukherjee, Sohom Bhattacharya
Research methods and techniques in the biological sciences; Genetics
Rounak Dey, Rajarshi Mukherjee, Sohom Bhattacharya. PC Adjusted Testing for Low Dimensional Parameters [EB/OL]. (2025-06-27) [2025-07-21]. https://arxiv.org/abs/2209.10774.