A new measure of dependence: Integrated $R^2$
A new measure of dependence: Integrated $R^2$
We propose a new measure of dependence that quantifies the degree to which a random variable $Y$ depends on a random vector $X$. This measure is zero if and only if $Y$ and $X$ are independent, and equals one if and only if $Y$ is a measurable function of $X$. We introduce a simple and interpretable estimator that is comparable in ease of computation to classical correlation coefficients such as Pearson's, Spearman's, or Chatterjee's. Building on this coefficient, we develop a model-free variable selection algorithm, feature ordering by dependence (FORD), inspired by FOCI. FORD requires no tuning parameters and is provably consistent under suitable sparsity assumptions. We demonstrate its effectiveness and improvements over FOCI through experiments on both synthetic and real datasets.
Mona Azadkia、Pouya Roudaki
计算技术、计算机技术
Mona Azadkia,Pouya Roudaki.A new measure of dependence: Integrated $R^2$[EB/OL].(2025-05-23)[2025-06-08].https://arxiv.org/abs/2505.18146.点此复制
评论