Variable Importance Assessments and Backward Variable Selection for High-Dimensional Data
Variable Importance Assessments and Backward Variable Selection for High-Dimensional Data
Variable selection in high-dimensional scenarios is of great interested in statistics. One application involves identifying differentially expressed genes in genomic analysis. Existing methods for addressing this problem have some limits or disadvantages. In this paper, we propose distance based variable importance measures to deal with these problems, which is inspired by the Multi-Response Permutation Procedure (MRPP). The proposed variable importance assessments can effectively measure the importance of an individual dimension by quantifying its influence on the differences between multivariate distributions. A backward selection algorithm is developed that can be used in high-dimensional variable selection to discover important variables. Both simulations and real data applications demonstrate that our proposed method enjoys good properties and has advantages over other methods.
Liuhua Peng、Dan Nettleton、Long Qu
生物科学现状、生物科学发展生物科学研究方法、生物科学研究技术
Liuhua Peng,Dan Nettleton,Long Qu.Variable Importance Assessments and Backward Variable Selection for High-Dimensional Data[EB/OL].(2018-06-17)[2025-04-27].https://arxiv.org/abs/1806.06468.点此复制
评论