Estimation of the number of principal components in high-dimensional multivariate extremes
For multivariate regularly varying random vectors of dimension $d$, the dependence structure of the extremes is modeled by the so-called angular measure. When the dimension $d$ is high, estimating the angular measure is challenging because of its complexity. In this paper, we use Principal Component Analysis (PCA) as a method for dimension reduction and estimate the number of significant principal components of the empirical covariance matrix of the angular measure under the assumption of a spiked covariance structure. To this end, we develop Akaike Information Criteria (AIC) and Bayesian Information Criteria (BIC) to estimate the location of the spiked eigenvalue of the covariance matrix, which reflects the number of significant components, and we study the consistency of these information criteria. We investigate both the case where the dimension $d$ is fixed and the case where $d$ tends to $\infty$ under different high-dimensional scenarios. When the dimension $d$ is fixed, we establish that the AIC is not consistent, whereas the BIC is weakly consistent. In the high-dimensional setting, using techniques from random matrix theory, we derive sufficient conditions for the AIC and the BIC to be consistent. Finally, the performance of the different AIC and BIC versions is compared in a simulation study and applied to high-dimensional precipitation data.
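To make the idea of selecting the number of components by an information criterion concrete, the following is a minimal Python sketch. It does not reproduce the criteria developed in the paper. It assumes the classical Gaussian-likelihood form of AIC and BIC for a spiked covariance model (the smallest $d-p$ eigenvalues are taken to be equal), applies it to the empirical covariance of the angular components of the $k$ observations with the largest Euclidean norm, and uses hypothetical function names (`angular_components`, `information_criteria`, `estimate_num_components`) chosen only for illustration.

```python
import numpy as np


def angular_components(X, k):
    """Project the k rows of X with the largest Euclidean norm onto the unit sphere."""
    norms = np.linalg.norm(X, axis=1)
    idx = np.argsort(norms)[-k:]          # indices of the k largest norms
    return X[idx] / norms[idx, None]


def information_criteria(eigvals, k):
    """Classical AIC/BIC curves for p = 0, ..., d-1 spiked eigenvalues.

    Assumption: Gaussian-type likelihood in which the d-p smallest
    eigenvalues are equal. This is a textbook form, not the paper's version.
    """
    lam = np.sort(np.asarray(eigvals, dtype=float))[::-1]
    lam = np.clip(lam, 1e-12, None)       # guard against log(0)
    d = lam.size
    aic = np.empty(d)
    bic = np.empty(d)
    for p in range(d):
        tail = lam[p:]
        # log(arithmetic mean / geometric mean) of the non-spiked eigenvalues
        log_ratio = np.log(tail.mean()) - np.log(tail).mean()
        fit = k * (d - p) * log_ratio
        # free parameters: p spikes, one noise level, eigenvector directions
        n_params = d * p - p * (p - 1) / 2 + 1
        aic[p] = fit + 2 * n_params
        bic[p] = fit + np.log(k) * n_params
    return aic, bic


def estimate_num_components(X, k):
    """Return the (AIC, BIC) estimates of the number of significant components."""
    theta = angular_components(X, k)
    cov = np.cov(theta, rowvar=False)     # empirical covariance of the angles
    eigvals = np.linalg.eigvalsh(cov)
    aic, bic = information_criteria(eigvals, k)
    return int(np.argmin(aic)), int(np.argmin(bic))


if __name__ == "__main__":
    # Synthetic heavy-tailed data, used only to exercise the functions.
    rng = np.random.default_rng(0)
    X = rng.standard_t(df=3, size=(5000, 8))
    print(estimate_num_components(X, k=500))
```

The Euclidean norm is used here so that the empirical covariance of the angular components is generally of full rank; with a sum norm on nonnegative data the angles lie on the simplex and the smallest eigenvalue would be zero. The choice of $k$ is left to the user in this sketch.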
Lucas Butsch, Vicky Fasen-Hartmann
Mathematics
Lucas Butsch, Vicky Fasen-Hartmann. Estimation of the number of principal components in high-dimensional multivariate extremes [EB/OL]. (2025-05-28) [2025-07-16]. https://arxiv.org/abs/2505.22437.