Pair Correlation Factor and the Sample Complexity of Gaussian Mixtures
Pair Correlation Factor and the Sample Complexity of Gaussian Mixtures
We study the problem of learning Gaussian Mixture Models (GMMs) and ask: which structural properties govern their sample complexity? Prior work has largely tied this complexity to the minimum pairwise separation between components, but we demonstrate this view is incomplete. We introduce the \emph{Pair Correlation Factor} (PCF), a geometric quantity capturing the clustering of component means. Unlike the minimum gap, the PCF more accurately dictates the difficulty of parameter recovery. In the uniform spherical case, we give an algorithm with improved sample complexity bounds, showing when more than the usual $ε^{-2}$ samples are necessary.
Farzad Aryan
数学
Farzad Aryan.Pair Correlation Factor and the Sample Complexity of Gaussian Mixtures[EB/OL].(2025-08-05)[2025-08-23].https://arxiv.org/abs/2508.03633.点此复制
评论