Simple, Efficient Entropy Estimation using Harmonic Numbers
Simple, Efficient Entropy Estimation using Harmonic Numbers
The estimation of entropy, a fundamental measure of uncertainty, is central to diverse data applications. For discrete random variables, however, efficient entropy estimation presents challenges, particularly when the cardinality of the support set is large relative to the available sample size. This is because, without other assumptions, there may be insufficient data to adequately characterize a probability mass function. Further complications stem from the dependence among transformations of empirical frequencies within the sample. This paper demonstrates that a simple entropy estimator based on the harmonic number function achieves asymptotic efficiency for discrete random variables with tail probabilities satisfying $p_j =o(j^{-2})$ as $j\rightarrow\infty$. This result renders statistical inference newly feasible for a broad class of distributions. Further, the proposed estimator has superior mean squared error bounds compared to the classical plug-in estimator, while retaining its computational simplicity, offering practical and theoretical advantages.
Octavio César Mesner
数学
Octavio César Mesner.Simple, Efficient Entropy Estimation using Harmonic Numbers[EB/OL].(2025-05-26)[2025-07-01].https://arxiv.org/abs/2505.20153.点此复制
评论