Lempel-Ziv Complexity, Empirical Entropies, and Chain Rules
We derive upper and lower bounds on the overall compression ratio of the 1978 Lempel-Ziv (LZ78) algorithm, applied independently to $k$-blocks of a finite individual sequence. Both bounds are given in terms of normalized empirical entropies of the given sequence. For the bounds to be tight and meaningful, the order of the empirical entropy should be small relative to $k$ in the upper bound, but large relative to $k$ in the lower bound. Several non-trivial conclusions arise from these bounds. One of them is a certain form of a chain rule for the Lempel-Ziv (LZ) complexity, which decomposes the joint LZ complexity of two sequences, say, $\boldsymbol{x}$ and $\boldsymbol{y}$, into the sum of the LZ complexity of $\boldsymbol{x}$ and the conditional LZ complexity of $\boldsymbol{y}$ given $\boldsymbol{x}$ (up to small terms). The price of this decomposition, however, is a change in the block length. Additional conclusions are discussed as well.
Neri Merhav
Computing technology, computer technology
Neri Merhav. Lempel-Ziv Complexity, Empirical Entropies, and Chain Rules [EB/OL]. (2025-06-15) [2025-06-27]. https://arxiv.org/abs/2506.12772.