|国家预印本平台
首页|Fair-Count-Min: Frequency Estimation under Equal Group-wise Approximation Factor

Fair-Count-Min: Frequency Estimation under Equal Group-wise Approximation Factor

Fair-Count-Min: Frequency Estimation under Equal Group-wise Approximation Factor

来源:Arxiv_logoArxiv
英文摘要

Frequency estimation in streaming data often relies on sketches like Count-Min (CM) to provide approximate answers with sublinear space. However, CM sketches introduce additive errors that disproportionately impact low-frequency elements, creating fairness concerns across different groups of elements. We introduce Fair-Count-Min, a frequency estimation sketch that guarantees equal expected approximation factors across element groups, thus addressing the unfairness issue. We propose a column partitioning approach with group-aware semi-uniform hashing to eliminate collisions between elements from different groups. We provide theoretical guarantees for fairness, analyze the price of fairness, and validate our theoretical findings through extensive experiments on real-world and synthetic datasets. Our experimental results show that Fair-Count-Min achieves fairness with minimal additional error and maintains competitive efficiency compared to standard CM sketches.

Nima Shahbazi、Stavros Sintos、Abolfazl Asudeh

计算技术、计算机技术

Nima Shahbazi,Stavros Sintos,Abolfazl Asudeh.Fair-Count-Min: Frequency Estimation under Equal Group-wise Approximation Factor[EB/OL].(2025-05-24)[2025-07-17].https://arxiv.org/abs/2505.18919.点此复制

评论