Communication-Efficient Publication of Sparse Vectors under Differential Privacy
Communication-Efficient Publication of Sparse Vectors under Differential Privacy
In this work, we propose a differentially private algorithm for publishing matrices aggregated from sparse vectors. These matrices include social network adjacency matrices, user-item interaction matrices in recommendation systems, and single nucleotide polymorphisms (SNPs) in DNA data. Traditionally, differential privacy in vector collection relies on randomized response, but this approach incurs high communication costs. Specifically, for a matrix with $N$ users, $n$ columns, and $m$ nonzero elements, conventional methods require $Ω(n \times N)$ communication, making them impractical for large-scale data. Our algorithm significantly reduces this cost to $O(\varepsilon m)$, where $\varepsilon$ is the privacy budget. Notably, this is even lower than the non-private case, which requires $Ω(m \log n)$ communication. Moreover, as the privacy budget decreases, communication cost further reduces, enabling better privacy with improved efficiency. We theoretically prove that our method yields results identical to those of randomized response, and experimental evaluations confirm its effectiveness in terms of accuracy, communication efficiency, and computational complexity.
Quentin Hillebrand、Vorapong Suppakitpaisarn、Tetsuo Shibuya
通信
Quentin Hillebrand,Vorapong Suppakitpaisarn,Tetsuo Shibuya.Communication-Efficient Publication of Sparse Vectors under Differential Privacy[EB/OL].(2025-06-25)[2025-07-16].https://arxiv.org/abs/2506.20234.点此复制
评论