Memory Optimization Strategy of LDPC Decoding Based on GPGPU

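Low-density parity-check (LDPC) codes, a class of high-performance error-control codes, have been adopted in many communication standards, but the heavy computational load of the decoding algorithm limits their potential. LDPC decoders based on general-purpose graphics processing units (GPGPUs) have attracted considerable attention in recent years because of their flexibility. This work analyzes the characteristics of the LDPC decoding algorithm in depth and proposes an interleaver representation of the Tanner graph that simplifies the decoding algorithm. Taking the GPU architecture into account, it proposes a top-down, multi-step optimization strategy that fully exploits the GPU's acceleration capability. Experimental results show that balancing the computation and memory-access loads, coalescing and aligning global memory accesses, and making full use of register resources significantly improve GPU performance. Compared with a CPU implementation, the decoder achieves a 383x speedup, and its overall performance exceeds that of existing GPU-based LDPC decoders.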

As powerful error-correcting codes, low-density parity-check (LDPC) codes have been adopted by emerging standards for digital communication; however, their performance gain is constrained by their heavy computational demand. GPU-based LDPC decoders have recently become an active research topic because of their lower cost and better flexibility. We analyze the parallelism of the sum-product algorithm (SPA) and propose a simple way to translate the Tanner graph into an interleaver. From a hardware architecture perspective, we propose an efficient top-down, multi-stage optimization strategy that progressively pushes the GPU's acceleration capability toward its limit. Experimental results demonstrate that balancing computation and memory access, coalescing global memory accesses, and aggressively using on-chip high-speed resources (e.g., shared memory and registers) improve performance significantly. The proposed decoder achieves a 383x speedup over a CPU-based decoder and also outperforms existing GPU-based decoders in overall performance.

邢座程、原略超、张洋、唐川

Communications; Computing and Computer Technology; Wireless Communications

LDPC decoder; sum-product algorithm; general-purpose graphics processing unit (GPGPU); optimization strategy; parallel computing

邢座程,原略超,张洋,唐川.基于GPGPU的LDPC解码访存优化技术研究[EB/OL].(2013-08-28)[2025-08-19].http://www.paper.edu.cn/releasepaper/content/201308-312.
