Research on a Congestion Control Algorithm Based on DCQCN
Remote Direct Memory Access (RDMA) is a technology that addresses the latency of server-side data processing in network transmission. It reduces latency effectively by transferring data without involving the operating system on either end. Since it was proposed, DCQCN has gradually become a widely adopted congestion control solution for RDMA networks. However, in data center networks that must meet ultra-low latency and high bandwidth requirements, DCQCN handles incast poorly and is complex to tune when large-scale incast communication occurs. When probing for available bandwidth, DCQCN increases its rate with a fixed period and step size, which in large-scale communication readily causes packet queuing and even packet loss. This paper therefore proposes DCQCN-Q, an improved algorithm based on the congestion queue at the switch. DCQCN-Q uses in-network telemetry (INT) to obtain precise switch queue information and feeds the congestion state of the switch back to the sender, which dynamically adjusts its rate according to this feedback signal. Experimental results on the NS-3 simulation platform show that DCQCN-Q handles incast loads well and, under the WebSearch workload, achieves data latency more than 13% lower than other mainstream algorithms.
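The feedback loop described in the abstract (switch queue depth reported to the sender, sender adjusting its rate in response) can be illustrated with a toy model. The sketch below is not the DCQCN-Q algorithm itself, whose update rules are not given in the abstract. It only assumes a generic scheme in which the rate is cut in proportion to how far the reported queue exceeds a target, and the increase step shrinks as the queue approaches that target. The class name, every parameter, and every default value are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class QueueFeedbackSender:
    """Toy sender whose rate reacts to a switch queue depth reported via telemetry.

    All fields are hypothetical illustration values, not taken from the paper.
    """
    line_rate_gbps: float = 100.0   # assumed link speed (upper bound on rate)
    min_rate_gbps: float = 0.1      # assumed lower bound on rate
    target_queue_kb: float = 50.0   # assumed depth below which the path is "uncongested"
    max_queue_kb: float = 400.0     # assumed depth at which the cut is strongest
    max_cut: float = 0.5            # assumed largest multiplicative decrease
    max_step_gbps: float = 5.0      # assumed largest additive increase
    rate_gbps: float = 100.0        # current sending rate

    def on_feedback(self, queue_kb: float) -> float:
        """Update and return the sending rate for one reported queue depth."""
        if queue_kb > self.target_queue_kb:
            # Congested: cut the rate in proportion to the excess queue.
            span = self.max_queue_kb - self.target_queue_kb
            severity = min(1.0, (queue_kb - self.target_queue_kb) / span)
            self.rate_gbps *= 1.0 - self.max_cut * severity
        else:
            # Uncongested: probe for bandwidth with a step that shrinks
            # as the queue approaches the target (not a fixed step).
            headroom = 1.0 - queue_kb / self.target_queue_kb
            self.rate_gbps += self.max_step_gbps * headroom
        # Keep the rate within [min_rate_gbps, line_rate_gbps].
        self.rate_gbps = max(self.min_rate_gbps,
                             min(self.line_rate_gbps, self.rate_gbps))
        return self.rate_gbps


if __name__ == "__main__":
    sender = QueueFeedbackSender()
    # Hypothetical sequence of reported queue depths in KB.
    for depth in (400.0, 200.0, 50.0, 10.0, 0.0):
        print(f"queue={depth:6.1f} KB -> rate={sender.on_feedback(depth):6.2f} Gbps")
```

The point of the variable step is the contrast with the fixed period and step size that the abstract identifies as a weakness of DCQCN: here the sender backs off harder when the queue is deep and probes more cautiously when the queue is close to the target.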
Yin Changchuan (尹长川), Yang Zetong (杨泽童)
Communications
data center network; congestion control; RDMA; DCQCN; low latency
Yin Changchuan, Yang Zetong. Research on a congestion control algorithm based on DCQCN [EB/OL]. (2024-04-10) [2025-08-16]. http://www.paper.edu.cn/releasepaper/content/202404-160.