|国家预印本平台
首页|大规模数据中心网络快收敛、强鲁棒的集中式路由协议

大规模数据中心网络快收敛、强鲁棒的集中式路由协议

Fast, Scalable and Robust Centralized Routing for Data Center Networks

中文摘要英文摘要

本文提出了一种快速收敛、高度鲁棒的数据中心网络(Data center network, DCN)集中式路由解决方案Primus。对于快速路由计算,Primus使用集中式控制器收集/下发网络的链路状态(Link state, LS),并将实际路由计算转移到每个交换机上。观察到在具有规则拓扑的DCN中,路由变化可以被划分为几个固定的模式,因此可以将每个交换机的路由计算简化为查表方式,即将LS的变化与预先配置的DCN基本拓扑进行比较,并根据预定义的规则更新路由路径。为了实现高效的控制器容错,Primus使用了reporter交换机来保证LS更新成功地传递到所有受影响的交换机。Primus使用多个无状态控制器和少量的冗余通信来容忍故障,正常情况这种方式几乎不会产生开销。Primus基于集中式架构进行设计,因此其保持了良好的路由可控性/可管理性,这使得Primus能够实现一些高级路由功能,包括路由故障可视化和加权多路径(Weighted Cost Multipath, WCMP)路由。

his paper presents a fast and robust centralized data center network (DCN) routing solution called \name. For fast routing calculation, \name uses centralized controller to collect/disseminates the network's link-states (LS), and offload the actual routing calculation onto each switch. Observing that the routing changes can be classified into a few fixed patterns in DCNs which have regular topologies, we simplify each switch's routing calculation into a table-lookup manner, i.e., comparing LS changes with pre-installed base topology and updating routing paths according to predefined rules. For efficient controller fault-tolerance, \name purposely uses reporter switch to ensure the LS updates successfully delivered to all affected switches. As such, \name can use multiple stateless controllers and little redundant traffic to tolerate failures, which incurs little overhead under normal case. Primus maintains good routing controllability/manageability thanks to its centralized architecture, which enables us to build several advanced routing features in our testbed, including routing failure visualization and weighted-cost-multi-path routing.

陈果、蒋洪波、王宏宇、徐婷婷、陆元伟、周桂华、邵华、QU Andrew、魏德惠、林福生、陈力

通信

数据中心网络集中式路由路由协议

ata center networks Centralized routing Network protocols.

陈果,蒋洪波,王宏宇,徐婷婷,陆元伟,周桂华,邵华,QU Andrew,魏德惠,林福生,陈力.大规模数据中心网络快收敛、强鲁棒的集中式路由协议[EB/OL].(2022-05-11)[2025-08-02].http://www.paper.edu.cn/releasepaper/content/202205-64.点此复制

评论