The Traffic Characteristics of Datacenters Running Hadoop
Research on Hadoop has attracted growing attention as more and more companies and organizations use it to run their business. However, almost no measurement work has been done so far on the traffic of datacenters running Hadoop, and this lack of measurements has hindered Hadoop research. Based on the inherent characteristics of datacenter networks, this paper proposes a targeted measurement method and develops a software tool called HADE specifically for processing and analyzing flow data. The measured datacenter traffic is generated by a search service. Finally, the paper presents measurement results for several traffic characteristics that should be useful to researchers, and gives some analysis of those results.
程时端、王洪波、黄永军、杨贺
Computing technology; computer technology
Technology of Computer Application; Datacenter; Hadoop; Measurement; MapReduce; Search Engine
程时端,王洪波,黄永军,杨贺.运行Hadoop的数据中心流量特性[EB/OL].(2011-12-05)[2025-08-10].http://www.paper.edu.cn/releasepaper/content/201112-94.