National Preprint Platform (国家预印本平台)
Hadoop Remaining Valid Time Scheduling Algorithm

Parallel computing is used to solve complex distributed computing problems and large-scale processing problems; by harnessing the computing power of multiple machines at the same time, it achieves efficient parallel computation. Hadoop, an open-source platform for parallel computing, is widely used in web log analysis, data mining, image processing, and other fields. As the use of Hadoop has deepened, services of all kinds have run into bottlenecks in processing timeliness. Existing Hadoop schedulers focus on shortening execution time and neglect timeliness (response deadline) requirements. To improve Hadoop's ability to handle time-sensitive tasks, this paper designs a Hadoop remaining valid time scheduling algorithm. During scheduling, the remaining valid time scheduler estimates each task's remaining valid time in real time and updates the task execution order accordingly. The results show that remaining valid time scheduling improves Hadoop's ability to handle time-sensitive tasks.
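The scheduling idea summarized in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's estimator, so this example assumes that a task's remaining valid time is its deadline minus the current time minus its estimated remaining run time, and that the task with the least remaining valid time runs first. The `Task` fields, the function names, and the sample values are hypothetical and are not taken from the paper or from Hadoop's API.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Task:
    name: str
    deadline: float           # assumed: absolute time after which the result is no longer useful
    est_remaining_run: float  # assumed: current estimate of the work still to do, in seconds


def remaining_valid_time(task: Task, now: float) -> float:
    """Assumed definition: time left before the deadline, minus the work still to do."""
    return task.deadline - now - task.est_remaining_run


def reorder(tasks: list[Task], now: float) -> list[Task]:
    """Return the tasks sorted so that the least remaining valid time comes first."""
    return sorted(tasks, key=lambda t: remaining_valid_time(t, now))


tasks = [
    Task("log-analysis", deadline=100.0, est_remaining_run=20.0),
    Task("image-batch", deadline=60.0, est_remaining_run=30.0),
    Task("mining-job", deadline=200.0, est_remaining_run=150.0),
]

# Initial ordering at time 0.
order_at_0 = [t.name for t in reorder(tasks, now=0.0)]

# At time 10 the run-time estimate for one task is revised upward,
# so re-estimating changes the execution order.
tasks[0].est_remaining_run = 75.0
order_at_10 = [t.name for t in reorder(tasks, now=10.0)]

print(order_at_0)
print(order_at_10)
```

In this sketch the order changes only because an estimate is revised; advancing the clock alone shifts every task's remaining valid time by the same amount. A real scheduler would refresh the estimates from task progress reports, which is outside the scope of this example.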

林荣恒 (Lin Rongheng), 李熙文 (Li Xiwen), 邹华 (Zou Hua)

Computing technology, computer technology

Hadoop scheduling; parallel computing; timeliness response; remaining time; MapReduce

Lin Rongheng, Li Xiwen, Zou Hua. Hadoop remaining valid time scheduling algorithm [EB/OL]. (2014-11-24) [2025-08-04]. http://www.paper.edu.cn/releasepaper/content/201411-435.