分布式环境下的模式匹配算法
he Pattern Matching Algorithm In Distributed Computing Environment
模式匹配算法是信息处理中常用的算法。但大部分经典的模式匹配算法提出的时间都比较早,即使这些算法效率很高,在处理海量数据时也有些乏力。如今分布式计算技术已成为处理海量数据的基本方法。本文利用分布式计算法的技术,对经典的WM模式匹配算法进行改进,提出了一种分布式环境下的模式匹配算法。该算法充分利用Map-Reduce的特性,将WM的预处理过程和匹配过程拆分成Map-Reduce作业,使处理过程并发进行。本文还将此算法的执行结果与经典的串行的模式匹配算法进行时间效率上的对比,从而证明该算法在效率上的优势。
Pattern Matching is a very popular method in information pocessing. However, most pattern matching algorithm are proposed many years ago. Even though these algorithms are very effective, they are not proper for processing huge amount of data. These years, distributed computing becomes a basic approach to handle this. This paper will use this technology to improve the classic WM pattern matching algorithm and propose a pattern matching algorithm in distributed computing environment. This algoritm will take Map-Reduce features, spliting the processing into pieces of Map-Reduce jobs and making them work simultaneously. This paper will also compare the efficiency of this algorithm with that of the basic pattern matching algorithm, proving this algorithm is more effective.
刘杰、杨文川
计算技术、计算机技术
模式匹配分布式计算Map-Reduce并行
pattern matchingdistributed computingMap-Reduceparallel processing
刘杰,杨文川.分布式环境下的模式匹配算法[EB/OL].(2012-01-11)[2025-08-02].http://www.paper.edu.cn/releasepaper/content/201201-376.点此复制
评论