|国家预印本平台
首页|基于区间编码的XML数据压缩方法研究

基于区间编码的XML数据压缩方法研究

Region encoding-based XML data compression method

中文摘要英文摘要

针对现有XML数据压缩方法在压缩数据上不支持有效连接操作问题,提出采用区间编码的压缩方法REXDC(region encoding-based XML data compression method)对XML数据中的节点进行区间编码,实现结构连接;提出相同子树的概念以及合并相同子树的方法,建立一种支持有效连接操作的存储模型,实现XML数据压缩同时解决在压缩数据上不支持有效连接操作的问题;最后,以压缩率、压缩时间、解压时间及查询性能作为衡量标准,将REXDC与XGrind、XPress和XQzip算法进行比较,实验结果证明,REXDC具有较好的压缩性能和查询性能。

he existing XML data compressions do not support effective structural join on compressed data. A new Compressor-REXDC is proposed, which encodes each node in XML document with region encoding and realizes the structure connection. The definition and merging method of the Same SubTree (SST) are proposed. A storage model is designed to support effective join operation. Finally, the REXDC compared with XGrind, XPress and XQzip, which takes compression ratio, compression and decompression time as a measure. The result shows that REXDC has good compression performance and query efficiency.

李华昱、魏祥丽、高海康

计算技术、计算机技术

XML相同子树数据压缩区间编码结构连接

XMLthe Same SubTreedata compressionregion encodingstructural join

李华昱,魏祥丽,高海康.基于区间编码的XML数据压缩方法研究[EB/OL].(2015-06-26)[2025-08-16].http://www.paper.edu.cn/releasepaper/content/201506-332.点此复制

评论