|国家预印本平台
首页|一种正负关联规则的快速查询扩展算法

一种正负关联规则的快速查询扩展算法

Efficient query expansion based on positive and negative association rules

中文摘要英文摘要

将负关联规则引入到查询扩展研究中,提出了新的查询扩展模型,并设计了一种基于正负关联规则的快速查询扩展算法。该算法通过对文本事务数据库的布尔化表示及数据结构的合理分配,采用向量内积策略来产生频繁和非频繁特征词集,并从中挖掘出词间正负关联规则。实验结果表明,该算法能对原查询词进行快速有效的扩展,且仅需扫描一次文本数据库,并具有动态剪枝,不保留中间候选项和节省大量内存等优点,对信息检索中查询扩展的研究具有重要意义。

his paper introduces negative association rules to the field of query expansion, and proposes new models of query expansion; meanwhile, we design an algorithm of query expansion based on positive and negative association rules. By converting the text database to Boolean Vector Matrix, and allotting equitable data storage structure, this algorithm can produce frequent and infrequent feature terms according to the inner vector product, and get positive and negative association rules between terms. Experimental results show that this algorithm can expand original query terms efficiently and effectively, and scan the database only once. Meanwhile, it has advantages such as pruning dynamically, without saving mid items, and saving lots of memories, which is important to the research of query expansion in information retrieval.

刘彩虹、刘强、祁瑞华

计算技术、计算机技术

数据挖掘负关联规则信息检索查询扩展

data miningnegative association rulesinformation retrievalquery expansion

刘彩虹,刘强,祁瑞华.一种正负关联规则的快速查询扩展算法[EB/OL].(2013-02-04)[2025-08-21].http://www.paper.edu.cn/releasepaper/content/201302-53.点此复制

评论