|国家预印本平台
首页|基于自动查询扩展的专利文档检索方法

基于自动查询扩展的专利文档检索方法

Patent Retrieval Method based on Automatic Query Expansion

中文摘要英文摘要

针对现有专利检索中的用户意图理解及查询扩展不足问题,提出了一种基于自动查询扩展的专利文档检索方法。首先结合专利文档特点,采用基于改进TF-IDF公式的专利领域词表提取方法,构建专利领域词表。在检索阶段,对查询输入串进行分析得到查询关键词汇,同领域词表相结合,确定查询所在领域及查询扩展难度。利用基于伪相关反馈的自动查询扩展技术,根据伪相关文档的术语分布差异分析,生成查询扩展项并排序,最后将扩展项与原始查询条件相结合,重新组成查询条件,完成专利查询。实验结果表明,该方法具有较高的召回率和平均准确率。

Existing patent retrieval methods cannot effectively capture user's query intents due to the lack in query expansion. To solve this problem, this paper propose a novel patent retrieval method based on automatic query expansion. Considering the characteristics of patent documents, an improved TF-IDF scheme is first adopted to extract patent domain terms and build the domain vocabularies. At the retrieval stage, query inputs are analyzed to extract key words, and then the field of query and the difficulty of query expansion are determined based on domain vocabularies. Furthermore, according to the term distribution variation analysis on pseudo related documents, the pseudo relevance feedback (PRF)-based automatic query expansion techniques are utilized to generate and rank the candidate expansion terms. At last, the expansion terms are combined with original query conditions to compose the final query conditions for searching. The comparative experiment results show that our method achieves better recall and average precision.

王锋、谢非、朱晓伟、林兰芬、羊帅

计算技术、计算机技术

人工智能专利检索领域词表查询扩展伪相关反馈

rtificial intelligencePatent retrievalDomain vocabularyQuery expansionPRF

王锋,谢非,朱晓伟,林兰芬,羊帅.基于自动查询扩展的专利文档检索方法[EB/OL].(2013-04-17)[2025-08-18].http://www.paper.edu.cn/releasepaper/content/201304-357.点此复制

评论