基于中文命名实体识别的查询扩充方法
Query expansion method based on Chinese named entity recognition
为在互联网大数据时代提供给用户便捷的获取信息方式,解决检索服务中存在的理解查询意图难的问题,提出了一种基于中文命名实体识别的查询扩充方法,通过对查询词中的实体类别和词性的识别帮助系统扩充查询词,提高查询的结果完整性和效率。首先,分析中文命名实体识别的现状与难点;然后,针对前述问题设计了基于词汇增强和相对编码的命名实体模型;最后,利用识别出的实体信息和词性信息对查询词进行扩充。通过在公开数据集MSRA上的实验结果分析,发现该方法提高了实体识别的准确率,并通过消融实验验证了所提出新方法的优越性。
In order to provide users with a convenient way to obtain information in the era of Internet big data and solve the problem of difficult understanding of query intention in retrieval service, this paper proposes a query expansion method based on Chinese named entity recognition, which helps the system expand the query words by identifying the entity category and part of speech in the query words, and improves the integrity and efficiency of the query results.Firstly, the current situation and difficulties of Chinese named entity recognition are analyzed; secondly, a named entity model based on vocabulary enhancement and relative coding is designed; finally, the query words are expanded by using the identified entity information and part of speech information. Through the analysis of the experimental results on the open data set MSRA, it is found that this method improves the accuracy of entity recognition, and the advantages of the new method are verified by ablation experiments.
王奕昕、吴岳辛、范春晓
计算技术、计算机技术
人工智能命名实体识别查询扩充栅栏网络相对位置编码
rtificial intelligencenamed entity recognitionquery expansionlattice networkrelative position coding
王奕昕,吴岳辛,范春晓.基于中文命名实体识别的查询扩充方法[EB/OL].(2021-03-25)[2025-08-16].http://www.paper.edu.cn/releasepaper/content/202103-280.点此复制
评论