ext-CRNN+Attention架构下的多类别文本信息分类
文本分类作为数据挖掘和信息检索领域的研究热点。迄今为止,传统机器学习方法依赖人工提取特征,复杂度高;深度学习网络本身特征表达能力强,但模型可解释性弱导致关键特征信息丢失。为此,以网络层次结合的方式设计了CRNN网络,并引入Attention机制,提出一种Text-CRNN+Attention模型。首先利用CNN处理局部特征的位置不变性,提取高效局部特征信息;然后RNN进行序列特征建模时,引入Attention机制对每一时刻输出序列信息进行自动加权,减少关键特征的丢失;最后完成时间和空间上的特征提取。实验结果表明,提出的模型较其他模型准确率提升了2~3个百分点;在提取文本特征时,该模型既保证了数据的局部相关性,又起到强化序列特征的有效组合能力。
ext classification is a research hotspot in the field of data mining and information retrieval. In view of the current research process , traditional machine learning methods relies on manual feature extraction with high complexity; Deep learning network has strong feature expression ability, but the model is weak in interpretability, leading to the loss of key feature information. For this reason, the author designed the CRNN network in the way of network level combination, introduced Attention mechanism, and proposed a Text-CRNN+Attention model. Firstly, CNN was used to deal with the position invariance of local features and extracted efficient local feature information. Then, Attention mechanism was introduced to automatically weigh the output sequence information at each time to reduce the loss of key features when RNN was used to model the sequence features. The feature extraction in time and space is completed. The experimental results show that the accuracy of the proposed model is 2 to 3 percentage points higher than that of other models. When dealing with text data, the model not only guarantees the local correlation of data, but also strengthens the effective combination ability of sequence features.
马成贤、杨腾飞、周嫣然、卢健
计算技术、计算机技术
文本分类NNRNNRNNttention机制
马成贤,杨腾飞,周嫣然,卢健.ext-CRNN+Attention架构下的多类别文本信息分类[EB/OL].(2019-05-10)[2025-05-11].https://chinaxiv.org/abs/201905.00047.点此复制
评论