基于跳层连接卷积网络的语音情感识别研究
Research on speech emotion recognition based on layer hopping connected convolution network
随着智能设备的发展,语音识别技术作为最为便捷的人机交互技术之一,需求日益增长,部分产品如语音智能助手已经走向商用,但是据用户普遍反映,目前的各大语音助手还远达不到自然、流畅的交流与互动,这是因为语音助手缺少了情感的表达。随着人们对人机交互技术的研究逐渐深入,如何识别语音信号中的情感信息也成为亟待解决的问题之一。目前学术界对于语音情感识别的研究仅仅停留在使用传统深度学习模型上。为研究更适合于语音情感的深度学习网络,本文将跳层连接引入卷积神经网络中,使用语谱图作为输入特征,使用双向LSTM进行时序建模,并系统分析在卷积神经网络的不同卷积层后引入跳层连接对于情感识别的影响,选取表现最好的神经网络模型,实现了1.06%的性能提升。
With the development of intelligent devices, voice recognition technology, as one of the most convenient human-computer interaction technologies, has an increasing demand. Some products, such as voice intelligent assistants, have become commercial. However, according to the general reflection of users, the current major voice assistants are far from natural and smooth communication and interaction, because voice assistants lack emotional expression. With the deepening of the research on human-computer interaction technology, how to recognize the emotional information in speech signals has become one of the urgent problems to be solved. At present, the academic research on speech emotion recognition only stays on the use of traditional deep learning model. In order to study the deep learning network more suitable for speech emotion, this paper introduces the layer hopping connection into the convolution neural network, uses the spectrogram as the input feature, uses the two-way LSTM for time series modeling, systematically analyzes the impact of introducing the layer hopping connection on emotion recognition after different convolution layers of the convolution neural network, and selects the neural network model with the best performance, Finally, 1.06% performance improvement is achieved.
刘刚、潘晓萌
电子技术应用计算技术、计算机技术通信
语音情感识别跳层连接卷积神经网络双向LSTM语谱图
Speech emotion recognitionLayer hopping connectionConvolutional neural networkBidirectional LSTMSpectrogram
刘刚,潘晓萌.基于跳层连接卷积网络的语音情感识别研究[EB/OL].(2022-03-22)[2025-08-02].http://www.paper.edu.cn/releasepaper/content/202203-300.点此复制
评论