|国家预印本平台
首页|基于卷积神经网络的语音特征提取算法

基于卷积神经网络的语音特征提取算法

Feature extraction algorithm based on Convolution neural network

中文摘要英文摘要

语音识别时一种人机交互的手段,在如今电子计算机如此普遍的时代,如何让计算机可以智能化接受人类口头指令,听懂人类语言是一件非常有挑战性并有意义的一个课题。语音识别跟其他机器学习问题一样,特征的选取是起始的也是最重要的一环,一个具有区分性和稳定性的特征是一个分类问题较好识别率的前提。随着互联网信息指数性增加,海量语音数据的特征具有很大的非特定人差异性和噪声干扰性,常用的特征提取以及特征变换方法已经很难满足当前模型训练识别的需求。近些年来立足于语音识别和深度学习理论的紧密结合,通过研究发现卷积神经网络的结构十分适合语音信号的特征提取过程,本文提出一种基于卷积神经网络的特征提取方法,并且结合相对复杂的GMM-HMM模型组成新的语音识别系统。实验表明,卷积神经网络结构可以很好的克服语音信号的非特定人差异性以及噪声的影响,GMM-HMM模型相比softmax分类器更为切合语音复杂信号的建模,最终识别率有了很大的提升。

Speech recognition as a way of human-computer interaction, so prevalent in today's computer age, how to make intelligent computer can accept human oral instruction, understand the human language is a very challenging and meaningful topic. Speech recognition and other machine learning problem, feature selection is also one of the most important link of starting, one has to distinguish between sex and stability is characterized by a classification problem is the premise of good recognition rate. With the increase of Internet information grows exponentially, huge amounts of voice and data features has a great deal of speaker-independent, difference and noise interference methods of feature extraction and feature transformation is hard to meet the needs of the current training model identification. In recent years based on speech recognition and deep learning theory together, through the study found that the convolutional neural network structure is very suitable for speech signal feature extraction process, this paper proposes a feature extraction method based on convolution neural network, and the combination of relatively complex GMM - HMM model of the new voice recognition system. Experiments show that the convolution neural network structure can be very good to overcome the differences between speaker-independent speech signals and the influence of noise, GMM - HMM model is more relevant than softmax classifier in speech complex signal model area, the final recognition rate had the very big improvement.

刘刚、楚博策

计算技术、计算机技术通信无线通信

特征提取卷积神经网络语音识别

Feature extractionConvolution neural networkSpeech recognition

刘刚,楚博策.基于卷积神经网络的语音特征提取算法[EB/OL].(2015-11-24)[2025-08-02].http://www.paper.edu.cn/releasepaper/content/201511-414.点此复制

评论