|国家预印本平台
首页|基于LPMCC的语音情感识别研究

基于LPMCC的语音情感识别研究

Speech Emotion Recognition Based on LPMCC

中文摘要英文摘要

语音情感识别是人机语音智能交互的关键技术,特征提取是语音情感识别的重要组成部分,本文使用线性预测美尔倒谱系数(LPMCC)来表征语音情感,LPMCC是基于Mel频率和线性预测倒谱系数(LPCC)提出的频谱特征。这种特征很好地结合了美尔倒谱系数(MFCC)和线性预测倒谱系数(LPCC)的优点,既考虑了人耳听觉特性,同时提高了算法效率。本文对情感语句提取LPMCC特征,用支持向量机(SVM)进行识别,同时对中英文不同的语音库进行实验。实验表明,LPMCC的识别率较LPCC有所提高,单一中文或英文的情感语音库的识别率要高于混合中英文语音库的识别率。

Speech recognition is the key technology of human-computer speech intelligent interaction, feature extraction is an important part of emotional speech recognition. In this paper, we use linear prediction Mel-frequency cepstral coefficients (LPMCC) to reflect speech emotion. LPMCC are spectral features, which are proposed based on Mel frequency and Linear prediction cepstral coefficients (LPCC). LPMCC combines the advantages of Mel-frequency cepstral coefficients (MFCC) and linear prediction cepstral coefficients (LPCC) well, this approach considers the human auditory characteristics and also improves the efficiency of the algorithm. In this paper, linear prediction Mel-frequency cepstral coefficients (LPMCC) were extracted from emotional sentences by using different emotional voicelibs of Chinese and English, and recognized by support vector machine (SVM). Experimental results show that the recognition rate of LPMCC is better than that of LPCC, and the recognition rate of the single Chinese or the English emotional voicelib is better than that of the mixed emotional voicelib.

安秀红、张雪英

通信

语音情感识别LPMCCMFCC

Speech emotion recognitionLPMCCMFCC

安秀红,张雪英.基于LPMCC的语音情感识别研究[EB/OL].(2010-10-03)[2025-08-02].http://www.paper.edu.cn/releasepaper/content/201010-15.点此复制

评论