National Preprint Platform (国家预印本平台)

Research on a Speaker Recognition System Based on Vector Quantization

Voiceprint identification based on VQ


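Speaker recognition is a special form of speech recognition: its goal is not to recognize what is said but to identify who is speaking, that is, to extract speaker-specific characteristics from the speech signal. Using vector quantization (VQ) avoids the difficult problems of speech segmentation and time alignment and, as a data-compression technique, greatly reduces the amount of data the system has to store. This paper presents a speaker recognition system that uses complex cepstrum parameters as the recognition features together with a corresponding VQ model. When the amount of training data is small, complex cepstrum features give comparatively stable recognition performance.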

Speaker recognition is a special mode of speech recognition. Its aim is not to recognize the content of the speech but to determine who is speaking, that is, to extract the speaker's individual characteristics. Using VQ not only avoids the difficult problems of speech segmentation and time warping, but also reduces the required data storage, since VQ acts as a compression method. This paper introduces a digit-text speaker recognition method based on complex cepstrum parameters and improved vector quantization modeling. When the amount of training data is small, complex cepstrum features still give stable recognition.
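The VQ idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the codebooks are trained, so plain k-means is used here as a stand-in, and the feature vectors are synthetic 3-dimensional points standing in for complex cepstrum frames. All function and speaker names (`train_codebook`, `avg_distortion`, `identify`, `speaker_a`, `speaker_b`) are hypothetical. The sketch shows why no segmentation or time alignment is needed: each frame is scored independently against a speaker's codebook, so frame order never enters the decision.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def train_codebook(frames, size, iters=20, seed=0):
    """Build a VQ codebook with plain k-means (assumed stand-in training rule)."""
    rng = random.Random(seed)
    # Initialise the code vectors from randomly chosen training frames.
    codebook = [list(v) for v in rng.sample(frames, size)]
    for _ in range(iters):
        cells = [[] for _ in codebook]
        for f in frames:
            nearest = min(range(size), key=lambda k: sq_dist(f, codebook[k]))
            cells[nearest].append(f)
        for k, cell in enumerate(cells):
            if cell:  # an empty cell keeps its previous code vector
                codebook[k] = [sum(col) / len(cell) for col in zip(*cell)]
    return codebook


def avg_distortion(frames, codebook):
    """Mean distance from each frame to its nearest code vector."""
    return sum(min(sq_dist(f, c) for c in codebook) for f in frames) / len(frames)


def identify(frames, codebooks):
    """Return the speaker whose codebook quantizes the frames with least distortion."""
    return min(codebooks, key=lambda name: avg_distortion(frames, codebooks[name]))


def synthetic_frames(center, n, rng):
    """Synthetic stand-in for per-frame features (NOT real cepstral data)."""
    return [[c + rng.gauss(0, 0.3) for c in center] for _ in range(n)]


rng = random.Random(1)
training = {
    "speaker_a": synthetic_frames([0.0, 0.0, 0.0], 200, rng),
    "speaker_b": synthetic_frames([3.0, 3.0, 3.0], 200, rng),
}
# Only the small codebooks are stored per speaker, not the training frames.
codebooks = {name: train_codebook(fr, size=8) for name, fr in training.items()}

test_utterance = synthetic_frames([3.0, 3.0, 3.0], 50, rng)
result = identify(test_utterance, codebooks)
print(result)
```

In this toy setup each speaker is represented by 8 code vectors instead of 200 training frames, which is the storage reduction the abstract refers to. A real system would replace `synthetic_frames` with features extracted from speech.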

Ding Wei (丁伟), Wu Xiaopei (吴小培)

Communications

complex cepstrum; speaker recognition; vector quantization (VQ)

Ding Wei, Wu Xiaopei. Research on a Speaker Recognition System Based on Vector Quantization [EB/OL]. (2008-05-19) [2025-08-02]. http://www.paper.edu.cn/releasepaper/content/200805-516.
