说话人识别中用模型合成的编码畸变补偿研究 Coding distortion compensation of speaker identification based on model synthesis期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

说话人识别中用模型合成的编码畸变补偿研究

引用本文：	马苗苗,何勇军,韩纪庆.说话人识别中用模型合成的编码畸变补偿研究[J].计算机工程与应用,2011,47(3):135-138.

作者姓名：	马苗苗何勇军韩纪庆

作者单位：	哈尔滨工业大学,计算机科学与技术学院,哈尔滨,150001

基金项目：	国家重点基础研究发展规划(973)

摘要：	编码环境失配是影响说话人识别准确率的重要因素之一。在说话人识别系统上,对码速率在5.15～128 Kb/s之间的语音编码进行了实验分析,结果表明,高速率语音编码对说话人识别系统的影响不大,低速率语音编码使系统性能急剧下降。针对这一问题,采用基于UBM的说话人模型合成算法对低速率语音编码的说话人模型进行补偿,在NIST 2002单说话人识别数据库上的实验表明,此方法能显著提高系统识别率。
关键词：	语音编码说话人识别低速率模型合成
收稿时间：	2009-5-13
修稿时间：	2009-7-12
Coding distortion compensation of speaker identification based on model synthesis

MA Miaomiao,HE Yongjun,HAN Jiqing.Coding distortion compensation of speaker identification based on model synthesis[J].Computer Engineering and Applications,2011,47(3):135-138.

Authors:	MA Miaomiao HE Yongjun HAN Jiqing

Affiliation:	MA Miaomiao,HE Yongjun,HAN Jiqing

Abstract:	Environment mismatch in enrollment and test sessions caused by different code strategies is one of main reasons degrading the performance of speaker recognition.Experiments with speech in different code formats and code rate raging from 5.15 Kb/s to 128 Kb/s show that the speech with high-bit rate causes little distortion,while the ones with low-bit rate make the recognition rate decreasing sharply.To solve this problem,speaker model synthesis based on UBM is adopted to synthesis speaker models for target code environments to compensate the distortion caused by low-bit rate.Experiments on NIST 2002 corpus in one-speaker detection task show that the proposed approach obtains better performance than those with no compensation.

Keywords:	speech coding speaker identification low-bit rate model synthesis
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《计算机工程与应用》浏览原始摘要信息
	点击此处可从《计算机工程与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏