首页 | 官方网站   微博 | 高级检索  
     

说话人识别中用模型合成的编码畸变补偿研究
引用本文:马苗苗,何勇军,韩纪庆.说话人识别中用模型合成的编码畸变补偿研究[J].计算机工程与应用,2011,47(3):135-138.
作者姓名:马苗苗  何勇军  韩纪庆
作者单位:哈尔滨工业大学,计算机科学与技术学院,哈尔滨,150001
基金项目:国家重点基础研究发展规划(973)
摘    要:编码环境失配是影响说话人识别准确率的重要因素之一。在说话人识别系统上,对码速率在5.15~128 Kb/s之间的语音编码进行了实验分析,结果表明,高速率语音编码对说话人识别系统的影响不大,低速率语音编码使系统性能急剧下降。针对这一问题,采用基于UBM的说话人模型合成算法对低速率语音编码的说话人模型进行补偿,在NIST 2002单说话人识别数据库上的实验表明,此方法能显著提高系统识别率。

关 键 词:语音编码  说话人识别  低速率  模型合成
收稿时间:2009-5-13
修稿时间:2009-7-12  

Coding distortion compensation of speaker identification based on model synthesis
MA Miaomiao,HE Yongjun,HAN Jiqing.Coding distortion compensation of speaker identification based on model synthesis[J].Computer Engineering and Applications,2011,47(3):135-138.
Authors:MA Miaomiao  HE Yongjun  HAN Jiqing
Affiliation:MA Miaomiao,HE Yongjun,HAN Jiqing
Abstract:Environment mismatch in enrollment and test sessions caused by different code strategies is one of main reasons degrading the performance of speaker recognition.Experiments with speech in different code formats and code rate raging from 5.15 Kb/s to 128 Kb/s show that the speech with high-bit rate causes little distortion,while the ones with low-bit rate make the recognition rate decreasing sharply.To solve this problem,speaker model synthesis based on UBM is adopted to synthesis speaker models for target code environments to compensate the distortion caused by low-bit rate.Experiments on NIST 2002 corpus in one-speaker detection task show that the proposed approach obtains better performance than those with no compensation.
Keywords:speech coding  speaker identification  low-bit rate  model synthesis
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号