首页 | 官方网站   微博 | 高级检索  
     

高斯PLDA在说话人确认中的应用及其联合估计
引用本文:许云飞,杨海,周若华,颜永红.高斯PLDA在说话人确认中的应用及其联合估计[J].自动化学报,2014,40(6):1068-1074.
作者姓名:许云飞  杨海  周若华  颜永红
作者单位:1.中国科学院语言声学与内容理解重点实验室 北京 100190
基金项目:国家高技术研究发展计划(863计划)(2012AA012503),国家自然科学基金(10925419,90920302,61072124,11074275,11161140319,91120001,61271426),中国科学院战略性先导科技专项(XDA06030100,XDA06030500),中科院重点部署项目(KGZD-EW-103-2)资助
摘    要:近年来,基于总变化因子的说话人识别方法成为说话人识别领域的主流方法.其中,概率线性鉴别分析(Probabilistic linear discriminant analysis,PLDA)因其优异的性能而得到学者们的广泛关注.然而,在估计PLDA模型时,传统的因子分析方法只更新模型空间,因此,模型均值不能很好地与更新后的模型空间耦合.提出联合估计法对模型均值和模型空间同时估计,得到更为严格的期望最大化更新公式,在美国国家标准与技术局说话人识别评测2010扩展测试数据库以及2012核心测试数据库上,等错率得到一定提升.

关 键 词:因子分析    总变化因子    概率线性鉴别分析    联合估计    期望最大化
收稿时间:2013-01-06

Gaussian PLDA for Speaker Verification and Joint Estimation
XU Yun-Fei,YANG Hai,ZHOU Ruo-Hua,YAN Yong-Hong.Gaussian PLDA for Speaker Verification and Joint Estimation[J].Acta Automatica Sinica,2014,40(6):1068-1074.
Authors:XU Yun-Fei  YANG Hai  ZHOU Ruo-Hua  YAN Yong-Hong
Affiliation:1.Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190
Abstract:Recently the approaches based on i-vector have become very popular in the speaker recognition domain. Among these methods, the probabilistic linear discriminant analysis (PLDA) has attracted much attention due to its promising performance. However, the traditional factor analysis method only updates model space, thus making model mean couple with the model space unsuitably. This paper propose an approach of joint estimation for both model mean and model space, resulting in more strict expectation maximization (EM) formula. The equal error rate has been improved on the NIST SRE 2010 extended test corpus and NIST SRE 2012 core test corpus.
Keywords:Factor analysis  i-vector  probabilistic linear discriminant analysis (PLDA)  joint estimation  expectation-maximization (EM)
本文献已被 CNKI 等数据库收录!
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号