基于韵律特征的说话人确认系统融合研究 Fusion of Speaker Verification System based on Super-segment Prosodic Feature期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于韵律特征的说话人确认系统融合研究

引用本文：	童强,李辉,方昕.基于韵律特征的说话人确认系统融合研究[J].通信技术,2013(11):90-94.

作者姓名：	童强李辉方昕

作者单位：	[1]中国科学技术大学电子科学与技术系,安徽合肥230027 [2]科大讯飞信息科技股份有限公司,安徽合肥230088

摘要：	提出一种基于超音段韵律特征和GMM—UBM—MAP的文本无关的说话人确认系统，并与基于MFCC特征参数的说话人确认系统融合，研究提出新的两系统融合策略。在超音段中提取基于基频的韵律特征参数，建立辅助系统。融合时，以基准系统基于MFCC特征参数的说话人确认系统为主系统，基于韵律特征参数的系统为辅助系统，当主系统的得分与阈值接近时，将两系统得分融合再判断。通过NIST2006数据库的实验表明，融合系统相对原系统有16．39％的提升。
关键词：	超音段韵律特征说话人确认得分融合
Fusion of Speaker Verification System based on Super-segment Prosodic Feature

TONG Qiang,LI Hui,FANG Xin.Fusion of Speaker Verification System based on Super-segment Prosodic Feature[J].Communications Technology,2013(11):90-94.

Authors:	TONG Qiang LI Hui FANG Xin

Affiliation:	1. Department of Electronic Science and Technology, University of Science and Technology of China, Hefei Anhui 230027, China ; 2. Anhui USTC iFLYTEK Co. , Ltd, Hefei Anhui 230088, China)

Abstract:	A test-independent speaker verification method based on super-segment prosodic feature and GMM-UBM-MAP is proposed, and fused with another system based on MFCC, a fusion strategy for two systems is suggested. With the extraction rhythm feature parameter based on pitch, the auxiliary system is constructed. In this fusion, the main system is based on MFCC, while the auxiliary system based on super -segment prosodic feature. When the score of the main system is close to threshold, the scores of these two systems are fused, and then the judgement is done. Experiments show that the equal error rate （EER） of the auxiliary system can reach 17.77% , and the performance of final system is improved by 16.39% as compared with the main system.

Keywords:	super-segment prosodic feature speaker verification fusion
本文献已被维普等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏