Playback speech detection algorithm based on modified cepstrum feature

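Cite this article:	LIN Lang, WANG Rangding, YAN Diqun, LI Can. Playback speech detection algorithm based on modified cepstrum feature[J]. Journal of Computer Applications, 2018, 38(6): 1648-1652.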

Authors:	LIN Lang, WANG Rangding, YAN Diqun, LI Can

Affiliation:	College of Information Science and Engineering, Ningbo University, Ningbo, Zhejiang 315211, China

Funding:	National Natural Science Foundation of China (61672302, U1736215); Natural Science Foundation of Zhejiang Province (LZ15F020002, LY17F020010).

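Abstract:	With the development of speech technology, spoofed speech of various kinds, typified by playback speech, poses a serious challenge to voiceprint authentication systems and audio forensics. To address playback attacks on voiceprint authentication systems, a detection algorithm based on a modified cepstrum feature is proposed. First, the coefficient of variation is used to analyze the frequency-domain differences between original speech and playback speech. Then, guided by that analysis, the Mel filter bank used in extracting Mel Frequency Cepstral Coefficients (MFCC) is replaced with a new filter bank that combines linear filters and inverse-Mel filters, which yields the modified cepstrum feature. Finally, a Gaussian Mixture Model (GMM) is used as the classifier. Experimental results show that the modified cepstrum feature detects playback speech effectively, with an equal error rate of about 3.45%.
Keywords:	coefficient of variation; Gaussian Mixture Model (GMM); playback speech detection; Mel Frequency Cepstral Coefficients (MFCC); filter bank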
Received:	2017-12-01
Revised:	2018-01-19
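
The first step of the abstract, comparing original and playback speech through the coefficient of variation, can be illustrated with a minimal sketch. It assumes the statistic is computed per frequency bin across frames of a magnitude spectrogram; the abstract does not give the exact procedure, and the function names and toy values below are hypothetical.

```python
import statistics

def coefficient_of_variation(values):
    """Return population standard deviation divided by mean (0.0 if the mean is 0)."""
    mean = statistics.fmean(values)
    if mean == 0:
        return 0.0
    return statistics.pstdev(values) / mean

def cv_per_bin(spectrogram):
    """spectrogram: list of frames, each a list of magnitudes, one per frequency bin.

    Returns one coefficient of variation per frequency bin, taken across frames.
    """
    return [coefficient_of_variation(bin_values) for bin_values in zip(*spectrogram)]

# Toy magnitude spectrogram (3 frames x 4 bins); the numbers are invented for illustration.
toy_spectrogram = [
    [1.0, 2.0, 3.0, 4.0],
    [1.0, 2.5, 3.0, 6.0],
    [1.0, 1.5, 3.0, 2.0],
]
cv = cv_per_bin(toy_spectrogram)
```

Computing `cv_per_bin` separately for an original recording and its playback copy, then comparing the two curves, would show which frequency regions differ most between them.
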
Playback speech detection algorithm based on modified cepstrum feature

LIN Lang, WANG Rangding, YAN Diqun, LI Can. Playback speech detection algorithm based on modified cepstrum feature[J]. Journal of Computer Applications, 2018, 38(6): 1648-1652.

Authors:	LIN Lang, WANG Rangding, YAN Diqun, LI Can

Affiliation:	Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, China

Abstract:	With the development of speech technology, various kinds of spoofed speech, typified by playback speech, pose a serious challenge to voiceprint authentication systems and audio forensics. To counter playback attacks on voiceprint authentication systems, a new detection algorithm based on a modified cepstrum feature is proposed. First, the coefficient of variation is used to analyze the difference between original speech and playback speech in the frequency domain. Second, based on that analysis, the Mel filter bank used in extracting Mel Frequency Cepstral Coefficients (MFCC) is replaced with a new filter bank composed of inverse-Mel filters and linear filters, which yields the modified cepstrum feature. Finally, a Gaussian Mixture Model (GMM) is used as the classifier to discriminate between the two kinds of speech. The experimental results show that the modified cepstrum feature detects playback speech effectively, with an equal error rate of about 3.45%.
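
The filter bank replacement described in the abstract can be sketched roughly as follows. The abstract does not say which band each filter type covers, how many filters are used, or how the inverse-Mel scale is defined, so everything specific here is an assumption: linear filters are placed below a hypothetical split frequency (`split_hz`), inverse-Mel filters above it, and the inverse-Mel edges are obtained by mirroring Mel-spaced edges so that filters grow denser toward high frequencies. All function and parameter names are illustrative.

```python
import math

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def linear_edges(f_lo, f_hi, n_filters):
    """Return n_filters + 2 edge frequencies equally spaced in Hz."""
    step = (f_hi - f_lo) / (n_filters + 1)
    return [f_lo + i * step for i in range(n_filters + 2)]

def inverse_mel_edges(f_lo, f_hi, n_filters):
    """Mel-spaced edges mirrored inside [f_lo, f_hi]: spacing shrinks toward f_hi."""
    width = f_hi - f_lo
    mel_step = hz_to_mel(width) / (n_filters + 1)
    mel_edges = [mel_to_hz(i * mel_step) for i in range(n_filters + 2)]
    return [f_hi - e for e in reversed(mel_edges)]

def triangular_filters(edges, n_bins, f_max):
    """One triangular filter per interior edge, sampled on n_bins bins spanning 0..f_max."""
    filters = []
    for left, center, right in zip(edges, edges[1:], edges[2:]):
        weights = []
        for k in range(n_bins):
            f = k * f_max / (n_bins - 1)
            if left < f <= center:
                weights.append((f - left) / (center - left))
            elif center < f < right:
                weights.append((right - f) / (right - center))
            else:
                weights.append(0.0)
        filters.append(weights)
    return filters

def combined_filter_bank(n_linear, n_inverse, split_hz, f_max, n_bins):
    """Assumed layout: linear filters below split_hz, inverse-Mel filters above it."""
    low = triangular_filters(linear_edges(0.0, split_hz, n_linear), n_bins, f_max)
    high = triangular_filters(inverse_mel_edges(split_hz, f_max, n_inverse), n_bins, f_max)
    return low + high

def cepstral_coefficients(power_spectrum, bank, n_ceps):
    """Log filter-bank energies followed by a DCT-II, as in standard MFCC extraction."""
    log_e = [math.log(sum(w * p for w, p in zip(filt, power_spectrum)) + 1e-10)
             for filt in bank]
    n = len(log_e)
    return [sum(e * math.cos(math.pi * c * (i + 0.5) / n) for i, e in enumerate(log_e))
            for c in range(n_ceps)]

# Illustrative settings only: 16 kHz audio (8 kHz bandwidth), 512-point FFT (257 bins).
bank = combined_filter_bank(n_linear=10, n_inverse=10, split_hz=4000.0,
                            f_max=8000.0, n_bins=257)
ceps = cepstral_coefficients([1.0] * 257, bank, n_ceps=12)
```

Apart from the filter bank, the pipeline is the usual MFCC one, which matches the abstract's description of swapping only the Mel filter bank.
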

Keywords:	coefficient of variation; Gaussian Mixture Model (GMM); playback speech detection; Mel Frequency Cepstral Coefficients (MFCC); filter bank
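
The equal error rate quoted in the abstract is the operating point where the false acceptance and false rejection rates coincide. A minimal sketch of how such a figure could be computed from classifier scores is given below; it assumes each utterance receives a single score (for example, a log-likelihood ratio between a genuine-speech GMM and a playback-speech GMM) and that higher scores mean "more likely genuine". The scores are invented, and the paper's own evaluation protocol is not described in the abstract.

```python
def equal_error_rate(genuine_scores, playback_scores):
    """Return (eer, threshold) where the two error rates are closest.

    An utterance is accepted as genuine when its score is >= threshold.
    """
    best = None
    for t in sorted(set(genuine_scores) | set(playback_scores)):
        # False rejection: genuine speech scored below the threshold.
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        # False acceptance: playback speech scored at or above the threshold.
        far = sum(s >= t for s in playback_scores) / len(playback_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2.0, t)
    return best[1], best[2]

# Invented scores for illustration only.
genuine = [2.0, 1.5, 0.9, 0.2]
playback = [-1.0, -0.5, 0.3, -2.0]
eer, threshold = equal_error_rate(genuine, playback)
```

With real data the two error curves rarely cross exactly at an observed score, so the sketch reports the average of the two rates at the threshold where they are closest.
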
