
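Playback speech detection algorithm based on modified cepstrum feature
Cite this article: LIN Lang, WANG Rangding, YAN Diqun, LI Can. Playback speech detection algorithm based on modified cepstrum feature[J]. Journal of Computer Applications, 2018, 38(6): 1648-1652.
Authors: LIN Lang, WANG Rangding, YAN Diqun, LI Can
Affiliation: Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, China
Funding: National Natural Science Foundation of China (61672302, U1736215); Zhejiang Provincial Natural Science Foundation of China (LZ15F020002, LY17F020010).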
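Abstract: As speech technology develops, spoofed speech of many kinds, with playback (replayed) speech as a typical example, poses a serious challenge to voiceprint authentication systems and to audio forensics. To counter playback attacks on voiceprint authentication systems, a detection algorithm based on a modified cepstral feature is proposed. First, the coefficient of variation is used to analyze how original and playback speech differ in the frequency domain. Then, guided by that analysis, the Mel filter bank used in extracting Mel Frequency Cepstral Coefficients (MFCC) is replaced with a new filter bank that combines linear filters and inverse-Mel filters, which yields the modified cepstral feature. Finally, a Gaussian Mixture Model (GMM) is used as the classifier. Experimental results show that the modified cepstral feature detects playback speech effectively, with an equal error rate of about 3.45%.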

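Keywords: coefficient of variation; Gaussian Mixture Model (GMM); playback speech detection; Mel Frequency Cepstral Coefficients (MFCC); filter bank
Received: 2017-12-01
Revised: 2018-01-19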
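
The first step in the abstract, comparing original and playback speech through the coefficient of variation in the frequency domain, can be illustrated roughly as follows. This is a minimal sketch and not the authors' code: the abstract does not say how the statistic is computed, so the per-bin definition (standard deviation over mean of the magnitude spectrum across frames), the Hamming window, and the names `frame_signal`, `spectral_cv`, `frame_len`, and `hop` are all assumptions made for illustration.

```python
import numpy as np


def frame_signal(x, frame_len=512, hop=256):
    """Split a 1-D signal into overlapping frames (hypothetical helper)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]


def spectral_cv(x, frame_len=512, hop=256):
    """Coefficient of variation (std / mean) of each frequency bin's
    magnitude across frames. Assumed definition, not taken from the paper."""
    frames = frame_signal(x, frame_len, hop) * np.hamming(frame_len)
    mag = np.abs(np.fft.rfft(frames, axis=1))  # shape: (n_frames, frame_len // 2 + 1)
    mean = mag.mean(axis=0)
    std = mag.std(axis=0)
    return std / (mean + 1e-12)  # small constant guards against empty bins
```

Computing `spectral_cv` for a genuine recording and for its replayed copy, then plotting the two curves against frequency, would show in which bands the two differ most; under this reading, those bands are where a modified filter bank should place its finest resolution.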

Playback speech detection algorithm based on modified cepstrum feature
LIN Lang, WANG Rangding, YAN Diqun, LI Can. Playback speech detection algorithm based on modified cepstrum feature[J]. Journal of Computer Applications, 2018, 38(6): 1648-1652.
Authors:LIN Lang  WANG Rangding  YAN Diqun  LI Can
Affiliation: Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, China
Abstract: With the development of speech technology, spoofed speech of various kinds, typified by playback speech, poses serious challenges to voiceprint authentication systems and audio forensics. To address playback attacks on voiceprint authentication systems, a detection algorithm based on a modified cepstral feature is proposed. First, the coefficient of variation is used to analyze the difference between original speech and playback speech in the frequency domain. Second, guided by that analysis, the Mel filter bank used in extracting Mel Frequency Cepstral Coefficients (MFCC) is replaced with a new filter bank composed of linear filters and inverse-Mel filters, which yields the modified cepstral feature. Finally, a Gaussian Mixture Model (GMM) is used as the classifier to distinguish original speech from playback speech. Experimental results show that the modified cepstral feature detects playback speech effectively, with an equal error rate of about 3.45%.
Keywords:coefficient of variation  Gaussian Mixture Model (GMM)  playback speech detection  Mel Frequency Cepstral Coefficients (MFCC)  filter bank  
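
The filter-bank replacement described in the abstract can be sketched as below. The abstract states only that linear and inverse-Mel filters are combined; it does not give the band split, the number of filters, or the exact inverse-Mel formula. This sketch therefore assumes linear filters below a hypothetical `split_hz` and inverse-Mel filters (a Mel layout mirrored so that resolution is finest at the top of the band) above it. Every function and parameter name here is illustrative, and GMM training and scoring are omitted.

```python
import numpy as np


def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def band_edges(scale, n_filters, f_lo, f_hi):
    """Edge frequencies (Hz) for n_filters triangular filters on [f_lo, f_hi]."""
    if scale == "linear":
        return np.linspace(f_lo, f_hi, n_filters + 2)
    # Mel-spaced offsets from 0 up to the band width
    offsets = mel_to_hz(np.linspace(0.0, hz_to_mel(f_hi - f_lo), n_filters + 2))
    if scale == "mel":
        return f_lo + offsets
    if scale == "inverse_mel":
        # Mirror the Mel layout: narrow filters at the high end of the band
        return f_hi - offsets[::-1]
    raise ValueError(f"unknown scale: {scale}")


def triangular_bank(edges_hz, n_fft, sample_rate):
    """Triangular filters over rFFT bins, one per consecutive edge triple."""
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    bank = np.zeros((len(edges_hz) - 2, len(freqs)))
    for i in range(len(edges_hz) - 2):
        lo, mid, hi = edges_hz[i:i + 3]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank


def combined_bank(n_linear=12, n_inverse_mel=12, split_hz=4000.0,
                  sample_rate=16000, n_fft=512):
    """Assumed combination: linear below split_hz, inverse-Mel above it."""
    low = triangular_bank(band_edges("linear", n_linear, 0.0, split_hz),
                          n_fft, sample_rate)
    high = triangular_bank(
        band_edges("inverse_mel", n_inverse_mel, split_hz, sample_rate / 2.0),
        n_fft, sample_rate)
    return np.vstack([low, high])


def cepstrum(power_spec, bank, n_ceps=13):
    """Log filter-bank energies followed by an unnormalized DCT-II,
    as in standard MFCC extraction."""
    log_e = np.log(power_spec @ bank.T + 1e-10)
    n = bank.shape[0]
    k = np.arange(n_ceps)[:, None]
    m = np.arange(n)[None, :]
    dct = np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    return log_e @ dct.T
```

Given a power spectrogram of shape `(n_frames, n_fft // 2 + 1)`, `cepstrum(power_spec, combined_bank())` returns one feature vector per frame. In a pipeline like the one the abstract outlines, such vectors would be used to train one GMM per class (original and playback), with the decision made from the log-likelihood ratio.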