首页 | 官方网站   微博 | 高级检索  
     

针对多种处理痕迹的数字语音取证算法
引用本文:向立,严迪群,王让定,李孝文.针对多种处理痕迹的数字语音取证算法[J].计算机应用,2019,39(1):126-130.
作者姓名:向立  严迪群  王让定  李孝文
作者单位:宁波大学信息科学与工程学院,浙江宁波,315211;宁波大学信息科学与工程学院,浙江宁波,315211;宁波大学信息科学与工程学院,浙江宁波,315211;宁波大学信息科学与工程学院,浙江宁波,315211
基金项目:国家自然科学基金资助项目(U1736215,61672302);浙江省自然科学基金资助项目(LZ15F020002,LY17F020010);宁波市自然科学基金资助项目(2017A610123);宁波大学学科基金资助项目(XKXL1509,XKXL1503)。
摘    要:现有的数字语音取证研究主要集中于对单一的某种操作进行检测,无法对不相关的操作进行判断。针对该问题,提出了一种能够同时检测经过变调、低通滤波、高通滤波和加噪这四种操作的数字语音取证方法。首先,计算语音的归一化梅尔频率倒谱系数(MFCC)统计矩特征;然后通过多个二分类器对特征进行训练,并组合投票得到多分类器;最后使用该多分类器对待测语音进行分类。在TIMIT以及UME语音库上的实验结果表明,归一化MFCC统计矩特征在库内实验中均达到了97%以上的检测率,且在对MP3压缩鲁棒性测试的实验中,检测率仍能保持在96%以上。

关 键 词:语音取证  梅尔频率倒谱系数  处理痕迹  多分类器
收稿时间:2018-07-19
修稿时间:2018-08-07

Forensics algorithm of various operations for digital speech
XIANG Li,YAN Diqun,WANG Rangding,LI Xiaowen.Forensics algorithm of various operations for digital speech[J].journal of Computer Applications,2019,39(1):126-130.
Authors:XIANG Li  YAN Diqun  WANG Rangding  LI Xiaowen
Affiliation:Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo Zhejiang 315211, China
Abstract:Most existing forensic methods for digital speech aim at detecting a specific operation, which means that these methods can not identify various operations at a time. To solve the problem, a universal forensic algorithm for simultaneously detecting various operations, such as pitch modification, low-pass filtering, high-pass filtering, and noise adding was proposed. Firstly, the statistical moments of Mel-Frequency Cepstral Coefficients (MFCC) were calculated, and cepstrum mean and variance normalization were applied to the moments. Then, a multi-class classifier based on multiple two-class classifiers was constructed. Finally, the classifier was used to identify various types of speech operations. The experimental results on TIMIT and UME speech datasets show that the proposed universal features achieve detection accuracy over 97% for various speech operations. And the detection accuracy in the test of MP3 compression robustness is still above 96%.
Keywords:speech forensics                                                                                                                        Mel-Frequency Cepstral Coefficient (MFCC)                                                                                                                        operation trace                                                                                                                        multi-class classifier
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号