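CFCC feature extraction fusing a nonlinear power function and spectral subtraction
Cite this article:BAI Jing, SHI Yanyan, XUE Peiyun, GUO Qianyan. CFCC feature extraction for fusion of the power-law nonlinearity function and spectral subtraction[J]. Journal of Xidian University (Natural Science Edition), 2019, 46(1): 86-92. DOI: 10.19665/j.issn1001-2400.2019.01.014
Authors:BAI Jing  SHI Yanyan  XUE Peiyun  GUO Qianyan
Affiliation:College of Information and Computer, Taiyuan University of Technology, Taiyuan 030024, Shanxi, China
Funding:Shanxi Province Science and Technology Research (Social Development) Project (20120313013-6); Shanxi Province Youth Science and Technology Research Fund (2013021016-1)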
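Abstract:To improve speech recognition accuracy in noisy environments, an improved speech feature extraction algorithm is proposed. The algorithm uses a nonlinear power function that models the auditory characteristics of the human ear to extract a new cochlear filter cepstral coefficient, and introduces spectral subtraction at the front end of feature extraction to enhance the signal. The new feature and its first-order difference are combined into a mixed feature parameter, principal component analysis is then applied to reduce the dimension of this mixed feature, and the resulting feature is used in a speaker-independent, isolated-word, small-vocabulary speech recognition system. Experimental results show that the cochlear filter cepstral coefficient feature extracted with the nonlinear power function clearly improves recognition accuracy over the traditional cochlear filter cepstral coefficient feature; that the mixed feature parameter achieves better recognition performance than a single feature; and that the feature set obtained after principal component analysis can reach a recognition accuracy of 88.10% at a signal-to-noise ratio of 0 dB.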

Keywords:speech recognition  nonlinear power function  cochlear filter cepstral coefficients  spectral subtraction
Received:2018-06-26

CFCC feature extraction for fusion of the power-law nonlinearity function and spectral subtraction
BAI Jing,SHI Yanyan,XUE Peiyun,GUO Qianyan. CFCC feature extraction for fusion of the power-law nonlinearity function and spectral subtraction[J]. Journal of Xidian University, 2019, 46(1): 86-92. DOI: 10.19665/j.issn1001-2400.2019.01.014
Authors:BAI Jing  SHI Yanyan  XUE Peiyun  GUO Qianyan
Affiliation:College of Information and Computer, Taiyuan University of Technology, Taiyuan 030024, China
Abstract:To improve speech recognition accuracy in noisy environments, this paper presents an improved speech feature extraction algorithm. A New Cochlear Filter Cepstral Coefficient (NCFCC) is extracted using a power-law nonlinear function that models the auditory characteristics of the human ear. Spectral subtraction is introduced at the front end of feature extraction to enhance the signal, and the new feature is combined with its first-order difference to form a mixed feature parameter. Principal component analysis (PCA) is then applied to reduce the dimension of the mixed feature. The final feature is used in a speaker-independent, isolated-word, small-vocabulary speech recognition system. Experimental results show that, compared with the traditional Cochlear Filter Cepstral Coefficient (CFCC) feature, the CFCC feature extracted with the power-law nonlinear function significantly improves speech recognition accuracy; that the mixed feature parameter achieves better recognition performance than a single feature; and that the feature set obtained after PCA can reach a recognition accuracy of 88.10% at a signal-to-noise ratio (SNR) of 0 dB.
Keywords:speech recognition  power-law nonlinearity function  cochlear filter cepstral coefficients  spectral subtraction  
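
The processing order described in the abstract (spectral subtraction, power-law nonlinearity, cepstral coefficients, first-order difference, PCA) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. All function names are hypothetical, the exponent 0.1 and the spectral floor 0.01 are placeholder values not taken from the paper, the first few frames are assumed to contain noise only, and uniform pooling of frequency bins stands in for the cochlear filter bank, which the abstract does not specify.

```python
import numpy as np

def spectral_subtract(mag, noise_frames=5, floor=0.01):
    """Basic magnitude spectral subtraction on a (frames, bins) spectrogram.
    Assumption: the first `noise_frames` frames contain noise only."""
    noise = mag[:noise_frames].mean(axis=0)
    # Keep a small fraction of the original magnitude as a spectral floor.
    return np.maximum(mag - noise, floor * mag)

def pool_bands(mag, n_bands):
    """Placeholder for the cochlear filter bank: average power over
    equal-width groups of bins (requires bins divisible by n_bands)."""
    n_frames = mag.shape[0]
    return (mag ** 2).reshape(n_frames, n_bands, -1).mean(axis=2)

def power_law(band_energy, exponent=0.1):
    """Power-law compression; the exponent is an illustrative assumption."""
    return np.power(band_energy, exponent)

def dct_ii(x, n_coeffs):
    """Unnormalized DCT-II along the last axis, keeping n_coeffs coefficients."""
    n = x.shape[-1]
    k = np.arange(n_coeffs)[:, None]
    m = np.arange(n)[None, :]
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    return x @ basis.T

def delta(feat):
    """First-order difference over time (central difference, edge-padded)."""
    padded = np.pad(feat, ((1, 1), (0, 0)), mode="edge")
    return (padded[2:] - padded[:-2]) / 2.0

def pca_reduce(feat, n_components):
    """Project mean-centered features onto the leading principal axes."""
    centered = feat - feat.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T

# Synthetic stand-in for a magnitude spectrogram: 50 frames, 64 bins.
rng = np.random.default_rng(0)
mag = rng.random((50, 64)) + 1e-3

enhanced = spectral_subtract(mag)
static = dct_ii(power_law(pool_bands(enhanced, n_bands=16)), n_coeffs=12)
mixed = np.hstack([static, delta(static)])      # static + first-order difference
reduced = pca_reduce(mixed, n_components=16)    # dimension-reduced feature set
print(mixed.shape, reduced.shape)
```

The sketch only shows how the stages chain together on random data. Reproducing the reported accuracy would require the paper's actual cochlear filter bank, nonlinearity parameters, and recognizer.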