
CFCC feature extraction for fusion of the power-law nonlinearity function and spectral subtraction

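Cite this article:	BAI Jing, SHI Yanyan, XUE Peiyun, GUO Qianyan. CFCC feature extraction for fusion of the power-law nonlinearity function and spectral subtraction[J]. Journal of Xidian University (Natural Science Edition), 2019, 46(1): 86-92. DOI: 10.19665/j.issn1001-2400.2019.01.014

Authors:	BAI Jing, SHI Yanyan, XUE Peiyun, GUO Qianyan

Affiliation:	College of Information and Computer, Taiyuan University of Technology, Taiyuan 030024, Shanxi, China

Funding:	Shanxi Province Science and Technology Research (Social Development) Project (20120313013-6); Shanxi Province Youth Science and Technology Research Fund (2013021016-1)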

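Abstract:	To improve speech recognition accuracy in noisy environments, an improved speech feature extraction algorithm is proposed. The algorithm uses a nonlinear power function that models the auditory characteristics of the human ear to extract a new cochlear filter cepstral coefficient, and introduces spectral subtraction at the front end of feature extraction to enhance the signal. The new feature and its first-order difference are combined into a hybrid feature parameter, principal component analysis is then applied to reduce the dimensionality of this hybrid feature, and the resulting feature is used in a speaker-independent, isolated-word, small-vocabulary speech recognition system. Experimental results show that the cochlear filter cepstral coefficient feature extracted with the nonlinear power function clearly improves recognition accuracy compared with the traditional cochlear filter cepstral coefficient feature; that the hybrid feature parameter achieves better recognition performance than a single feature; and that the feature set obtained after principal component analysis reaches a recognition accuracy of 88.10% at a signal-to-noise ratio of 0 dB.
Keywords:	speech recognition; nonlinear power function; cochlear filter cepstral coefficients; spectral subtraction
Received:	2018-06-26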
CFCC feature extraction for fusion of the power-law nonlinearity function and spectral subtraction

BAI Jing,SHI Yanyan,XUE Peiyun,GUO Qianyan. CFCC feature extraction for fusion of the power-law nonlinearity function and spectral subtraction[J]. Journal of Xidian University, 2019, 46(1): 86-92. DOI: 10.19665/j.issn1001-2400.2019.01.014

Authors:	BAI Jing, SHI Yanyan, XUE Peiyun, GUO Qianyan

Affiliation:	College of Information and Computer, Taiyuan University of Technology, Taiyuan 030024, China

Abstract:	This paper presents an improved speech feature extraction algorithm that raises speech recognition accuracy in noisy environments. A New Cochlear Filter Cepstral Coefficient (NCFCC) is extracted using a power-law nonlinearity function that simulates the auditory characteristics of the human ear. Spectral subtraction is introduced at the front end of feature extraction to enhance the signal, and the new feature and its first-order difference are combined into a hybrid feature parameter. Principal component analysis (PCA) is then applied to reduce the dimension of the hybrid feature, and the final feature is used in a speaker-independent, isolated-word, small-vocabulary speech recognition system. Experimental results show that, compared with the traditional Cochlear Filter Cepstral Coefficient (CFCC) feature, the feature extracted with the power-law nonlinearity function significantly improves speech recognition accuracy, and that the hybrid feature parameter achieves better recognition performance than a single feature. After PCA, the feature set reaches a recognition accuracy of 88.10% at a signal-to-noise ratio (SNR) of 0 dB.

Keywords:	speech recognition; power-law nonlinearity function; cochlear filter cepstral coefficients; spectral subtraction
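
The processing chain named in the abstract (spectral subtraction, power-law compression, first-order difference, PCA) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and several parts are assumptions made only for illustration: all function names are hypothetical, the exponent 1/15 is a placeholder rather than a value taken from the paper, the cochlear filterbank itself is not implemented (squared enhanced spectra stand in for its band energies), the first-order difference uses a simple symmetric difference, and the input is random data rather than speech.

```python
import numpy as np

def spectral_subtract(mag, noise_mag, floor=0.01):
    """Power spectral subtraction with a spectral floor (one common variant).
    mag: (frames, bins) magnitudes; noise_mag: (bins,) noise estimate."""
    clean_power = mag ** 2 - noise_mag ** 2
    # The floor keeps the result positive where the noise estimate is too large.
    return np.sqrt(np.maximum(clean_power, floor * mag ** 2))

def power_law_cepstrum(band_energy, exponent=1.0 / 15.0, n_ceps=12):
    """Power-law compression followed by an unnormalized DCT-II.
    The exponent is a placeholder, not the value used in the paper."""
    compressed = np.power(band_energy, exponent)
    n_bands = band_energy.shape[1]
    k = np.arange(n_ceps)[:, None]
    n = np.arange(n_bands)[None, :]
    basis = np.cos(np.pi * k * (2 * n + 1) / (2 * n_bands))
    return compressed @ basis.T

def first_order_delta(ceps):
    """Symmetric first-order difference along time, with edge padding."""
    padded = np.pad(ceps, ((1, 1), (0, 0)), mode="edge")
    return (padded[2:] - padded[:-2]) / 2.0

def pca_reduce(features, n_components):
    """Project mean-centered features onto the leading principal axes."""
    centered = features - features.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T

# Synthetic stand-in data: 50 frames, 32 frequency bins.
rng = np.random.default_rng(0)
mag = rng.uniform(0.5, 2.0, size=(50, 32))
noise_mag = np.full(32, 0.3)

enhanced = spectral_subtract(mag, noise_mag)
band_energy = enhanced ** 2  # assumption: stands in for cochlear filterbank output
ceps = power_law_cepstrum(band_energy)
hybrid = np.hstack([ceps, first_order_delta(ceps)])  # static + delta features
reduced = pca_reduce(hybrid, n_components=16)
print(hybrid.shape, reduced.shape)  # prints (50, 24) (50, 16)
```

In this sketch the hybrid feature has 24 dimensions per frame (12 static plus 12 delta) and PCA keeps 16 of them; the dimensions used in the paper are not stated in the abstract, so these numbers are illustrative only.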
