多噪声环境下的层级语音识别模型 Hierarchical speech recognition model in multi-noise environment期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

多噪声环境下的层级语音识别模型

引用本文：	曹晶晶,许洁萍,邵聖淇.多噪声环境下的层级语音识别模型[J].计算机应用,2018,38(6):1790-1794.

作者姓名：	曹晶晶许洁萍邵聖淇

作者单位：	中国人民大学信息学院, 北京 100872

基金项目：	国家自然科学基金资助项目（61672523）。

摘要：	针对多噪声环境下的语音识别问题，提出了将环境噪声作为语音识别上下文考虑的层级语音识别模型。该模型由含噪语音分类模型和特定噪声环境下的声学模型两层组成，通过含噪语音分类模型降低训练数据与测试数据的差异，消除了特征空间研究对噪声稳定性的限制，并且克服了传统多类型训练在某些噪声环境下识别准确率低的弊端，又通过深度神经网络（DNN）进行声学模型建模，进一步增强声学模型分辨噪声的能力，从而提高模型空间语音识别的噪声鲁棒性。实验中将所提模型与多类型训练得到的基准模型进行对比，结果显示所提层级语音识别模型较该基准模型的词错率（WER）相对降低了20.3%，表明该层级语音识别模型有利于增强语音识别的噪声鲁棒性。
关键词：	语音识别噪声鲁棒性环境噪声声学模型深度神经网络
收稿时间：	2017-11-14
修稿时间：	2018-01-09
Hierarchical speech recognition model in multi-noise environment

CAO Jingjing,XU Jieping,SHAO Shengqi.Hierarchical speech recognition model in multi-noise environment[J].journal of Computer Applications,2018,38(6):1790-1794.

Authors:	CAO Jingjing XU Jieping SHAO Shengqi

Affiliation:	School of Information, Renmin University of China, Beijing 100872, China

Abstract:	Focusing on the issue of speech recognition in multi-noise environment, a new hierarchical speech recognition model considering environmental noise as the context of speech recognition was proposed. The proposed model was composed of two layers of noisy speech classification model and acoustic model under specific noise environment. The difference between training data and test data was reduced by noisy speech classification model, which eliminated the limitation of noise stability required in feature space research and solved the disadvantage of low recognition rate caused by traditional multi-type training under certain noise environment. Furthermore, a Deep Neural Network (DNN) was used for modeling of acoustic model, which could further enhance the ability of acoustic model to distinguish noise and speech, and the noise robustness of speech recognition in model space was improved. In the experiment, the proposed model was compared with the benchmark model obtained by multi-type training. The experimental results show that, the proposed hierarchical speech recognition model has relatively reduced the Word Error Rate (WER) by 20.3% compared with the traditional benchmark model. The proposed hierarchical speech recognition model is helpful to enhance the noise robustness of speech recognition.

Keywords:	speech recognition noise-robustness environmental noise acoustic model Deep Neural Network (DNN)

	点击此处可从《计算机应用》浏览原始摘要信息
	点击此处可从《计算机应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏