Intelligent speech recognition system based on self-adaption psychoacoustic model
Citation: XIONG Xiao-yan, CHEN Xu, HUANG Can-ying, CHEN Yan. Intelligent speech recognition system based on self-adaption psychoacoustic model[J]. Journal of Shenyang University of Technology, 2017, 39(6): 675-679.
Authors: XIONG Xiao-yan, CHEN Xu, HUANG Can-ying, CHEN Yan
Affiliation: School of Science and Technology, Nanchang University, Nanchang 330029, China
Funding: Science and Technology Research Project of the Jiangxi Provincial Department of Education (GJJ151504, GJJ151505); Jiangxi Province Education Reform Project (JXJG-14-28-3, JXJG-14-28-1, JXJG-14-28-6, JXJG-14-28-8)
Abstract: To address speech processing under noise such as environmental noise and channel distortion, an intelligent speech recognition system based on an adaptive psychoacoustic model is proposed, and an auditory model is established. The model integrates psychoacoustics and otoacoustic emission (OAE) into an automatic speech recognition (ASR) system. Experiments were performed on the AURORA2 database under both clean training and multi-condition training. The results show that the proposed feature extraction method significantly improves the word recognition rate, outperforming Mel-frequency cepstral coefficients (MFCC), forward masking (FM), lateral inhibition (LI), and cepstral mean and variance normalization (CMVN), and that it can effectively enhance the performance of intelligent speech recognition systems.

Keywords: Mel-frequency cepstral coefficient (MFCC); otoacoustic emission (OAE); self-adaption; psychoacoustic filter; automatic speech recognition (ASR); AURORA2 database; forward masking (FM); lateral inhibition (LI)
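
For readers unfamiliar with the CMVN baseline named in the abstract, the following is a minimal sketch of per-utterance cepstral mean and variance normalization in plain Python. It is not the authors' implementation: the function name `cmvn`, the `eps` guard, and the toy feature values are assumptions made for illustration. The idea is that a stationary channel shows up as a roughly constant offset in the cepstral domain, so subtracting the per-coefficient mean (and scaling to unit variance) reduces the channel's effect.

```python
from statistics import fmean, pstdev


def cmvn(frames, eps=1e-8):
    """Per-utterance cepstral mean and variance normalization (sketch).

    frames: list of feature vectors, one per frame, all the same length.
    Returns a new list in which each coefficient has (approximately)
    zero mean and unit variance across the frames of the utterance.
    eps is an illustrative guard against division by zero.
    """
    if not frames:
        return []
    dims = list(zip(*frames))          # transpose: one tuple per coefficient
    means = [fmean(d) for d in dims]   # mean of each coefficient over time
    stds = [pstdev(d) for d in dims]   # population std of each coefficient
    return [
        [(x - m) / (s + eps) for x, m, s in zip(frame, means, stds)]
        for frame in frames
    ]


# Toy "cepstra": 3 frames x 2 coefficients (made-up numbers, not from the paper).
utterance = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
normalized = cmvn(utterance)
```

After normalization the two coefficient tracks are identical even though the second was ten times larger, which illustrates how CMVN removes per-utterance offset and scale before recognition.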
This article is indexed in CNKI and other databases.