首页 | 官方网站   微博 | 高级检索  
     

利用韵律信息的CHMM连续数字语音识别
引用本文:张静亚,俞一彪.利用韵律信息的CHMM连续数字语音识别[J].电子工程师,2006,32(12):43-46.
作者姓名:张静亚  俞一彪
作者单位:1. 常熟理工学院物理与电子科学系,江苏省,常熟市,215500
2. 苏州大学电子信息学院,江苏省,苏州市,215021
基金项目:江苏省高校自然科学基金重点项目(04KJA51033)
摘    要:提出了一种结合韵律信息的高性能汉语连续数字语音识别算法,该识别算法基于CHMM(连续隐马尔可夫模型),采用MFCC(MEL频率倒谱系数)为主要语音特征参数,结合韵律信息进行连续数字精确分割,能够有效区分易混数字。算法采用两级识别框架来提高语音识别率,其中,第1级对连续数字分割,在此基础上进行数字语音识别,输出各候选结果,第2级在候选结果中确定易混数字对,并运用韵律信息进一步选择正确结果。实验表明,最终汉语连续数字语音识别率有很大提高。

关 键 词:语音识别  连续隐马尔可夫模型(CHMM)  韵律信息
收稿时间:2006-03-14
修稿时间:2006年3月14日

A Study of Connected Digit Speech Recognition Based CHMM with Prosodic Information
ZHANG Jingya,YU Yibiao.A Study of Connected Digit Speech Recognition Based CHMM with Prosodic Information[J].Electronic Engineer,2006,32(12):43-46.
Authors:ZHANG Jingya  YU Yibiao
Affiliation:1. Changshu Institute of Technology , Changshu 215500, China; 2. Soochow University, Suzhou 215021, China
Abstract:A new algorithm for connected digital speech recognition based on CHMM using prosodic information is proposed.Every digit is modeled by a five-state CHMM described by MFCC coefficients.With the prosodic information,the connected speech is separated precisely,and the digits which acoustic features used to confuse easily can be recognized correctly.The algorithm employs two-level scheme.In the first level,the input speech is separated into individual digital syllables, and then the syllables are recognized and will output the first two digital candidates with higher scores.In the second level,the right digit string is extracted from the candidate lattice using the prosodic information.Experiments show that the proposed algorithm can improve the connected digital speech recognition performance.
Keywords:speech recognition  CHMM  prosodic information
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号