首页 | 官方网站   微博 | 高级检索  
     

采用双谱特征的语音可懂度评价算法
引用本文:陈晓梅,王晓玮,钟波,商莹莹,杨佳燕.采用双谱特征的语音可懂度评价算法[J].声学技术,2022,41(5):678-684.
作者姓名:陈晓梅  王晓玮  钟波  商莹莹  杨佳燕
作者单位:华北电力大学电气与电子工程学院, 北京 102206;中国计量科学研究院力学与声学计量科学研究所, 北京 100029;中国医学科学院北京协和医院耳鼻喉科, 北京 100730
基金项目:国家重点研发计划“主动健康和老龄化科技应对”专项(2020YFC2005200)课题
摘    要:针对现有的语音可懂度评价方法不能有效地处理信号在多种类型的非线性失真下的变化,提出了一种基于双谱特征的语音可懂度评价(Bispectral Speech Intelligibility Metric,BSIM)算法,用三阶统计量从语音信号的谱图中提取特征。双谱可以检测语音信号中的非线性相位耦合,抑制非高斯信号中的高斯噪声,从而揭示更多隐含于信号内部的有用信息。将本方法与现有的语音可懂度指标进行了比较,结果表明,此方法可以成功地预测线性失真和非线性失真造成的语音可懂度下降,其评价结果与主观可懂度结果具有很高的相关度,对信号失真变化敏感。

关 键 词:语音可懂度  客观评价算法  高阶统计  双谱
收稿时间:2021/1/12 0:00:00
修稿时间:2021/4/11 0:00:00

Speech intelligibility evaluation algorithm using bispectral features
CHEN Xiaomei,WANG Xiaowei,ZHONG Bo,YANG Jiayan,SHANG Yingying.Speech intelligibility evaluation algorithm using bispectral features[J].Technical Acoustics,2022,41(5):678-684.
Authors:CHEN Xiaomei  WANG Xiaowei  ZHONG Bo  YANG Jiayan  SHANG Yingying
Affiliation:Department of Electrical and Electronic Engineering, North China Electric Power University, Beijing 102206, China;Department of Mechanics and Acoustics Division, National Institute of Metrology, Beijing 100029, China;Department of Otolaryngology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing 100730, China
Abstract:Aiming at the fact that the existing speech intelligibility evaluation methods cannot effectively deal with the signal changes under various types of nonlinear distortions, a bispectral speech intelligibility metric (BSIM) algorithm based on bispectral features is proposed, which uses third-order statistics to extract features from the spectrogram of speech signal. Bispectrum can detect the nonlinear phase coupling in the speech signal and suppress the Gussian noise in the non-Gussian signal, thereby can reveal more useful information hidden in the signal. This method is compared with existing speech intelligibility indicators. The results show that this method can successfully predict the degradation of speech intelligibility caused by linear distortion and nonlinear distortion. The evaluation result is highly correlated with the subjective intelligibility result and sensitive to signal distortion changes.
Keywords:speech intelligibility  objective evaluation algorithm  high-order statistics  bispectrum
点击此处可从《声学技术》浏览原始摘要信息
点击此处可从《声学技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号