首页 | 官方网站   微博 | 高级检索  
     


Multiple statistical models for soft decision in noisy speech enhancement
Authors:Joon-Hyuk Chang [Author Vitae]  Saeed Gazor [Author Vitae] [Author Vitae]  Sanjit K Mitra [Author Vitae]
Affiliation:a School of Electronic Engineering, Inha University, Incheon 402-751, Korea
b Department of Electrical and Computer Engineering, Queen's University, Kingston, Ont., Canada K7L 3N6
c School of Electrical Engineering and INMC, Seoul National University, Seoul, Kwanak 151-742, P.O. Box 34, Korea
d Department of Electrical and Computer Engineering, University of California, Santa Barbara, CA 93106, USA
Abstract:Most speech enhancement algorithms are based on the assumption that speech and noise are both Gaussian in the discrete cosine transform (DCT) domain. For further enhancement of noisy speech in the DCT domain, we consider multiple statistical distributions (i.e., Gaussian, Laplacian and Gamma) as a set of candidates to model the noise and speech. We first use the goodness-of-fit (GOF) test in order to measure how far the assumed model deviate from the actual distribution for each DCT component of noisy speech. Our evaluations illustrate that the best candidate is assigned to each frequency bin depending on the Signal-to-Noise-Ratio (SNR) and the Power Spectral Flatness Measure (PSFM). In particular, since the PSFM exhibits a strong relation with the best statistical fit we employ a simple recursive estimation of the PSFM in the model selection. The proposed speech enhancement algorithm employs a soft estimate of the speech absence probability (SAP) separately for each frequency bin according to the selected distribution. Both objective and subjective tests are performed for the evaluation of the proposed algorithms on a large speech database, for various SNR values and types of background noise. Our evaluations show that the proposed soft decision scheme based on multiple statistical modeling or the PSFM provides further speech quality enhancement compared with recent methods through a number of subjective and objective tests.
Keywords:Speech enhancement  DCT  Multiple statistical model  Gaussian  Laplacian  Gamma  GOF  PSFM  SAP  PESQ
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号