低信噪比下多参数融合的自适应语音端点检测 Adaptive Speech Endpoint Detection based on Multi-parameter Fusion in Low SNR Situation期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

低信噪比下多参数融合的自适应语音端点检测

引用本文：	雷静,何培宇,徐自励.低信噪比下多参数融合的自适应语音端点检测[J].信号处理,2020,36(8):1205-1211.

作者姓名：	雷静何培宇徐自励

作者单位：	四川大学电子信息学院

基金项目：	国家自然科学基金资助项目（61071159,U1733109）

摘要：	传统语音端点检测方法利用语音和噪声在某单一参数特征上的差异进行信号中语音起止点的切分，但不同参数在低信噪比不同噪声环境下表现不稳定，鲁棒性差。因此，本文提出了基于均匀子带谱方差，能熵比，梅尔倒谱距离，似然比四种参数相融合的语音端点检测方法。该方法能自适应地改变各参数阈值，并通过实时监测噪声段能熵比的值确定所采用的投票判决机制，从而进行语音端点判定。实验结果表明，该方法在低信噪比下较常用的端点检测方法有更高的检测正确率及鲁棒性，对语音信号后续处理工作有一定的借鉴意义。
关键词：	语音端点检测多参数融合自适应阈值投票机制
收稿时间：	2020-04-28
Adaptive Speech Endpoint Detection based on Multi-parameter Fusion in Low SNR Situation

Affiliation:	School of Electronic Information and Engineering, Sichuan University

Abstract:	Traditional speech endpoint detection methods make use of the difference between speech and noise in a single parameter to segment the start and end points of speech in the signal. However, the performance of different parameters under different noise environments with low signal-to-noise ratio is unstable and the robustness is poor. To overcome such problem, this paper proposed a speech endpoint detection method based on the fusion of four parameters: sub-band spectral variance, energy entropy ratio, MFCC cepstrum distance and likelihood ratio. This method could change the threshold of each parameter adaptively, then determined the voting mechanism by real-time detection of the energy entropy ratio of the noise segment, so as to determine the speech endpoint. Experimental results show that the proposed method has higher detection accuracy and robustness than the conventional endpoint detection methods in the case of low signal-to-noise ratio. The proposed method has certain reference significance for the follow-up processing of speech signal.

Keywords:

	点击此处可从《信号处理》浏览原始摘要信息
	点击此处可从《信号处理》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏