
A Two-stage Deep Learning Based Method for Acoustic Echo Cancellation and Speech Dereverberation
Cite this article: Luan Shuming, Cheng Longbiao, Sun Xingwei, Li Junfeng, Yan Yonghong. A Two-stage Deep Learning Based Method for Acoustic Echo Cancellation and Speech Dereverberation[J]. Signal Processing (信号处理), 2020, 36(6): 948-957.
Authors: Luan Shuming (栾书明), Cheng Longbiao (程龙彪), Sun Xingwei (孙兴伟), Li Junfeng (李军锋), Yan Yonghong (颜永红)
Affiliation: Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences; University of Chinese Academy of Sciences
Funding: National Key Research and Development Program of China (2017YFB1002803); National Natural Science Foundation of China (11722437; 11674352)
Abstract: In modern telecommunication systems, echo and reverberation often degrade the quality and intelligibility of transmitted speech. To overcome both at once, we propose a two-stage, jointly trained system based on deep learning in which echo cancellation and speech dereverberation are performed sequentially. The first stage uses a model based on the ideal ratio mask (IRM) to cancel the acoustic echo, which is uncorrelated with the target signal. The second stage uses a spectrum mapping model combined with a "hidden mask" to remove the reverberation, which is strongly correlated with the target signal. Finally, the two stages are trained jointly to obtain better system performance. A series of experiments in different acoustic environments shows that the proposed system significantly suppresses echo and reverberation and achieves better speech intelligibility and quality than other methods.
Keywords: acoustic echo cancellation; dereverberation; bidirectional long short-term memory; ideal ratio mask; joint training; spectrum mapping
Received: 2020-03-31
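
The abstract names the ideal ratio mask (IRM) as the basis of the echo cancellation stage but does not give its formula. The sketch below illustrates one commonly used definition, the square root of the target-to-mixture power ratio in each time-frequency bin; whether the paper uses exactly this form is an assumption. The function names, array shapes, and random test spectra are hypothetical and chosen only for illustration. In the actual system the mask would be estimated by a trained network from the microphone and far-end signals; here it is computed from known components, as one would do when building training targets.

```python
import numpy as np

def ideal_ratio_mask(target_mag, interference_mag, eps=1e-8):
    """IRM per time-frequency bin (assumed square-root power-ratio form).

    Both inputs are magnitude spectrograms of shape (freq_bins, frames).
    The returned mask lies in [0, 1].
    """
    target_pow = target_mag ** 2
    interference_pow = interference_mag ** 2
    return np.sqrt(target_pow / (target_pow + interference_pow + eps))

def apply_mask(mixture_mag, mask):
    """Scale each time-frequency bin of the mixture by the mask."""
    return mixture_mag * mask

# Hypothetical magnitude spectra standing in for real STFT magnitudes.
rng = np.random.default_rng(0)
near_end_mag = np.abs(rng.normal(size=(257, 100)))  # target (near-end) speech
echo_mag = np.abs(rng.normal(size=(257, 100)))      # acoustic echo

# Assumption: target and echo are uncorrelated, so their powers add.
mixture_mag = np.sqrt(near_end_mag ** 2 + echo_mag ** 2)

mask = ideal_ratio_mask(near_end_mag, echo_mag)
estimate = apply_mask(mixture_mag, mask)
```

Under the additive-power assumption the masked mixture recovers the target magnitude almost exactly, which is why a ratio mask suits interference that is uncorrelated with the target. Reverberation is strongly correlated with the target, which is consistent with the abstract's choice of a different model (spectrum mapping) for the second stage.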
This article is indexed in VIP (维普) and other databases.