首页 | 官方网站   微博 | 高级检索  
     

基于瞬时相位差和深度学习的丢包隐藏方法
引用本文:黄晋维,鲍长春.基于瞬时相位差和深度学习的丢包隐藏方法[J].信号处理,2021,37(10):1791-1798.
作者姓名:黄晋维  鲍长春
作者单位:北京工业大学信息学部语音与音频信号处理研究室
基金项目:国家自然科学基金(61831019)
摘    要:实时IP 语音通信在数据包会丢失的情况下,语音质量会受到严重影响。为了恢复传输过程中丢失的语音信息,本文提出了一种基于瞬时相位差(Instantaneous Phase Deviation, IPD)和深度神经网络(Deep Neural Network, DNN)的丢包隐藏 (Packet Loss Concealment, PLC)方法。在训练阶段,将语音的对数功率谱(Log Power Spectrum, LPS)和IPD作为训练DNN的输入特征,以学习从接收包到丢失包的映射关系;在重构阶段,将丢包前接收到的语音包送入训练好的DNN中,恢复出丢失包的语音。实验结果表明,在不同丢包率下,所提方法的性能优于传统的基于LPS和DNN的PLC方法。 

关 键 词:深度神经网络    相位特征    丢包隐藏
收稿时间:2021-08-11

Packet loss concealment based on instantaneous phase deviation and deep neural network
Affiliation:Speech and Audio Signal Processing Lab, Faculty of Information Technology, Beijing University of Technology
Abstract:In the case of packet loss for real-time speech communication, the speech quality will be seriously affected. In order to recover the lost speech information during transmission, this paper proposes a packet loss concealment (Packet Loss Concealment, PLC) based on instantaneous phase deviation (Instantaneous Phase Deviation, IPD) and deep neural network (Deep Neural Network, DNN). In the training stage, the log power spectrum (Log Power Spectrum, LPS) and IPD of the speech are used as the input feature of the DNN training for learning the mapping relationship from the received packets to the lost packets. In the reconstruction stage, the received packets are sent to the well trained DNN for recovering the lost packet. Experimental results prove that under different packet loss rates, the proposed algorithm can gain better performance than conventional LPS+DNN-based PLC method. 
Keywords:
点击此处可从《信号处理》浏览原始摘要信息
点击此处可从《信号处理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号