A Highly Efficient Voice Conversion Method Based on Codebook Mapping

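Cite this article:	WANG Zhi-wei, XU Ning, LIU Xiao-feng. A Highly Efficient Voice Conversion Method Based on Codebook Mapping [J]. Microprocessors (微处理机), 2014, 0(1): 65-69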

Authors:	WANG Zhi-wei, XU Ning, LIU Xiao-feng

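Author affiliations:	[1] School of IoT Engineering, Hohai University, Changzhou 213022 [2] Hohai University - Aldebaran Robotics (France) Laboratory for Cognition and Robotics, Changzhou 213022 [3] Changzhou Key Laboratory of Robotics and Intelligent Technology, Changzhou 213022 [4] Key Laboratory of Broadband Wireless Communication and Sensor Network Technology, Ministry of Education, Nanjing 210003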

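Funding:	National Natural Science Foundation of China (60905060); Fundamental Research Funds for the Central Universities (2011811114, 2012807314, 2012804014); Open Fund of the Key Laboratory of the Ministry of Education (NYKL201305)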

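Abstract:	To make robots sound more natural in human-machine voice interaction, one feasible approach is to use voice conversion to alter the individual characteristics of the source speech (a mechanical voice) and turn it into the natural voice of a target speaker. However, the current mainstream voice conversion methods are ill-suited to embedded robots, which have strict real-time requirements and small kernels. This paper introduces an efficient, improved codebook conversion method. The method first derives weighting coefficients by matching the relative distances of line spectral frequency (LSF) parameters and uses them to predict and reconstruct the codeword; it then applies a bandwidth correction to the predicted codeword to overcome the spectral shift problem. Experimental results show that, at comparable conversion performance, the method cuts running time by about 75% relative to the traditional method.
Keywords:	voice conversion; embedded systems; harmonic stochastic model; segmental codebook; human-machine interaction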
A Highly Efficient Voice Conversion Method Based on Codebook Mapping

Affiliation:	WANG Zhi-wei, XU Ning, LIU Xiao-feng (1. School of IoT Engineering, Hohai University, Changzhou 213022, China; 2. Hohai University - Aldebaran Robotics Laboratory for Cognition and Robotics, Changzhou 213022, China; 3. Changzhou Key Laboratory of Robotics and Intelligent Technology, Changzhou 213022, China; 4. Ministry of Education Key Lab of Broadband Wireless Communication and Sensor Network Technology, Nanjing 210003, China)

Abstract:	In human-robot interaction, it is desirable to have synthetic voices that sound natural and can be personalized for each user. One solution is voice conversion, in which the characteristics of a mechanical source voice are changed to produce a sound corresponding to a given natural target voice. However, the popular voice conversion methods are computationally intensive and not suitable for a robot built on a small embedded kernel. This paper introduces a highly efficient, improved segmental codebook conversion method. First, it calculates weighting coefficients by matching the relative distances of the Line Spectral Frequency (LSF) parameters, and uses them to predict and reconstruct the code word. Second, it applies a bandwidth correction to the predicted code word to solve the problem of spectrum shift. The test results show that the method is approximately 75% faster than the traditional Gaussian Mixture Model (GMM) method at comparable conversion performance.

Keywords:	Voice Conversion; Embedded Systems; Harmonic Stochastic Model; Segmental Codebook; Man-machine Interaction
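
The first step described in the abstract (weighting coefficients derived from LSF distances, then a reconstructed code word) can be illustrated with a minimal Python sketch. The abstract does not give the paper's actual formulas, so everything concrete here is an assumption made for illustration: the Euclidean distance, the softmax-style weighting with a hypothetical sharpness parameter `gamma`, the function names, and the toy codebooks. The bandwidth correction step is not shown, because the abstract gives no detail on how it is computed.

```python
import math

def mapping_weights(lsf, source_codebook, gamma=50.0):
    """Turn distances between an input LSF vector and each source code word
    into weights that sum to 1 (closer code words get larger weights).
    The softmax form and `gamma` are assumptions, not the paper's formula."""
    dists = [
        math.sqrt(sum((a - b) ** 2 for a, b in zip(lsf, cw)))
        for cw in source_codebook
    ]
    d_min = min(dists)  # subtract the minimum for numerical stability
    raw = [math.exp(-gamma * (d - d_min)) for d in dists]
    total = sum(raw)
    return [r / total for r in raw]

def convert_frame(lsf, source_codebook, target_codebook, gamma=50.0):
    """Predict a target LSF vector as the weighted sum of the target code
    words paired (by index) with the source code words."""
    weights = mapping_weights(lsf, source_codebook, gamma)
    order = len(target_codebook[0])
    return [
        sum(w * cw[k] for w, cw in zip(weights, target_codebook))
        for k in range(order)
    ]

# Toy paired codebooks: two code words of four LSFs each (radians, ascending).
SOURCE_CODEBOOK = [[0.3, 0.8, 1.5, 2.4], [0.5, 1.1, 1.9, 2.7]]
TARGET_CODEBOOK = [[0.4, 0.9, 1.7, 2.5], [0.6, 1.3, 2.1, 2.9]]

if __name__ == "__main__":
    frame = [0.35, 0.85, 1.6, 2.45]  # hypothetical source-speaker LSF frame
    print(convert_frame(frame, SOURCE_CODEBOOK, TARGET_CODEBOOK))
```

Because the weights are non-negative and sum to 1, the output is a convex combination of ascending target code words and therefore stays in ascending order, which a valid LSF vector requires. The per-frame cost is one distance computation per code word, which is consistent with the low-complexity motivation stated in the abstract.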
This article is indexed in CNKI, VIP, and other databases.
