首页 | 官方网站   微博 | 高级检索  
     

浊声基频轮廓对汉语合成自然度提高的分析与综合
引用本文:田岚,陆小珊,杨霓清.浊声基频轮廓对汉语合成自然度提高的分析与综合[J].山东大学学报(工学版),2003,33(4):413-416.
作者姓名:田岚  陆小珊  杨霓清
作者单位:山东大学,信息科学与工程学院,山东,济南,250100
摘    要:连续语音浊声基频轮廓是影响合成语音自然度和表现力的一个重要因素 .本文采用序位调值分类统计法 ,对汉语连续语音音调动态特性作了系统分析 ,提出一种用于分析和分层产生汉语连续语音基频参数的数学模型 .模型充分考虑了汉语发音特点 ,归纳了语言表达中音调变化的各种可能 ,并相应设置了控制调整参量 ,相对完整而实用地表示了语言知识和基频参数之间的对应关系 .对一些典型自然语句进行了仿真实验 ,结果表明 ,该模型控制产生的合成基频轮廓和测试目标可达到满意的吻合 ,对有效改善TTS系统语音合成自然度作用明显 .

关 键 词:文语转换(Text-to-Speech)  韵律特征  基频  语音自然度
文章编号:1672-3961(2003)04-0413-04
修稿时间:2002年4月11日

Analysis and synthesis of continuous voice pitch contour for improving Chinese synthetic speech naturalness
TIAN Lan,LU Xiao shan,YANG Ni qing.Analysis and synthesis of continuous voice pitch contour for improving Chinese synthetic speech naturalness[J].Journal of Shandong University of Technology,2003,33(4):413-416.
Authors:TIAN Lan  LU Xiao shan  YANG Ni qing
Abstract:The continuous speech Fo contour plays key role for the naturalness and emotion in text to speech conversion system. Based on statistics method and clustering at the sequence location of each syllable, we systematically analyzed a large number of Chinese continuous speech pitch contours. As a consequence, a hierarchical prosody analysis and synthesis model is introduced, in which Mandarin characteristics are fully taken into account, introducing all tone patterns and phrase dynamic trend, and setting relative control command parameters and sandhi rules. The model quantitatively describes the relationship between prosody features and Chinese multi layer linguistic information. The emulating tests for some typical natural utterances show that synthetic Fo contours have good correspondences with the objective samples and that the model is expected to improve the naturalness of TTS synthetic speech evidently.
Keywords:text  to  speech(TTS)  prosody features  pitch  speech naturalness
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号