首页 | 官方网站   微博 | 高级检索  
     

用决策树指导TBL进行多音字消歧
引用本文:刘方舟,周游.用决策树指导TBL进行多音字消歧[J].计算机工程与应用,2011,47(12):137-140.
作者姓名:刘方舟  周游
作者单位:1. 湖南师范大学,数学与计算机科学学院,长沙,410081
2. 湖南财政经济学院,应用数学系,长沙,410205
基金项目:湖南省科技计划项目,湖南省教育厅科研项目
摘    要:多音字消歧是普通话语音合成系统中字音转换模块的核心问题。选择了常见易错的33个多音字和24个多音词作为研究对象,构建了一个平均每个多音字(词)5000句的语料库,并且提出了一种结合决策树和基于转换的错误驱动的学习(Transformation-Basederror-driven Learning,TBL)的混合算法。该方法根据决策树的指导,自动生成TBL算法的模板,避免了手工总结模板这一费时费力的过程。实验结果表明,该方法生成的模板与手工模板性能相当,其平均准确率达90.36%,明显优于决策树。

关 键 词:多音字消歧  字音转换  决策树  基于转换的错误驱动的学习(TBL)
修稿时间: 

Polyphone disambiguation based on tree-guided TBL
LIU Fangzhou,ZHOU You.Polyphone disambiguation based on tree-guided TBL[J].Computer Engineering and Applications,2011,47(12):137-140.
Authors:LIU Fangzhou  ZHOU You
Affiliation:1.College of Mathematics and Computer Science,Hunan Normal University,Changsha 410081,China 2.Department of Applied Mathematics,Hunan University of Finance and Economics,Changsha 410205,China
Abstract:Polyphone disambiguation is the core issue of the grapheme-to-phoneme conversion in Mandarin Text-To-Speech ('ITS) system.This paper selects 33 key polyphones and 24 key polyphonic words which are most ambiguous and frequently used as study objects,and builds a polyphone corpus of 5 000 sentences per polyphone on average.Furthermore,a hybrid algorithm called Tree-Guided Transformation-Based Leaming(TGTBL),which combines decision tree with Transformation-Based error-driven Leaming(TBL),is proposed to resolve the polyphonic ambiguity.It automatically generates TBL templates,thereby avoiding manually summarizing templates, which is time-consuming and laborious in conventional TBL.Results of comparative experiments show that, for the task of polyphone disambiguation, templates automatically generated by decision tree achieve comparable performance to manually summarized templates,and the average precision of TGTBL reaches 90.36%,siguificantly higher than that of decision tree.
Keywords:polyphone disambiguation  grapheme-to-phoneme  decision tree  Transformation-Based error-driven Leaming(TBL)
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号