首页 | 官方网站   微博 | 高级检索  
     

基于词典和句长及位置的双语对齐方法的改进
引用本文:李文刚,周杰,杨保群.基于词典和句长及位置的双语对齐方法的改进[J].现代电子技术,2011,34(14):25-27.
作者姓名:李文刚  周杰  杨保群
作者单位:中国航空计算技术研究所,陕西,西安,710068
摘    要:基于词典和句子的长度和位置信息的双语句子对齐方法在解决真实双语文本对齐问题时具有一定的普适性。在分析该方法的基础上,提出了在解决某一指定领域内的维汉互译文本时,对基于长度和位置信息的双语句子对齐方法的改进,在此方法引入维语与汉语句子长度比的期望值,能够使数据更平滑,更有效地解决了维汉互译文本句子对齐的问题。

关 键 词:句子对齐  期望值  双语语料库  锚点  长度和位置  词典

Improvement of Bilingual Sentence Alignment Method Based on Sentence Length and Location Information with Bidirectional Dictionary
LI Wen-gang,ZHOU Jie,YANG Bao-qun.Improvement of Bilingual Sentence Alignment Method Based on Sentence Length and Location Information with Bidirectional Dictionary[J].Modern Electronic Technique,2011,34(14):25-27.
Authors:LI Wen-gang  ZHOU Jie  YANG Bao-qun
Affiliation:LI Wen-gang,ZHOU Jie,YANG Bao-qun(Aeronautical Computing Technique Research Institute,Xi'an 710068,China)
Abstract:It is useful to solve the problem of real bilingual texts by using the method based on sentence pair's length and location information with a dictionary.An optimized algorithm is proposed by introducing the expectation of the length ratio of the sentence between Uigur and Chinese in a specified area.It makes the data distribution smoothly,which can efficiently solve the problem of real bilingual texts aligning sentence between Uigur and Chinese.
Keywords:sentence alignment  expectation  bilingual corpus  anchors  length and location  dictionary  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号