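Syntax-based reordering model for phrasal statistical machine translation
Cite this article: XUE Yong-zeng, LI Sheng, ZHAO Tie-jun, YANG Mu-yun. Syntax-based reordering model for phrasal statistical machine translation[J]. Journal on Communications, 2008, 29(1): 7-14.
Authors: XUE Yong-zeng  LI Sheng  ZHAO Tie-jun  YANG Mu-yun
Affiliation: MOE-Microsoft Key Laboratory of Natural Language Processing and Speech, Harbin Institute of Technology, Harbin, Heilongjiang 150001
Funding: National High Technology Research and Development Program of China (863 Program)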
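Abstract: To handle long-distance reordering in statistical machine translation, a syntactic reordering model is proposed on top of the phrase-based statistical translation model. The model splits the parse tree structure according to the phrase segmentation, which avoids inconsistency between phrases and syntactic structures. The reordering order of each partial structure of the parse tree is determined from the phrase alignment and the word alignment within phrases. The reordering probability of a sub-structure is computed from the reordering probabilities at its nodes and is used as a feature function of the log-linear model. In experiments, the model achieves a clearly higher BLEU score than the classic phrase-based statistical translation model. The results indicate that the syntactic reordering model is effective for phrase-based statistical machine translation and combines syntactic knowledge with the phrase translation process well.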

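Keywords: artificial intelligence  statistical translation model  syntactic reordering  phrase  statistical machine translation  syntactic knowledge  log-linear model  translation process  BLEU score  feature function  sub-structure  probability  node  parse tree  word alignment  phrase alignment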
Article ID: 1000-436X(2008)01-0007-08
Received: 2007-04-13
Revised: 2007-10-20

Syntax-based reordering model for phrasal statistical machine translation
XUE Yong-zeng, LI Sheng, ZHAO Tie-jun, YANG Mu-yun. Syntax-based reordering model for phrasal statistical machine translation[J]. Journal on Communications, 2008, 29(1): 7-14.
Authors: XUE Yong-zeng  LI Sheng  ZHAO Tie-jun  YANG Mu-yun
Abstract: To deal with long-distance reordering, a linguistically motivated, syntax-based reordering model is presented for phrasal statistical machine translation. In this model, the syntactic structure is decomposed according to the phrase segmentation to avoid inconsistency between phrases and syntax. The reordering sequence of the sub-structures of a parse tree is decided by the word and phrase alignments. The reordering probability of a sub-structure is calculated from the reordering probabilities of its inside nodes and is defined as a feature function of the log-linear statistical translation model. Experimental results show that the BLEU scores of the translation results are markedly improved compared with a conventional phrase-based statistical model. Introducing linguistic syntax is therefore effective for phrase reordering, and the presented reordering model is able to incorporate syntax effectively into the phrase translation process.
Keywords: artificial intelligence  statistical translation model  syntax-based reordering  phrase
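
The abstract says that a sub-structure's reordering probability is calculated from the reordering probabilities of its inside nodes and then used as a feature function of the log-linear model. The following is a minimal sketch of that idea, not the authors' implementation. It assumes the node probabilities are combined as a product (the abstract does not state the exact formula), and all names, feature labels, weights, and numbers (`substructure_reorder_logprob`, `loglinear_score`, `"syntax_reordering"`, and so on) are hypothetical and chosen only for illustration.

```python
import math


def substructure_reorder_logprob(node_probs):
    """Log reordering probability of one sub-structure.

    ASSUMPTION: the sub-structure probability is the product of the
    reordering probabilities of its inside nodes, so its log is a sum.
    """
    return sum(math.log(p) for p in node_probs)


def loglinear_score(features, weights):
    """Standard log-linear score: weighted sum of feature function values."""
    return sum(weights[name] * value for name, value in features.items())


# Hypothetical reordering probabilities for the two inside nodes
# of one sub-structure.
node_probs = [0.8, 0.5]

# Hypothetical feature values for one translation hypothesis.
features = {
    "translation_model": math.log(0.2),
    "language_model": math.log(0.1),
    "syntax_reordering": substructure_reorder_logprob(node_probs),
}

# Hypothetical feature weights; in practice these would be tuned.
weights = {
    "translation_model": 1.0,
    "language_model": 1.0,
    "syntax_reordering": 0.5,
}

score = loglinear_score(features, weights)
```

Working in log space keeps the reordering feature additive, so it can sit alongside the other features of a log-linear model without special handling.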
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).