首页 | 官方网站   微博 | 高级检索  
     

基于Transformer增强架构的中文语法纠错方法
引用本文:王辰成,杨麟儿,王莹莹,杜永萍,杨尔弘.基于Transformer增强架构的中文语法纠错方法[J].中文信息学报,2020,34(6):106-114.
作者姓名:王辰成  杨麟儿  王莹莹  杜永萍  杨尔弘
作者单位:1.北京工业大学 信息学部,北京 100124;
2.北京语言大学 语言资源高精尖创新中心,北京 100083;
3.北京语言大学 信息科学学院,北京 100083
基金项目:北京语言大学语言资源高精尖创新中心项目(TYZ19005);国家语委信息化项目(ZDI135-105,YB135-89)
摘    要:语法纠错任务是自然语言处理领域的一项重要任务,近年来受到了学术界广泛关注。该任务旨在自动识别并纠正文本中所包含的语法、拼写以及语序错误等。该文将语法纠错任务看作是翻译任务,即将带有错误表达的文本翻译成正确的文本,采用基于多头注意力机制的Transformer模型作为纠错模型,并提出了一种动态残差结构,动态结合不同神经模块的输出来增强模型捕获语义信息的能力。受限于目前训练语料不足的情况,该文提出了一种数据增强方法,通过对单语语料的腐化从而生成更多的纠错数据,进一步提高模型的性能。实验结果表明,该文所提出的基于动态残差的模型增强以及腐化语料的数据增强方法对纠错性能有着较大的提升,在NLPCC 2018中文语法纠错共享评测数据上达到了最优性能。

关 键 词:语法纠错  多头注意力  动态残差结构  数据增强  

Chinese Grammatical Error Correction Method Based on Transformer Enhanced Architecture
WANG Chencheng,YANG Liner,WANG Yingying,DU Yongping,YANG Erhong.Chinese Grammatical Error Correction Method Based on Transformer Enhanced Architecture[J].Journal of Chinese Information Processing,2020,34(6):106-114.
Authors:WANG Chencheng  YANG Liner  WANG Yingying  DU Yongping  YANG Erhong
Affiliation:1.Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China;
2.Beijing Advanced Innovation Center for Language Resources, Beijing Language and Culture University, Beijing 100083, China;
3.School of Information Science, Beijing Language and Culture University, Beijing 100083, China
Abstract:Grammatical error correction is an important task in the field of natural language processing, which has attracted wide attention in recent years. This paper regards grammatical error correction task as a translation task to translate the wrong texts into the right ones. We use the transformer model with multi-head attention mechanism as framework, and propose a dynamic residual structure to combine the outputs of different neural blocks dynamically to better capture semantic information. Due to the lack of training corpus, we propose a data augmentation method to generate the parallel data by corrupting a monolingual corpus. The experimental results show that the proposed method based on dynamic residuals and data augmentation has significantly improved the performance of error correction, achieving the best performance on NLPCC 2018 Chinese grammatical error correction task.
Keywords:grammatical error correction  multi-head attention  dynamic residual structure  data augmentation  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号