首页 | 官方网站   微博 | 高级检索  
     

基于深度学习和迁移学习的领域自适应中文分词
引用本文:成于思,施云涛.基于深度学习和迁移学习的领域自适应中文分词[J].中文信息学报,2019,33(9):9.
作者姓名:成于思  施云涛
作者单位:1.东南大学 土木工程学院,江苏 南京 210096;
2.中国移动通信集团 南京分公司网络部,江苏 南京 210019
基金项目:国家自然科学基金(71601047);中国博士后科学基金(2015M581706)
摘    要:为了提高专业领域中文分词性能,以及弥补专业领域大规模标注语料难以获取的不足,该文提出基于深度学习以及迁移学习的领域自适应分词方法。首先,构建包含词典特征的基于深度学习的双向长短期记忆条件随机场(BI-LSTM-CRF)分词模型,在通用领域分词语料上训练得到模型参数;接着,以建设工程法律领域文本作为小规模分词训练语料,对通用领域语料的BI-LSTM-CRF分词模型进行参数微调,同时在模型的词典特征中加入领域词典。实验结果表明,迁移学习减少领域分词模型的迭代次数,同时,与通用领域的BI-LSTM-CRF模型相比,该文提出的分词方法在工程法律领域的分词结果F1值提高了7.02%,与预测时加入领域词典的BI-LSTM-CRF模型相比,分词结果的F1值提高了4.22%。该文提出的分词模型可以减少分词的领域训练语料的标注,同时实现分词模型跨领域的迁移。

关 键 词:深度学习  迁移学习  领域分词    工程法律  

Domain Adaption of Chinese Word Segmentation Based onDeep Learning and Transfer Learning
CHENG Yusi,SHI Yuntao.Domain Adaption of Chinese Word Segmentation Based onDeep Learning and Transfer Learning[J].Journal of Chinese Information Processing,2019,33(9):9.
Authors:CHENG Yusi  SHI Yuntao
Affiliation:1.School of Civil Engineering, Southeast University, Nanjing, Jiangsu 210096, China;
2.China Mobile Communications Group, Nanjing Branch Network Department, Nanjing, Jiangsu 210019, China
Abstract:To improve the performance of Chinese word segmentation on specific domain, a domain adaption method of word segmentation is proposed based on deep learning and transfer learning. Firstly, a deep learning neural network of bidirectional long short-term memory CRF (BI-LSTM-CRF) model including a dictionary feature is constructed for Chinese word segmentation and trained on the general field corpus to obtain the model parameters. Secondly, the parameters of BI-LSTM-CRF model trained in a common domain corpus are fine-tuned using a small size of training corpus in construction law domain. The domain dictionary information is added to the dictionary feature. The experimental results show that transfer learning decreases the epochs for optimization. Compared with the BI-LSTM-CRF model trained in common domain, the proposed model increases the F1 by 7.02% in construction law domain. Compared with the BI-LSTM-CRF model using a domain dictionary in prediction process, the proposed model increases the F1 by 4.22%.
Keywords:deep learning  transfer learning  domain word segmentation  construction law  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号