首页 | 官方网站   微博 | 高级检索  
     

基于对抗训练的中文电子病历命名实体识别
引用本文:孔令巍,朱艳辉,张 旭,欧阳康,黄雅淋,金书川,沈加锐.基于对抗训练的中文电子病历命名实体识别[J].湖南工业大学学报,2022,36(3):36-43.
作者姓名:孔令巍  朱艳辉  张 旭  欧阳康  黄雅淋  金书川  沈加锐
作者单位:湖南工业大学 计算机学院 湖南省智能信息感知及处理技术重点实验室
基金项目:湖南省自然科学基金资助项目(2020JJ6089);湖南省教育厅科研基金资助重点项目(19A133)
摘    要:为提高传统命名实体识别模型在中文电子病历上的准确性,提出一种在基线模型B E RT-BiLSTM-CRF中加入对抗训练的方法,该方法在词嵌入层添加扰动因子从而生成对抗样本,并利用对抗样本进行迭代训练,从而优化模型参数.CCKS2021评测数据集实验结果表明,加入FGM和PGD两个对抗训练模型后,其精准率、召回率以及F1...

关 键 词:中文电子病历  命名实体识别  对抗训练  BERT  BiLSTM  CRF  FGM  PGD
收稿时间:2021/12/20 0:00:00

Named Entity Recognition of Chinese Electronic Medical Records Based on Adversarial Training
KONG Lingwei,ZHU Yanhui,ZHANG Xu,OUYANG Kang,HUANG Yalin,JIN Shuchuan,SHEN Jiarui.Named Entity Recognition of Chinese Electronic Medical Records Based on Adversarial Training[J].Journal of Hnnnan University of Technology,2022,36(3):36-43.
Authors:KONG Lingwei  ZHU Yanhui  ZHANG Xu  OUYANG Kang  HUANG Yalin  JIN Shuchuan  SHEN Jiarui
Abstract:In view of an improvement of the accuracy of the traditional named entity recognition model in Chinese electronic medical records, a method has thus been proposed with adversarial training added to the baseline model BERT-BILSTM-CRF. By adopting the proposed method, disturbance factors are added to the word embedding layer for the generation of adversarial samples, which will be used for an iterative training to optimize the model parameters. The experimental results of CCKS2021 evaluation data set show that the accuracy rate, recall rate and F1 value are improved compared with the baseline model with FGM and PGD confrontation training models added. Based on comparative experiments, it is verified that adding confrontation training can improve the prediction ability and robustness of the model.
Keywords:
点击此处可从《湖南工业大学学报》浏览原始摘要信息
点击此处可从《湖南工业大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号