首页 | 官方网站   微博 | 高级检索  
     

基于深度学习和语法规约的需求文档命名实体识别
引用本文:许梦笛,王金华.基于深度学习和语法规约的需求文档命名实体识别[J].计算机与现代化,2021,0(1):105-110.
作者姓名:许梦笛  王金华
作者单位:中国电子科技集团公司第三十二研究所,上海 201808;中国电子科技集团公司第三十二研究所,上海 201808
摘    要:命名实体识别是自然语言处理中的一个关键。在需求文档中存在过长的实体:虚功能,使得普适的传统命名实体识别方法无法有效地识别得到完整的实体。本文针对需求文档实体识别模型进行深入研究,引入深度学习方法,提出基于深度残差网络(ResNet)的CNER方法与基于规则的方法相结合,进行针对中文需求文档的分词。本文的命名实体识别模型是一种编码-解码模型,使用带有注意力机制的双向长短期记忆网络(BiLSTM with attention)进行编码,得到分词后文本的上下文特征和句式特征,使用条件随机场(CRF)方法进行解码,再结合语法规约的干预进行需求文档实体识别。实验表明,所提方法在需求文档领域识别效果优于普适的传统方法。

关 键 词:命名实体识别  CNER  深度残差网络  双向长短期记忆网络  条件随机场  语法规约
收稿时间:2021-01-29

Requirements Document Named Entity RecognitionBased on Deep Learning and Grammatical Regulations
XU Meng-di,WANG Jin-hua.Requirements Document Named Entity RecognitionBased on Deep Learning and Grammatical Regulations[J].Computer and Modernization,2021,0(1):105-110.
Authors:XU Meng-di  WANG Jin-hua
Abstract:Named entity recognition is particularly critical in natural language processing. There are overlong entities in the requirements document: virtual function, which makes it hard for pervasive traditional named entity recognition method to recognize entire entity. This paper conducts an in-depth research on the entity recognition model of requirements documents, introduces CNER method, which is based on Deep Residual Network (ResNet), to combine with the method based on grammatical regulations to perform word segmentation of Chinese requirements documents. This paper’s NER model is an encoder-decoder model, applies Bidirectional Long Short-Term Memory network (BiLSTM with attention) to encode, which obtains the context features and sentence pattern features of the text after word segmentation, employs conditional random field (CRF) method to decode, then identifies the requirements document entities with the intervention of grammatical regulations as a combination. Experiments show that the proposed method has better recognition effect than the pervasive traditional methods.
Keywords:named entity recognition  CNER  ResNet  BiLSTM  CRF  grammatical regulations  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机与现代化》浏览原始摘要信息
点击此处可从《计算机与现代化》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号