首页 | 官方网站   微博 | 高级检索  
     

一种基于部件CNN的网络安全命名实体识别方法
引用本文:魏笑,秦永彬,陈艳平.一种基于部件CNN的网络安全命名实体识别方法[J].计算机与数字工程,2020,48(1):106-111.
作者姓名:魏笑  秦永彬  陈艳平
作者单位:贵州大学计算机科学与技术学院 贵阳 550025;贵州大学计算机科学与技术学院 贵阳 550025;贵州大学贵州省公共大数据重点实验室 贵阳 550025
基金项目:国家自然科学基金;贵州省自然科学基金;贵州省重大应用基础研究项目;贵州省科技重大专项
摘    要:基于知识图谱的网络安全动态预警方法,能够主动感知和应对网络安全攻击,增强感知的实时性和精准性。然而,在构建网络安全知识图谱的实体抽取过程中,传统的命名实体识别工具和方法无法识别网络安全领域中的特定类别实体,文本中的未登录和中英文混合的网络安全实体也难以被准确识别。网络安全文本中的网络安全命名实体存在中英文混合、单词缩写等问题,仅基于字的命名实体识别方法难以充分表征字或词的语义信息。因此,论文考虑中英文更细粒度的部件语义捕捉字或词的语义特征,提出一种基于部件CNN的网络安全命名实体识别方法(C C-NS-NER),利用部件CNN抽取词语部件特征中的关键语义特征,丰富字词级别的语义信息,并引入BiLSTM-CRF确保抽取字向量和部件特征中的抽象信息,同时获取标签之间的关联信息,识别文本中的网络安全命名实体。在人工标注的网络安全数据集上的实验结果表明,该方法相较于传统模型,能有效获取字或词的部件语义信息,显著提高网络安全命名实体识别的效果。

关 键 词:网络安全  命名实体识别  卷积神经网络  双向长短期神经网络  条件随机场

A Network Security Named Entity Recognition Method Based on the Component CNN
WEI Xiao,QIN Yongbin,CHEN Yanping.A Network Security Named Entity Recognition Method Based on the Component CNN[J].Computer and Digital Engineering,2020,48(1):106-111.
Authors:WEI Xiao  QIN Yongbin  CHEN Yanping
Affiliation:(College of Computer Science and Technology,Guizhou University,Guiyang 550025;Guizhou Provincial Key Laboratory of Public Big Data,Guizhou University,Guiyang 550025)
Abstract:A network security dynamic early warning method based on knowledge graph can actively detect and respond to net work security attacks,and enhance the real-time and accuracy of perception.However,in the process of constructing the entity ex traction of the network security knowledge graph,the traditional named entity identification tools and methods cannot recognize spe cific categories of entities in the network security domain,and the network security entities that are not logged in and mixed in Chi nese and English are difficult to be accurately identified.The network security named entities in the network security text have prob lems such as Chinese-English mixture and word abbreviation.It is difficult to fully characterize the semantic information of words or words based on the word-based named entity recognition method.Therefore,this paper considers the semantic features of the Chi nese and English fine-grained parts semantics to capture words,and proposes a CNN-based network security named entity recogni tion method(CC-NS-NER),which uses the CNN to extract the features of the component word parts'key semantic features,rich semantic information at the word level,and the introduction of BiLSTM-CRF to ensure abstract information extracted from word vec tors and component features,and to obtain association information between tags and identify network security named entities in the text.The experimental results on the manually labeled network security dataset show that compared with the traditional model,this method can effectively obtain the semantic information of the words or words,and significantly improve the recognition of network se curity named entities.
Keywords:network security  named entity recognition  CNN  Bi-LSTM  CRF
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号