首页 | 官方网站   微博 | 高级检索  
     

基于BERT的端到端中文篇章事件抽取
引用本文:张洪宽,宋晖,徐波,王舒怡.基于BERT的端到端中文篇章事件抽取[J].中文信息学报,2022,36(10):97-106.
作者姓名:张洪宽  宋晖  徐波  王舒怡
作者单位:1.东华大学 计算机科学与技术学院,上海 201620;
2.上海市计算机软件评测重点实验室,上海 201100
基金项目:国家自然科学基金(61906035);上海市青年科技英才扬帆计划项目(19YF140230)
摘    要:篇章级事件抽取研究从整篇文档中检测事件,识别出事件包含的元素并赋予每个元素特定的角色。该文针对限定领域的中文文档提出了基于BERT的端到端模型,在模型的元素和角色识别中依次引入前序层输出的事件类型以及实体嵌入表示,增强文本的事件、元素和角色关联表示,提高篇章中各事件所属元素的识别精度。在此基础上利用标题信息和事件五元组的嵌入式表示,实现主从事件的划分及元素融合。实验证明,该文提出的方法与现有工作相比具有明显的性能提升。

关 键 词:篇章级事件抽取  端到端  主从事件  
收稿时间:2021-02-21

A BERT-based End-to-End Model for Chinese Document-level Event Extraction
ZHANG Hongkuan,SONG Hui,XU Bo,WANG Shuyi.A BERT-based End-to-End Model for Chinese Document-level Event Extraction[J].Journal of Chinese Information Processing,2022,36(10):97-106.
Authors:ZHANG Hongkuan  SONG Hui  XU Bo  WANG Shuyi
Affiliation:1.School of Computer Science and Technology, Donghua University, Shanghai 201620, China;2.Shanghai Key Laboratory of Computer Software Testing and Evaluating, Shanghai 201100, China
Abstract:Document-level event extraction aims at discovering the event with its arguments and their roles from texts. This paper proposes an end-to-end model for domain-specific document-level event extraction based on BERT. We introduce the embedding of event type and entity nodes to the subsequent layer for event argument and role identification, which represents the relation between events, arguments and roles to improve the accuracy of classifying multi-event arguments. With the title and the embedding of the quintuple of event, we realize the identification of principal and subordinate events, and element fusion between multiple events. Experimental results show that our model outperforms the baselines.
Keywords:document-level event extraction  end-to-end  principal and subordinate events  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号