首页 | 官方网站   微博 | 高级检索  
     

基于双向LSTM的维吾尔语事件因果关系抽取
引用本文:田生伟,周兴发,禹龙,冯冠军,艾山?吾买尔,李圃.基于双向LSTM的维吾尔语事件因果关系抽取[J].电子与信息学报,2018,40(1):200-208.
作者姓名:田生伟  周兴发  禹龙  冯冠军  艾山?吾买尔  李圃
作者单位:1.(新疆大学软件学院 乌鲁木齐 830046) ②(新疆大学网络中心 乌鲁木齐 830046) ③(新疆大学人文学院 乌鲁木齐 830046) ④(新疆大学信息科学与工程学院 乌鲁木齐 830046) ⑤(新疆大学语言学院 乌鲁木齐 830046)
基金项目:国家自然科学基金(61662074, 61563051, 61262064),国家自然科学基金重点项目(61331011),新疆自治区科技人才培养项目(QN2016YX0051)
摘    要:针对传统方法不能有效抽取维吾尔语事件因果关系的问题,该文提出一种基于双向LSTM(Bidirectional Long Short-Term Memory, BiLSTM)的维吾尔语事件因果关系抽取方法。通过对维吾尔语语言以及事件因果关系特点的研究,提取出10项基于事件内部结构信息的特征;同时为充分利用事件语义信息,引入词嵌入作为BiLSTM的输入,提取事件句隐含的深层语义特征并利用批样规范化(Batch Normalization, BN)算法加速BiLSTM的收敛;最后融合这两类特征作为softmax分类器的输入进而完成维吾尔语事件因果关系抽取。实验结果表明,该方法用于维吾尔语事件因果关系的抽取准确率为 89.19%, 召回率为 83.19%, F值为86.09%,证明了该文提出的方法在维吾尔语事件因果关系抽取上的有效性。

关 键 词:语言信号处理    事件因果关系    维吾尔语    双向LSTM    词嵌入    批样规范化
收稿时间:2017-05-02

Causal Relation Extraction of Uyghur Events Based on Bidirectional Long Short-term Memory Model
TIAN Shengwei,ZHOU Xingfa,YU Long,FENG Guanjun,Aishan WUMAIER,LI Pu.Causal Relation Extraction of Uyghur Events Based on Bidirectional Long Short-term Memory Model[J].Journal of Electronics & Information Technology,2018,40(1):200-208.
Authors:TIAN Shengwei  ZHOU Xingfa  YU Long  FENG Guanjun  Aishan WUMAIER  LI Pu
Affiliation:1.(School of Software, Xinjiang University, Urumqi 830046, China)2.(Net Center, Xinjiang University, Urumqi 830046, China)
Abstract:Since the traditional events causal relation has the disadvantages of small recognition coverage, a method for causal relation extraction of Uyghur events is presented based on Bidirectional Long Short-Term Memory (BiLSTM) model. In order to make full use of the event structure information, 10 characteristics of the Uyghur events structure information are extracted based on the study of the events causal relationship and Uyghur language features; At the same time, the word embedding is introduced as the input of BiLSTM to extract the deep semantic features of the Uyghur events and Batch Normalization (BN) algorithm is usded to accelerate the convergence of BiLSTM. Finally, concatenating these two kinds of features as the input of the softmax classifier to extract the Uyghur events causal relations. This method is used in the causal relation extraction of Uyghur events, and the results show that the precision rate, the recall rate and F value can reach 89.19 %, 83.19% and 86.09 %, indicating the effectiveness and practicability of the method of causal relation extraction of Uyghur events.
Keywords:
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号