Anaphora Resolution of Uyghur Personal Pronouns Based on Multi-attention Mechanism

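Cite this article:	Yang Qimeng, Yu Long, Tian Shengwei, Aishan Wumaier. Anaphora Resolution of Uyghur Personal Pronouns Based on Multi-attention Mechanism[J]. Acta Automatica Sinica, 2021, 47(6): 1412-1421.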

Authors:	Yang Qimeng (杨启萌), Yu Long (禹龙), Tian Shengwei (田生伟), Aishan Wumaier (艾山·吾买尔)

Affiliation:	1. School of Software, Xinjiang University, Urumqi 830008

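Funding:	National Natural Science Foundation of China (61563051, 61662074, 61962057); Key Program of the National Natural Science Foundation of China (U2003208); Autonomous Region Major Science and Technology Project (2020A03004-4); Xinjiang Autonomous Region Science and Technology Talent Training Project (QN2016YX0051)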

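Abstract:	Deep neural network models that learn the semantic information of anaphors and candidate antecedents ignore the importance of each word in the sentence and cannot capture the sequential associations and dependencies among words. To address these problems, this paper proposes a Uyghur personal pronoun anaphora resolution method based on a contextual multi-attention independently recurrent neural network (CMAIR). Compared with deep neural networks that rely only on the semantic information of the anaphor and the candidate antecedent, the method analyzes the surrounding context and mines word-sequence dependencies, which improves feature representation. The method also incorporates a multi-attention mechanism that attends to multi-level semantic features of the anaphor-antecedent pair, compensating for the limitation of relying on content-level features alone and effectively identifying the reference relations between personal pronouns and entities. On the Uyghur personal pronoun anaphora resolution task, the model achieves a precision of 90.79%, a recall of 83.25%, and an F value of 86.86%. Experimental results show that the CMAIR model significantly improves Uyghur anaphora resolution performance.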
Keywords:	attention mechanism; context; independently recurrent neural network; anaphora resolution
Received:	2018-10-18
Anaphora Resolution of Uyghur Personal Pronouns Based on Multi-attention Mechanism

Affiliation:	1. School of Software, Xinjiang University, Urumqi 830008; 2. Key Laboratory of Software Engineering Technology, Xinjiang University, Urumqi 830046; 3. Key Laboratory of Signal and Information Processing, Xinjiang University, Urumqi 830046; 4. Network Center, Xinjiang University, Urumqi 830046; 5. College of Information Science and Technology, Xinjiang University, Urumqi 830046

Abstract:	Deep neural network models that learn the semantic information of anaphors and candidate antecedents ignore the importance of each word in the sentence and cannot attend to the sequential associations and dependencies within the word sequence. This paper proposes a Uyghur personal pronoun anaphora resolution method based on a contextual multi-attention independently recurrent neural network (CMAIR). Compared with deep neural networks that rely only on the semantic information of the anaphor and the candidate antecedent, this method can analyze contextual relations, mine word-sequence dependencies, and improve feature representation. In addition, the method incorporates a multi-attention mechanism that attends to multi-level semantic features of the pair to be resolved, compensating for the limitation of relying on content-level features alone and effectively recognizing the relationship between personal pronouns and entities. On the Uyghur personal pronoun anaphora resolution task, the method achieves a precision of 90.79%, a recall of 83.25%, and an F value of 86.86%. The experimental results show that the CMAIR model significantly improves the performance of Uyghur personal pronoun anaphora resolution.
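
As a quick consistency check on the reported figures, the sketch below recomputes the F value from the stated precision and recall. It assumes the F value is the balanced F1 score (the harmonic mean of precision and recall), which the abstract does not state explicitly. The function name `f_measure` and its `beta` parameter are illustrative and do not come from the paper.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Return the F-beta score; beta = 1 gives the harmonic mean (F1).

    Assumption: the abstract's "F value" is F1. Inputs may be fractions
    or percentages, as long as both use the same scale.
    """
    if precision <= 0 or recall <= 0:
        return 0.0
    beta_sq = beta * beta
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Figures reported in the abstract, in percent.
reported_precision = 90.79
reported_recall = 83.25

f_value = f_measure(reported_precision, reported_recall)
print(f"{f_value:.2f}")  # prints 86.86
```

Under that assumption, the recomputed value agrees with the reported 86.86% to two decimal places.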

Keywords:	attention mechanism; context; independently recurrent neural network; anaphora resolution
