首页 | 官方网站   微博 | 高级检索  
     

基于多通道特征和自注意力的情感分类方法
引用本文:李卫疆,漆芳,余正涛.基于多通道特征和自注意力的情感分类方法[J].软件学报,2021,32(9):2783-2800.
作者姓名:李卫疆  漆芳  余正涛
作者单位:昆明理工大学 信息工程与自动化学院, 云南 昆明 650500
基金项目:国家自然科学基金(62066022);国家重点研发计划(2018YFC0830105)
摘    要:针对情感分析任务中没有充分利用现有的语言知识和情感资源,以及在序列模型中存在的问题:模型会将输入文本序列解码为某一个特定的长度向量,如果向量的长度设定过短,会造成输入文本信息丢失.提出了一种基于多通道特征和自注意力的双向LSTM情感分类方法(MFSA-BiLSTM),该模型对情感分析任务中现有的语言知识和情感资源进行建模,形成不同的特征通道,并使用自注意力重点关注加强这些情感信息.MFSA-BiLSTM可以充分挖掘句子中的情感目标词和情感极性词之间的关系,且不依赖人工整理的情感词典.另外,在MFSA-BiLSTM模型的基础上,针对文档级文本分类任务提出了MFSA-BiLSTM-D模型.该模型先训练得到文档的所有的句子表达,再得到整个文档表示.最后,对5个基线数据集进行了实验验证.结果表明:在大多数情况下,MFSA-BiLSTM和MFSA-BiLSTM-D这两个模型在分类精度上优于其他先进的文本分类方法.

关 键 词:情感分类  多通道特征  自注意力  深度学习  双向LSTM
收稿时间:2019/6/24 0:00:00
修稿时间:2019/10/31 0:00:00

Sentiment Classification Method Based on Multi-channel Features and Self-attention
LI Wei-Jiang,QI Fang,YU Zheng-Tao.Sentiment Classification Method Based on Multi-channel Features and Self-attention[J].Journal of Software,2021,32(9):2783-2800.
Authors:LI Wei-Jiang  QI Fang  YU Zheng-Tao
Affiliation:Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
Abstract:The purpose of this study is for the problem that the existing language knowledge and emotion resources are not fully utilized in the emotion analysis tasks, as well as the problems in the sequence model:the model will decode the input text sequence into a specific length vector, if the length of the vector is set too short, the information of input text will be lost. A bidirectional LSTM sentiment classification method is proposed based on multi-channel features and self-attention (MFSA-BiLSTM). This method models the existing linguistic knowledge and sentiment resources in sentiment analysis tasks to form different feature channels, and uses self-attention mechanism to focus on sentiment information. MFSA-BiLSTM model can fully explore the relationship between sentiment target words and sentiment polar words in a sentence, and does not rely on a manually compiled sentiment lexicon. In addition, this study proposes the MFSA- BiLSTM-D model based on the MFSA-BiLSTM model for document-level text classification tasks. The model first obtains all sentence expressions of the document through training, and then gets the entire document representation. Finally, experimental verifications are conducted on five sentiment classification datasets. The results show that MFSA-BiLSTM and MFSA-BiLSTM-D are superior to other state-of-the-art text classification methods in terms of classification accuracy in most cases.
Keywords:sentiment classification  multi-channel features  self-attention  deep learning  bidirectional LSTM
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号