首页 | 官方网站   微博 | 高级检索  
     

基于word embedding和CNN的情感分类模型
引用本文:蔡慧苹,王丽丹,段书凯.基于word embedding和CNN的情感分类模型[J].计算机应用研究,2016,33(10).
作者姓名:蔡慧苹  王丽丹  段书凯
作者单位:西南大学 电子信息工程学院,西南大学 电子信息工程学院,西南大学 电子信息工程学院
基金项目:国家自然科学基金(61372139),教育部“春晖计划”科研项目(z2011148)
摘    要:尝试将word embedding和卷积神经网络(CNN)相结合来解决情感分类问题。首先,利用Skip-Gram模型训练出数据集中每个词的word embedding,然后将每条样本中出现的word embedding组合为二维特征矩阵作为卷积神经网络的输入;此外,每次迭代训练过程中,输入特征也作为参数进行更新。其次,设计了一种具有3种不同大小卷积核的神经网络结构,从而完成多种局部抽象特征的自动提取过程。与传统机器学习方法相比,所提出的基于word embedding和CNN的情感分类模型成功将分类正确率提升了5.04%。

关 键 词:卷积神经网络  自然语言处理  深度学习  词嵌入  情感分类
收稿时间:6/3/2015 12:00:00 AM
修稿时间:2016/8/21 0:00:00

A sentiment classification model based on word embedding and CNN
CAI Hui-ping,WANG Li-dan and DUAN Shu-kai.A sentiment classification model based on word embedding and CNN[J].Application Research of Computers,2016,33(10).
Authors:CAI Hui-ping  WANG Li-dan and DUAN Shu-kai
Affiliation:College of Electronic and Information Engineering,Southwest University,,College of Electronic and Information Engineering,Southwest University
Abstract:This paper tries to propose a method to solve the problem of sentiment classification by integrating word embedding and convolutional neural network (CNN). First of all, a training process accomplished with Skip-gram model has generated word embedding of each word in our dataset. Then, it tried to create a two-dimensional feature matrix which is the combination of word embedding of each word in a training sample as the input of CNN model. Each iteration process of training will also update entries of feature matrix as they are part of model parameters. Secondly, this paper proposes a CNN structure which is mainly composed of three different sizes of convolution kernels so as to complete the automatic extraction process of a variety of local abstract features. Comparing with traditional machine learning algorithms, the proposed word embedding and CNN based sentiment classification model has successfully improved classification accuracy by 5.04%.
Keywords:convolutional neural network (CNN)  natural language processing (NLP)  deep learning  word embedding  sentiment classification
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号