首页 | 官方网站   微博 | 高级检索  
     

基于单层标注级联模型的篇章情感倾向分析
引用本文:李本阳,关毅,董喜双,李生.基于单层标注级联模型的篇章情感倾向分析[J].中文信息学报,2012,26(4):3-9.
作者姓名:李本阳  关毅  董喜双  李生
作者单位:哈尔滨工业大学 计算机科学与技术学院,黑龙江 哈尔滨 150001
基金项目:国家自然科学基金资助项目
摘    要:情感分类是目前篇章情感分析的主要方法,但该方法存在难以融入中文结构特征的问题。针对此问题,采用级联模型对篇章情感倾向进行分析,将篇章情感倾向分析分为两层 小句级和篇章级,对篇章情感倾向分析引入小句级的情感分析。该文使用最大熵模型处理小句级情感分类,小句级的输出作为上层篇章级的输入,并结合句型特征和句子位置等信息作为特征,采用支持向量机模型进行篇章级情感分类。同时对于级联模型中双层标注问题,基于交叉验证的思想提出了单层标注级联模型,避免了多层标注工作以及错误。实验结果表明,该方法的准确率较传统情感分类方法提高了2.53%。

关 键 词:情感倾向分析  情感分类  级联模型  最大熵  支持向量机  

Single-label Cascaded Model for Document Sentiment Analysis
LI Benyang , GUAN Yi , DONG Xishuang , LI Sheng.Single-label Cascaded Model for Document Sentiment Analysis[J].Journal of Chinese Information Processing,2012,26(4):3-9.
Authors:LI Benyang  GUAN Yi  DONG Xishuang  LI Sheng
Affiliation:School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China
Abstract:Classification is the main method to analyze the document sentiment polarity,but it is defected in its deficiency in integrating the structure features.A cascaded model for sentiment polarity analysis is proposed to address this issue,which consists of two levels: the clause level and the document level.The document is first segmented into clauses which are classified into positive and negative categories by an Maximum Entropy model.Afterwards,these categories are combined with types and positions of clauses as features for document classification via the Support Vector Machine model.Meanwhile,a Single-label Cascade Model based on cross-validation is proposed.Experimental results prove that the accuracy of the proposed method is improved by 2.53 compared with traditional methods of sentiment classification.
Keywords:sentiment analysis  sentiment classification  cascade model  ME  SVM
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号