首页 | 官方网站   微博 | 高级检索  
     

基于混合互信息算法的文本情感分析
引用本文:王 义,戴月明.基于混合互信息算法的文本情感分析[J].计算机应用研究,2020,37(2):337-341.
作者姓名:王 义  戴月明
作者单位:江南大学 物联网工程学院,江苏 无锡214122;江南大学 物联网工程学院,江苏 无锡214122
摘    要:针对互信息(mutual information,MI)特征选择方法存在的正负相关性的现象以及未考虑特征项在不同类别内词频的问题,提出了一种混合互信息特征选择算法(hybrid mutual information,HMI)。引入逆文档频率系数和类间词频信息系数,使得整个文档中的词频信息以及每个类之间的词频信息得以有效利用;引入正负相关性系数,区分正相关性和负相关性并进行有效的利用。通过实验对比表明,混合互信息算法可以有效地提高特征选择的质量,进而提高文本情感分析的效果。

关 键 词:互信息  特征选择  正负相关性  词频信息  情感分析
收稿时间:2018/8/2 0:00:00
修稿时间:2018/9/28 0:00:00

Text sentiment analysis based on hybrid mutual information algorithm
Wang Yi and Dai Yueming.Text sentiment analysis based on hybrid mutual information algorithm[J].Application Research of Computers,2020,37(2):337-341.
Authors:Wang Yi and Dai Yueming
Affiliation:Jiangnan University,
Abstract:Aiming at the phenomenon of positive and negative correlation in the feature selection method of mutual information(MI) and the problem of the word frequency of the feature items in different categories hadn''t been considered, this paper proposed a hybrid mutual information(HMI) feature selection algorithm. By introducing the inverse document frequency coefficient and the inter-class word frequency information coefficient, the algorithm could effectively utilize the word frequency information in the whole document and the word frequency information between each class. It introduced the positive and negative correlation coefficient to distinguish positive correlation and negative correlation and made effective use. The experimental results show that the hybrid mutual information algorithm can effectively improve the quality of feature selection and then improve the effect of text emotional analysis.
Keywords:mutual information  feature selection  positive and negative correlation  word frequency information  sentiment analysis
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号