首页 | 官方网站   微博 | 高级检索  
     

中文微博句子倾向性分类中特征抽取研究
引用本文:徐雄飞,徐凡,王明文,左家莉,罗文兵. 中文微博句子倾向性分类中特征抽取研究[J]. 江西师范大学学报(自然科学版), 2015, 0(3): 290-296
作者姓名:徐雄飞  徐凡  王明文  左家莉  罗文兵
作者单位:江西师范大学计算机信息工程学院,江西 南昌 330022
基金项目:国家自然科学基金(61272212,61163006,61203313,61365002,61462045)资助项目
摘    要:针对中文微博句子倾向性分类问题,在充分降低由于情感词典的扩充工作带来系统开销的基础上,抽取了中文微博句子中标点符号、情感词权重、词汇级和句法级等新型平面和结构化特征,探索了有效的特征选择方法.在基准COAE和NLP&CC中文微博语料上进行双向交叉和独立实验,并研究了有效的不平衡性语料的处理方法.实验结果表明:采用该文提出的特征后,中文微博句子倾向性分类的性能得到显著提升.

关 键 词:中文微博  句子倾向性  特征抽取  分类

The Research on Feature Extraction of Polarity Classification of Chinese Micro Blogging
XU Xiongfei , XU Fan , WANG Mingwen , ZUO Jiali , LUO Wenbing. The Research on Feature Extraction of Polarity Classification of Chinese Micro Blogging[J]. Journal of Jiangxi Normal University (Natural Sciences Edition), 2015, 0(3): 290-296
Authors:XU Xiongfei    XU Fan    WANG Mingwen    ZUO Jiali    LUO Wenbing
Affiliation:XU Xiongfei;XU Fan;WANG Mingwen;ZUO Jiali;LUO Wenbing;School of Computer Information Engineering,Jiangxi Normal University;
Abstract:According to Chinese micro blogging sentence polarity identification problem,while fully reducing due to the emotional lexicon expansion work brought on the basis of system overhead,many novel flat and structural features,e.g.punctuation,sentiment word weighting,lexical and syntactic level information,from Chinese micro blogging,together with the effective feature selection method has been extracted.In-depth bidirectional and independent experiments on both COAE and NLP&CC,along with the effective imbalance corpus handling method has been conducted.Evaluation results show that the effectiveness of our novel features.Its also show that the model significantly outperforms existing model currently in the research field.
Keywords:Chinese micro blogging  sentence polarity  feature extraction  classification
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《江西师范大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《江西师范大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号