首页 | 官方网站   微博 | 高级检索  
     

基于多层关系图模型的中文评价对象与评价词抽取方法
引用本文:廖祥文, 陈兴俊, 魏晶晶, 陈国龙, 程学旗. 基于多层关系图模型的中文评价对象与评价词抽取方法. 自动化学报, 2017, 43(3): 462-471. doi: 10.16383/j.aas.2017.c160060
作者姓名:廖祥文  陈兴俊  魏晶晶  陈国龙  程学旗
作者单位:1.福州大学数学与计算机科学学院 福州 350116;;2.福建江夏学院电子信息科学学院 福州 350108;;3.福建省网络计算与智能信息处理重点实验室 (福州大学) 福州 350116;;4.中国科学院计算技术研究所网络数据科学与技术重点实验室 北京 100190
基金项目:国家自然科学基金青年项目(61300105),中国科学院网络数据科学与技术重点实验室开放基金课题(CASNDST20140X)资助
摘    要:中文评价对象与评价词抽取是文本倾向性分析的重要问题.如何利用评价对象与评价词之间的语法、共现等关系设计模型是提高抽取精度的关键.本文提出了一种基于多层关系图模型的中文评价对象与评价词抽取方法.该方法首先利用词对齐模型抽取评价对象与评价词搭配;然后,考虑评价对象与评价词的依存句法关系、评价对象内部的共现关系和评价词内部的共现关系,建立多层情感关系图,接着利用随机游走方法计算候选评价对象与评价词的置信度;最后,选取置信度高的候选评价对象与评价词作为输出.实验结果表明,与现有的方法相比,本文所提出的方法不仅对评价对象和评价词的抽取精度均有显著提升,而且具有良好的鲁棒性.

关 键 词:倾向性分析   观点挖掘   依存句法分析   随机游走
收稿时间:2016-01-20

A Multi-layer Relation Graph Model for Extracting Opinion Targets and Opinion Words
LIAO Xiang-Wen, CHEN Xing-Jun, WEI Jing-Jing, CHEN Guo-Long, CHENG Xue-Qi. A Multi-layer Relation Graph Model for Extracting Opinion Targets and Opinion Words. ACTA AUTOMATICA SINICA, 2017, 43(3): 462-471. doi: 10.16383/j.aas.2017.c160060
Authors:LIAO Xiang-Wen  CHEN Xing-Jun  WEI Jing-Jing  CHEN Guo-Long  CHENG Xue-Qi
Affiliation:1. College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116;;2. College of Electronics and Information Science, Fujian Jiangxia University, Fuzhou 350108;;3. Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing, Fuzhou 350116;;4. Key Laboratory of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190
Abstract:Mining opinion targets and opinion words is a fundamental task for the Chinese online media to mine opinion and analyze sentiment. The key to enhancing the effectiveness of opinion target and opinion word is to integrate syntactic relations and co-occurrence relations between opinion target and opinion word into the mining model. A novel approach based on a multi-layer relation graph model is proposed to extract opinion targets and opinion words from Chinese social media. First, the word alignment model is employed to extract the candidates of opinion target and opinion word candidates. Second, a multi-layer relation graph is constructed by the syntactic inter-relations between opinion target and opinion word, the co-occurrence intra-relations among opinion targets, and the co-occurrence intra-relations among opinion words. Third, a random-walk algorithm is adopted to calculate the confidence of each opinion target candidate and opinion word candidate. Finally, opinion targets and opinion words are extracted according to their confidence values. Experimental results show that the presented method can not only achieve significant improvement over previous methods, but also have good robustness.
Keywords:Sentiment analysis  opinion mining  dependency syntactic parsing  random walk
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号