
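Adaptive-Weight Multi-gram Sentence Modeling System Based on Convolutional Neural Networks
Cite this article: ZHANG Chun-yun, QIN Peng-da, YIN Yi-long. Adaptive-Weight Multi-gram Sentence Modeling System Based on Convolutional Neural Networks [J]. Computer Science, 2017, 44(1): 60-64.
Authors: ZHANG Chun-yun, QIN Peng-da, YIN Yi-long
Affiliations: School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan 250014; School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876; School of Computer Science and Technology, Shandong University, Jinan 250101
Funding: This work was supported by the Key Program of the National Natural Science Foundation of China, "Machine-learning-based multimodal medical image information processing and analysis" (U1201258), and by the Shandong Provincial Natural Science Foundation for Distinguished Young Scholars, "Research on machine-learning-based biometric recognition" (JQ201316).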
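Abstract: With the explosive growth of information, natural language processing has received increasingly broad attention. Traditional natural language processing systems depend too heavily on expensive hand-annotated features and on syntactic information from language analysis tools, so errors in the syntactic information produced during preprocessing propagate into system training and prediction. Deep learning has therefore attracted researchers' attention, because it enables end-to-end prediction while relying as little as possible on external information. To capture sentence information better, the deep learning frameworks popular in natural language processing adopt a multi-gram strategy. However, the distribution of information differs across tasks and datasets, and this strategy does not account for how importance is distributed over the different n-grams. To address this problem, this paper proposes a deep-learning-based strategy that adaptively learns multi-gram weights, assigning each n-gram feature a weight according to its contribution. It also proposes a new method for combining multi-gram feature vectors that greatly reduces system complexity. The model is applied to two classification tasks, positive/negative sentiment classification of movie reviews and relation classification, and the experimental results show that the adaptive multi-gram weight strategy greatly improves classification performance.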

Keywords: Deep learning; Natural language processing; Adaptive weight; Multi-gram
Received: 2015/8/1
Revised: 2015/10/11

Self-adaptation Multi-gram Weight Learning Strategy for Sentence Representation Based on Convolutional Neural Network
ZHANG Chun-yun, QIN Peng-da and YIN Yi-long. Self-adaptation Multi-gram Weight Learning Strategy for Sentence Representation Based on Convolutional Neural Network [J]. Computer Science, 2017, 44(1): 60-64.
Authors: ZHANG Chun-yun, QIN Peng-da and YIN Yi-long
Affiliation: School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan 250014, China; School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China; and School of Computer Science and Technology, Shandong University, Jinan 250101, China
Abstract: With the explosive growth of information, natural language processing has received increasing attention. Traditional natural language processing systems depend too heavily on expensive handcrafted features annotated by experts and on syntax information from language analysis tools. Deep neural networks can achieve end-to-end learning without such costly features. To extract more information from input sentences, most neural networks for natural language processing adopt a multi-gram strategy. However, the distribution of information across n-grams differs between tasks and datasets. With this in mind, this paper proposes a self-adaptive weight learning strategy for multi-gram features, which learns the relative importance of each n-gram during neural network training. Moreover, a novel method for combining multi-gram feature vectors is introduced. Experimental results show that this approach not only reduces network complexity but also improves performance on positive/negative sentiment classification of movie reviews and on relation classification.
Keywords: Deep learning; Natural language processing; Self-adaptation; Multi-gram
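
The abstract does not say how the n-gram weights are parameterized or how the feature vectors are combined, so the following is only a minimal forward-pass sketch under stated assumptions, not the authors' implementation. It assumes one scalar score per n-gram order, normalized with a softmax, and a weighted sum of max-pooled convolution features in place of concatenation (which keeps the sentence vector the same size regardless of how many n-gram orders are used). The names `conv_max_pool`, `combine_multigram`, `filter_banks` and `gram_scores` are hypothetical. In training, the scores would be updated by backpropagation along with the filters; that step is omitted here.

```python
import math
import random


def conv_max_pool(embeddings, filters, n):
    """Apply each filter to every window of n consecutive word vectors,
    pass the result through tanh, and keep the maximum per filter."""
    pooled = []
    for filt in filters:  # each filter is a flat list of n * dim weights
        best = -math.inf
        for start in range(len(embeddings) - n + 1):
            window = [x for vec in embeddings[start:start + n] for x in vec]
            act = math.tanh(sum(w * x for w, x in zip(filt, window)))
            best = max(best, act)
        pooled.append(best)
    return pooled


def softmax(scores):
    """Numerically stable softmax over a list of raw scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def combine_multigram(embeddings, filter_banks, gram_scores):
    """Weighted sum of per-n-gram feature vectors (assumed combination rule).

    filter_banks: dict mapping n -> list of filters (same count for every n)
    gram_scores:  dict mapping n -> raw scalar score (a learnable parameter)
    Returns the sentence vector and the normalized weight of each n.
    """
    orders = sorted(filter_banks)
    weights = softmax([gram_scores[n] for n in orders])
    num_filters = len(filter_banks[orders[0]])
    sentence_vec = [0.0] * num_filters
    for weight, n in zip(weights, orders):
        feats = conv_max_pool(embeddings, filter_banks[n], n)
        for i, f in enumerate(feats):
            sentence_vec[i] += weight * f
    return sentence_vec, dict(zip(orders, weights))


if __name__ == "__main__":
    random.seed(0)
    dim, num_filters = 4, 3
    # Toy sentence of 6 words with random 4-dimensional embeddings.
    sentence = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(6)]
    banks = {
        n: [[random.uniform(-1, 1) for _ in range(n * dim)]
            for _ in range(num_filters)]
        for n in (1, 2, 3)
    }
    scores = {1: 0.0, 2: 0.0, 3: 0.0}  # equal scores give uniform weights
    vec, weights = combine_multigram(sentence, banks, scores)
    print(vec)
    print(weights)
```

With all scores equal, every n-gram order contributes equally; raising one score shifts weight toward that order, which is the behaviour an adaptive scheme would learn from data.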