首页 | 官方网站   微博 | 高级检索  
     

可解释化、结构化、多模态化的深度神经网络
引用本文:熊红凯,高星,李劭辉,徐宇辉,王涌壮,余豪阳,刘昕,张云飞. 可解释化、结构化、多模态化的深度神经网络[J]. 模式识别与人工智能, 2018, 31(1): 1-11. DOI: 10.16451/j.cnki.issn1003-6059.201801001
作者姓名:熊红凯  高星  李劭辉  徐宇辉  王涌壮  余豪阳  刘昕  张云飞
作者单位:1.上海交通大学 电子工程系 上海 200240
2.深圳市腾讯计算机系统有限公司 深圳 518000
3.宇龙计算机通信科技有限公司 深圳 518035
摘    要:深度学习方法依赖于大规模的标签数据,通过端到端的监督训练,在计算机视觉、自然语言处理领域都取得优异性能.但是,现有方法通常针对单一模态数据,忽视数据的内在结构,缺乏理论支撑.针对上述问题,文中从基于小波核学习的深度滤波器组网络设计、基于结构化学习的深度学习、基于多模态学习的深度学习3个角度阐述结合深度学习方法与小波理论、结构化预测的潜在方法,以及其拓展到多模态数据的可行机制.

关 键 词:深度学习  滤波器组  小波理论  结构化学习  多模态学习  
收稿时间:2017-09-26

Interpretable Structured Multi-modal Deep Neural Network
XIONG Hongkai,GAO Xing,LI Shaohui,XU Yuhui,WANG Yongzhuang,YU Haoyang,LIU Xin,ZHANG Yunfei. Interpretable Structured Multi-modal Deep Neural Network[J]. Pattern Recognition and Artificial Intelligence, 2018, 31(1): 1-11. DOI: 10.16451/j.cnki.issn1003-6059.201801001
Authors:XIONG Hongkai  GAO Xing  LI Shaohui  XU Yuhui  WANG Yongzhuang  YU Haoyang  LIU Xin  ZHANG Yunfei
Affiliation:1.Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240
2.Shenzhen Tencent Computer System Co., Ltd, Shenzhen 518000
3.Yulong Computer Communication Technology Co., Ltd, Shenzhen 518035
Abstract:Deep learning methods achieve excellent performance in the fields of computer vision and natural language processing through end-to-end supervised training dependent on large scale labeled datasets. However, the existing methods are often targeted for single modal data, ignoring the inherent structure of the data with the lack of theoretical support. Therefore, the wavelet theory based deep convolution networks, the structured deep learning and the multi-modal deep learning are discussed in this paper to demonstrate the potential methods of the combination of deep learning techniques, wavelet theory and structure prediction, and the viable mechanism for extending to multi-modal data is explored as well.
Keywords:Deep Learning  Filter Bank  Wavelet Theory  Structured Learning  Multi-modal Learning  
点击此处可从《模式识别与人工智能》浏览原始摘要信息
点击此处可从《模式识别与人工智能》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号