以图像视频为中心的跨媒体分析与推理 Image video centered cross-media analysis and reasoning期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

以图像视频为中心的跨媒体分析与推理

引用本文：	黄庆明,,王树徽,许倩倩,李亮,蒋树强.以图像视频为中心的跨媒体分析与推理[J].智能系统学报,2021,16(5):835-848.

作者姓名：	黄庆明王树徽许倩倩李亮蒋树强

作者单位：	1. 中国科学院大学计算机科学与技术学院，北京 100049;2. 中国科学院计算技术研究所智能信息处理实验室，北京 100190

摘要：	如何跨越从跨媒体数据到跨媒体知识所面临的“异构鸿沟”和“语义鸿沟”，对体量巨大的跨媒体数据进行有效管理与利用，是发展新一代人工智能亟待突破的瓶颈问题。针对以图像视频为代表的海量网络跨媒体内容，借鉴人类感知与认知机理，本文对跨媒体内容统一表征与符号化表征、跨媒体深度关联理解、类人跨媒体智能推理等关键技术开展研究。基于上述关键技术，着力于解决发展新一代人工智能的知识匮乏共性难题，开展大规模跨媒体知识图谱的构建及人机协同标注技术研究，为跨媒体感知进阶到认知提供关键支撑，进一步为跨媒体理解、检索、内容转换生成等跨媒体内容管理与服务热点应用领域提供了可行思路。
关键词：	跨媒体图像视频统一表征关联理解可解释推理人机协同知识图谱内容管理与服务
Image video centered cross-media analysis and reasoning

HUANG Qingming,,WANG Shuhui,XU Qianqian,LI Liang,JIANG Shuqiang.Image video centered cross-media analysis and reasoning[J].CAAL Transactions on Intelligent Systems,2021,16(5):835-848.

Authors:	HUANG Qingming WANG Shuhui XU Qianqian LI Liang JIANG Shuqiang

Affiliation:	1. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China;2. Key Lab of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China

Abstract:	How to surpass the heterogeneity gap and semantic gap between the cross-media content and cross-media knowledge, and how to manage and utilize the huge amount of cross-media data effectively are urgent bottleneck problems of developing a new generation of artificial intelligence. Aiming at massive online cross-media content represented by image video and by referring to human perception and cognition mechanisms, this paper undertakes studies on such key technologies as unified representation and symbolic representation of cross-media content, deep correlative understanding of cross-media and human-like cross-media intelligent reasoning. Based on the above technologies, this paper focuses on solving the common problem of knowledge shortage in the development of a new generation of artificial intelligence and carries out a research on the construction of large-scale cross-media knowledge graph and the human-machine cooperation based labeling technology, to provide strong support for the advancement from cross-media perception to cognition and further provide feasible solutions towards cross-media content management and popular service applications, e.g., cross-media content understanding, retrieval, content transformation and generation, etc.

Keywords:	cross-media image video unified representation correlative understanding explainable reasoning Human-computer collaboration knowledge graph content management and service

	点击此处可从《智能系统学报》浏览原始摘要信息
	点击此处可从《智能系统学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏