首页 | 官方网站   微博 | 高级检索  
     

以图像视频为中心的跨媒体分析与推理
引用本文:黄庆明,,王树徽,许倩倩,李亮,蒋树强.以图像视频为中心的跨媒体分析与推理[J].智能系统学报,2021,16(5):835-848.
作者姓名:黄庆明    王树徽  许倩倩  李亮  蒋树强
作者单位:1. 中国科学院大学 计算机科学与技术学院,北京 100049;2. 中国科学院计算技术研究所 智能信息处理实验室,北京 100190
摘    要:如何跨越从跨媒体数据到跨媒体知识所面临的“异构鸿沟”和“语义鸿沟”,对体量巨大的跨媒体数据进行有效管理与利用,是发展新一代人工智能亟待突破的瓶颈问题。针对以图像视频为代表的海量网络跨媒体内容,借鉴人类感知与认知机理,本文对跨媒体内容统一表征与符号化表征、跨媒体深度关联理解、类人跨媒体智能推理等关键技术开展研究。基于上述关键技术,着力于解决发展新一代人工智能的知识匮乏共性难题,开展大规模跨媒体知识图谱的构建及人机协同标注技术研究,为跨媒体感知进阶到认知提供关键支撑,进一步为跨媒体理解、检索、内容转换生成等跨媒体内容管理与服务热点应用领域提供了可行思路。

关 键 词:跨媒体  图像视频  统一表征  关联理解  可解释推理  人机协同  知识图谱  内容管理与服务

Image video centered cross-media analysis and reasoning
HUANG Qingming,,WANG Shuhui,XU Qianqian,LI Liang,JIANG Shuqiang.Image video centered cross-media analysis and reasoning[J].CAAL Transactions on Intelligent Systems,2021,16(5):835-848.
Authors:HUANG Qingming    WANG Shuhui  XU Qianqian  LI Liang  JIANG Shuqiang
Affiliation:1. School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China;2. Key Lab of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
Abstract:How to surpass the heterogeneity gap and semantic gap between the cross-media content and cross-media knowledge, and how to manage and utilize the huge amount of cross-media data effectively are urgent bottleneck problems of developing a new generation of artificial intelligence. Aiming at massive online cross-media content represented by image video and by referring to human perception and cognition mechanisms, this paper undertakes studies on such key technologies as unified representation and symbolic representation of cross-media content, deep correlative understanding of cross-media and human-like cross-media intelligent reasoning. Based on the above technologies, this paper focuses on solving the common problem of knowledge shortage in the development of a new generation of artificial intelligence and carries out a research on the construction of large-scale cross-media knowledge graph and the human-machine cooperation based labeling technology, to provide strong support for the advancement from cross-media perception to cognition and further provide feasible solutions towards cross-media content management and popular service applications, e.g., cross-media content understanding, retrieval, content transformation and generation, etc.
Keywords:cross-media  image video  unified representation  correlative understanding  explainable reasoning  Human-computer collaboration  knowledge graph  content management and service
点击此处可从《智能系统学报》浏览原始摘要信息
点击此处可从《智能系统学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号