首页 | 官方网站   微博 | 高级检索  
     


Multimodal concept detection in broadcast media: KavTan
Authors:Medeni Soysal  K. Berker Loğoğlu  Mashar Tekin  Ersin Esen  Ahmet Saracoğlu  Banu Oskay Acar  Ezgi Can Ozan  Tuğrul K. Ateş  Hakan Sevimli  Müge Sevinç  İlkay Atıl  Savaş Özkan  Mehmet Ali Arabacı  Seda Tankız  Talha Karadeniz  Duygu Önür  Sezin Selçuk  A. Aydın Alatan  Tolga Çiloğlu
Affiliation:1. TUBITAK - UZAY, METU Campus, Ankara, Turkey
Abstract:Concept detection stands as an important problem for efficient indexing and retrieval in large video archives. In this work, the KavTan System, which performs high-level semantic classification in one of the largest TV archives of Turkey, is presented. In this system, concept detection is performed using generalized visual and audio concept detection modules that are supported by video text detection, audio keyword spotting and specialized audio-visual semantic detection components. The performance of the presented framework was assessed objectively over a wide range of semantic concepts (5 high-level, 14 visual, 9 audio, 2 supplementary) by using a significant amount of precisely labeled ground truth data. KavTan System achieves successful high-level concept detection performance in unconstrained TV broadcast by efficiently utilizing multimodal information that is systematically extracted from both spatial and temporal extent of multimedia data.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号