首页 | 官方网站   微博 | 高级检索  
     

一种基于内容的音频流二级分割方法
引用本文:张一彬,周杰,边肇祺,张大鹏.一种基于内容的音频流二级分割方法[J].计算机学报,2006,29(3):457-465.
作者姓名:张一彬  周杰  边肇祺  张大鹏
作者单位:1. 清华大学自动化系,北京,100084
2. 香港理工大学计算学系,香港
基金项目:中国科学院资助项目;北京市自然科学基金
摘    要:基于内容的音频流分割是多媒体数据分析领域中的一个十分重要和困难的问题.目前大多数传统的音频流分割方法是基于小尺度音频分类的,但是这类分割方法普遍存在虚假分割点过多的缺点,严重影响了实际应用的效果.作者的研究表明,大尺度音频片段的分类正确率要明显高于小尺度音频片段的分类正确率,并且这个趋势与分类器选择无关.基于这个事实和减少虚假分割点的目的,作者提出了一种新的音频流分割方法.首先,采用基于大尺度音频分类的分割方法对音频流进行粗分割,以减少虚假分割点;然后定义了分割点评价函数,并利用它在边界区域中进一步精确定位分割点.实验结果表明这种音频流分割方法可以比较精确地获取分割点位置,同时将虚假分割点减少到传统方法的四分之一.

关 键 词:音频分类  音频流分割  分割点评价函数  虚假分割  神经网络
收稿时间:2004-12-29
修稿时间:2004-12-292005-11-04

A Two-Stage Content-Based Audio Segmentation Algorithm
ZHANG Yi-Bin,ZHOU Jie,BIAN Zhao-Qi,ZHANG David.A Two-Stage Content-Based Audio Segmentation Algorithm[J].Chinese Journal of Computers,2006,29(3):457-465.
Authors:ZHANG Yi-Bin  ZHOU Jie  BIAN Zhao-Qi  ZHANG David
Affiliation:1.Department of Automation, Tsinghua University, Beijing 100084;2.Department of Computing, The Hong Kong Polytechnic University, Hong Kong
Abstract:Content-based audio segmentation plays an important role in multimedia applications.In order to segment accurately and on-line,most conventional algorithms are based on small-scale audio classification and always result in a high false segmentation rate.The authors'experimental results show that large-scale audio can be more easily classified than small ones,and this trend is irrespective of classifiers.According to this fact,this paper presents a novel framework for audio segmentation to reduce the false segmentations.First,a rough segmentation step based on large-scale audio classification is taken to ensure the integrality of the content of audio segments,which can(avoid) the consecutive audio belonging to the same kind being segmented into different pieces.Then a subtle segmentation step based on segmentation point evaluation function is taken to further locate the segmentation points for the boundary areas computed by the rough segmentation step.Experimental results show that nearly 3/4 false segmentation points can be reduced comparing to the conventional audio segmentation method based on small-scale audio classification,while preserving a low missing rate.
Keywords:audio classification  audio segmentation  segmentation point evaluation function  false segmentation  neural network
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号