一种基于内容的音频流二级分割方法 A Two-Stage Content-Based Audio Segmentation Algorithm期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种基于内容的音频流二级分割方法

引用本文：	张一彬,周杰,边肇祺,张大鹏.一种基于内容的音频流二级分割方法[J].计算机学报,2006,29(3):457-465.

作者姓名：	张一彬周杰边肇祺张大鹏

作者单位：	1. 清华大学自动化系,北京,100084 2. 香港理工大学计算学系,香港

基金项目：	中国科学院资助项目;北京市自然科学基金

摘要：	基于内容的音频流分割是多媒体数据分析领域中的一个十分重要和困难的问题．目前大多数传统的音频流分割方法是基于小尺度音频分类的，但是这类分割方法普遍存在虚假分割点过多的缺点，严重影响了实际应用的效果．作者的研究表明，大尺度音频片段的分类正确率要明显高于小尺度音频片段的分类正确率，并且这个趋势与分类器选择无关．基于这个事实和减少虚假分割点的目的，作者提出了一种新的音频流分割方法．首先，采用基于大尺度音频分类的分割方法对音频流进行粗分割，以减少虚假分割点；然后定义了分割点评价函数，并利用它在边界区域中进一步精确定位分割点．实验结果表明这种音频流分割方法可以比较精确地获取分割点位置，同时将虚假分割点减少到传统方法的四分之一．
关键词：	音频分类音频流分割分割点评价函数虚假分割神经网络
收稿时间：	2004-12-29
修稿时间：	2004-12-292005-11-04
A Two-Stage Content-Based Audio Segmentation Algorithm

ZHANG Yi-Bin,ZHOU Jie,BIAN Zhao-Qi,ZHANG David.A Two-Stage Content-Based Audio Segmentation Algorithm[J].Chinese Journal of Computers,2006,29(3):457-465.

Authors:	ZHANG Yi-Bin ZHOU Jie BIAN Zhao-Qi ZHANG David

Affiliation:	1.Department of Automation, Tsinghua University, Beijing 100084;2.Department of Computing, The Hong Kong Polytechnic University, Hong Kong

Abstract:	Content-based audio segmentation plays an important role in multimedia applications.In order to segment accurately and on-line,most conventional algorithms are based on small-scale audio classification and always result in a high false segmentation rate.The authors'experimental results show that large-scale audio can be more easily classified than small ones,and this trend is irrespective of classifiers.According to this fact,this paper presents a novel framework for audio segmentation to reduce the false segmentations.First,a rough segmentation step based on large-scale audio classification is taken to ensure the integrality of the content of audio segments,which can(avoid) the consecutive audio belonging to the same kind being segmented into different pieces.Then a subtle segmentation step based on segmentation point evaluation function is taken to further locate the segmentation points for the boundary areas computed by the rough segmentation step.Experimental results show that nearly 3/4 false segmentation points can be reduced comparing to the conventional audio segmentation method based on small-scale audio classification,while preserving a low missing rate.

Keywords:	audio classification audio segmentation segmentation point evaluation function false segmentation neural network
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏