一种新的基于分类的音频流分割方法 A Novel Classification-Based Audio Segmentation Algorithm期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种新的基于分类的音频流分割方法

引用本文：	张一彬,周杰,边肇祺,张大鹏.一种新的基于分类的音频流分割方法[J].电子学报,2006,34(4):612-617.

作者姓名：	张一彬周杰边肇祺张大鹏

作者单位：	1. 清华大学自动化系,北京 100084;2. 香港理工大学计算机学系,中国香港

基金项目：	中国科学院资助项目，北京市自然科学基金

摘要：	很多传统的音频流分割方法都是基于小尺度音频分类的,它们普遍存在虚假分割点过多的缺点,严重影响了实际应用的效果.我们的研究表明,大尺度音频片段的分类正确率明显高于小尺度音频片段的分类正确率.基于这个事实和减少虚假分割点的目的,我们提出了一种新的基于分类的音频流分割方法.首先,采用基于大尺度分类的分割方法对音频流进行粗分割,然后采用基于小尺度分类的细分割步骤在边界区域中进一步精确定位分割点.理论分析和实验结果均表明,当处理类别变换频率较低的音频流时,这种分割方法在保持真实分割点检测率的同时能够大幅降低虚假分割率.
关键词：	音频分类音频分割虚假分割神经网络
文章编号：	0372-2112（2006）04-0612-06
收稿时间：	2005-03-21
修稿时间：	2005-03-212005-12-22
A Novel Classification-Based Audio Segmentation Algorithm

ZHANG Yi-bin,ZHOU Jie,BIAN Zhao-qi,ZHANG Da-peng.A Novel Classification-Based Audio Segmentation Algorithm[J].Acta Electronica Sinica,2006,34(4):612-617.

Authors:	ZHANG Yi-bin ZHOU Jie BIAN Zhao-qi ZHANG Da-peng

Affiliation:	1. Department of Automation,Tsinghua University,Beijing 100084,China;2. Department of Computing,Hong Kong Polytechnic University,Hong Kong,China

Abstract:	Content-based audio segmentation plays an important role in multimedia applications. Many conventional segmentation algorithms are based on small-scale classification and always result in a high false alarm rate. Our experimental results show that large-scale audio can be more easily classified than small ones, and this trend is irrespective of classifiers. According to this fact,we present a novel framework for audio segmentation to reduce the false seg- mentations. First,a rough segmentation step based on large-scale classification is taken to ensure the integrality of the content of segments. Then a subtle segmentation step based on small-scale classification is taken to further locate the segmentation points from the boundary areas computed by the rough segmentation step. Both theoretical analysis and ex- perimental results show that nearly 3/4 false segmentation points can be reduced comparing to the conventional audio segmentation method based on small-scale audio classification, while preserving a low missing rate, when infrequently type-changed audio streams are dealt. So it can be concluded that it is very suitable for the real tasks such as music broadcast segmentation or music video analysis.

Keywords:	audio classification audio segmentation false segmentation rate neural network
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《电子学报》浏览原始摘要信息
	点击此处可从《电子学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏