1.

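A Survey of Content-Based Audio Retrieval. Total citations: 14 (self-citations: 0, citations by others: 14)

Zhu Aihong (朱爱红), Li Lian (李连) 《Microcomputer Development (微机发展)》 2003, 13(12): 58-60, 64

Traditional text-based audio retrieval suffers from subjectivity and incompleteness, and it does not support real-time audio retrieval. Content-based audio retrieval emerged to address these shortcomings. Based on the current state of audio retrieval research, this paper surveys content-based audio retrieval methods and discusses several key techniques: audio feature extraction, audio classification, and speech recognition. It concludes with the outlook for audio retrieval technology.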

2.

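A Survey of Content-Based Audio Retrieval

Zhu Aihong (朱爱红), Li Lian (李连) 《Computer Technology and Development (计算机技术与发展)》 2003, 13(12)

Traditional text-based audio retrieval suffers from subjectivity and incompleteness, and it does not support real-time audio retrieval. Content-based audio retrieval emerged to address these shortcomings. Based on the current state of audio retrieval research, this paper surveys content-based audio retrieval methods and discusses several key techniques: audio feature extraction, audio classification, and speech recognition. It concludes with the outlook for audio retrieval technology.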

3.

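Typical Audio Classification Algorithms

Zheng Yiwen (郑怡文) 《Computer and Modernization (计算机与现代化)》 2007, (8): 59-63

Audio classification is an important means of extracting audio structure and content semantics, and it underpins content-based audio retrieval and analysis. This paper surveys several commonly used audio classification algorithms, describes the characteristics of typical ones (minimum distance, neural networks, support vector machines, decision trees, and hidden Markov models), and compares their strengths and weaknesses.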

4.

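Research on Key Technologies of Content-Based Audio Retrieval. Total citations: 4 (self-citations: 0, citations by others: 4)

Zhu Aihong (朱爱红), Li Lian (李连) 《Modern Computer (现代计算机)》 2003, (11): 37-40, 51

Audio is an important medium with rich auditory features. Based on current progress in audio retrieval research, this paper surveys content-based audio retrieval methods and discusses several key techniques: audio feature extraction, audio classification, and speech recognition. It concludes with the outlook for audio retrieval technology.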

5.

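A Survey of Content-Based Audio Retrieval Technology

Wu Chunhui (吴春辉), Chen Hongsheng (陈洪生) 《Fujian Computer (福建电脑)》 2010, 26(12): 37-38

Content-based audio retrieval is an important part of multimedia retrieval, yet its techniques have lagged behind, and it has become a research focus in the field. This paper analyzes and summarizes the concept of audio retrieval, surveys content-based audio retrieval methods and related techniques, and finally tests content-based audio retrieval on a simple system.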

6.

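Audio Classification and Segmentation Based on Support Vector Machines. Total citations: 8 (self-citations: 0, citations by others: 8)

Bai Liang (白亮), Lao Songyang (老松杨), Chen Jianyun (陈剑赟), Wu Lingda (吴玲达) 《Computer Science (计算机科学)》 2005, 32(4): 87-90

Audio classification and segmentation are important means of extracting audio structure and content semantics, and they underpin content-based audio and video retrieval and analysis. The support vector machine (SVM) is an effective statistical learning method. This paper proposes an SVM-based audio classification algorithm that divides audio into five classes: silence, noise, music, pure speech, and speech with background sound. After classification, three smoothing rules are applied to the results. The paper analyzes the classification performance of the SVM classifier and evaluates how the newly proposed audio features perform with it. Experimental results show that the SVM-based algorithm classifies well and that the segmentation obtained after smoothing is fairly accurate.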

7.

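Research and Implementation of Audio Information Retrieval. Total citations: 9 (self-citations: 0, citations by others: 9)

Song Bo (宋博), Xu De (须德) 《Computer Applications (计算机应用)》 2003, 23(12): 52-54

This paper introduces the common key techniques of content-based audio retrieval and the general methods of audio feature extraction, and discusses the key issues in recognizing audio examples with hidden Markov models (HMMs). On that basis it presents a framework and a working example of a content-based audio information retrieval system.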

8.

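Design of a Multi-Layer Audio Classifier Based on Web Services. Total citations: 1 (self-citations: 0, citations by others: 1)

Li Chao (李超), Xiong Zhang (熊璋), He Jing (贺静), Xue Ling (薛玲) 《Computer Engineering and Design (计算机工程与设计)》 2006, 27(4): 614-617

With the development of network technology and the growth of multimedia data, applications increasingly demand classification, querying, and retrieval of audio content. Standards such as MPEG-7 and MPEG-21 provide favorable external conditions for standardizing multimedia content and ease the transition to content-based retrieval services. This paper presents the design of a multi-layer audio classifier deployed in a distributed fashion over Web services. A feature agent serves as the front end for feature extraction, a classification agent serves as the back end for feature processing, each node uses a multi-classifier fusion strategy, and audio features are delivered through a publish/subscribe mechanism. The architecture is readily extensible and can therefore offer a platform service for applications such as audio content query and retrieval and audio search engines.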

9.

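Research on an Audio Classification Method Based on Single-State HMMs

Zheng Jiming (郑继明), Li Ruixian (李瑞仙), Pu Xingcheng (蒲兴成) 《Computer Applications (计算机应用)》 2009, 29(2): 392-394

The classical hidden Markov model (HMM) is a statistical signal model that plays an important role in content-based audio retrieval systems. Because audio classification depends on the type of the audio rather than on its specific content, this paper applies a single-state HMM to audio classification. This avoids a drawback of multi-state HMMs, whose initial state probabilities and transition probabilities are assigned at model initialization under assumptions that may be inaccurate. Experimental results show that the single-state HMM method effectively reduces the misrecognition rate and improves classification accuracy.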

10.

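Research on Audio Retrieval Technology

Li Chen (李晨); Zhou Mingquan (周明全) 《Microcomputer Development (微机发展)》 2008, (8): 215-218

Drawing on the current state of audio retrieval, this paper describes recent progress in related research, introduces the most commonly used audio retrieval methods, and discusses the key techniques involved: audio feature extraction, audio segmentation, and audio classification. Content-based music retrieval is an interdisciplinary field that draws on music theory, signal processing, pattern recognition, and related areas, and it matters greatly for music database management, Internet music retrieval, and everyday entertainment. The paper analyzes and summarizes the concepts of music content and its retrieval, presents the system structure of music retrieval, surveys content-based music retrieval methods, and finally outlines the prospects for audio retrieval.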

11.

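Design and Implementation of an SVM-Based Audio Classification System

Sun Wenjing (孙文静), Li Shiqiang (李士强) 《Computer Science (计算机科学)》 2010, 37(12): 209-210

This paper analyzes time-domain audio features and how to extract them, studies the workflow and architecture of a speech classification system based on support vector machines and the design of the SVM speech classifier, and reports related experiments. The results show that the SVM-based audio classification system classifies audio effectively, with an average recognition accuracy above 90%.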

12.

Content based audio classification: a neural network approach

Vikramjit Mitra Chia-Jiu Wang 《Soft Computing - A Fusion of Foundations, Methodologies and Applications》2008,12(7):639-646

Content based music genre classification is a key component for next generation multimedia search agents. This paper introduces an audio classification technique based on audio content analysis. Artificial Neural Networks (ANNs), specifically multi-layered perceptrons (MLPs), are implemented to perform the classification task. Windowed audio files of finite length are analyzed to generate multiple feature sets which are used as input vectors to a parallel neural architecture that performs the classification. This paper examines a combination of linear predictive coding (LPC), mel frequency cepstrum coefficients (MFCCs), Haar Wavelet, Daubechies Wavelet and Symlet coefficients as feature sets for the proposed audio classifier. Parallel to MLP, a Gaussian radial basis function (GRBF) based ANN is also implemented and analyzed. The obtained prediction accuracy of 87.3% in determining the audio genres supports the efficiency of the proposed architecture. The ANN prediction values are processed by a rule based inference engine (IE) that presents the final decision.

13.

Content-based audio classification and segmentation by using support vector machines. Total citations: 9 (self-citations: 0, citations by others: 9)

Lie Lu Hong-Jiang Zhang Stan Z. Li 《Multimedia Systems》2003,8(6):482-492

Content-based audio classification and segmentation is a basis for further audio/video analysis. In this paper, we present our work on audio segmentation and classification which employs support vector machines (SVMs). Five audio classes are considered in this paper: silence, music, background sound, pure speech, and non-pure speech which includes speech over music and speech over noise. A sound stream is segmented by classifying each sub-segment into one of these five classes. We have evaluated the performance of SVM on classifying different pairs of audio types with testing units of different lengths and compared the performance of SVM, K-Nearest Neighbor (KNN), and Gaussian Mixture Model (GMM). We also evaluated the effectiveness of some newly proposed features. Experiments on a database composed of about 4 hours of audio data show that the proposed classifier is very efficient on audio classification and segmentation. They also show that the accuracy of the SVM-based method is much better than that of the methods based on KNN and GMM.

14.

Audio-based description and structuring of videos

Hadi Harb Liming Chen 《International Journal on Digital Libraries》2006,6(1):70-81

15.

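Multi-Class Audio Classification Based on Support Vector Machines

Yu Yulian (俞玉莲), Guo Shijie (郭世杰) 《Computer Applications and Software (计算机应用与软件)》 2010, 27(4): 98-101

This paper studies a method for multi-class audio classification with support vector machines (SVMs), in which the augmented binary classification method (AB method) is used to design the multi-class classifier. The algorithm divides audio into four classes: music, pure speech, speech with background sound, and typical environmental sound. It analyzes eight discriminative features of these classes: two new features, the modified low energy ratio (MLER) and the modified pitch frequency (MPF), plus six basic features including total frequency-domain energy, sub-band energy, and frequency centroid. The paper then examines the classification accuracy of different feature sets with the SVM classifier. Experimental results show that the extracted audio features are effective and that SVM-based multi-class audio classification performs well.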

16.

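Research on Audio Content Segmentation and Clustering. Total citations: 1 (self-citations: 0, citations by others: 1)

Zhang Chunlin (张春林), Yang Yuhong (杨玉红), Hu Ruimin (胡瑞敏) 《Computer Engineering (计算机工程)》 2002, 28(7): 173-174

This paper analyzes the process of segmenting audio by using audio features to detect audio boundaries, presents a method for describing audio segments with Gaussian mixture models (GMMs), and describes the implementation of audio segment clustering. Experimental results indicate that both segmentation and clustering perform well.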

17.

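Analysis and Application of Windows Low-Level Audio Functions

Wang Weiqi (王玮琪) 《Modern Computer (现代计算机)》 2003, (6): 89-91

This paper analyzes the audio services provided by the Windows system and the structure of waveform audio files, focuses on how to use the low-level audio functions, and gives VC++ sample code drawn from a practical application.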

18.

Audio steganalysis based on reversed psychoacoustic model of human hearing

《Digital Signal Processing》2016

During the last decade, audio information hiding has attracted lots of attention due to its ability to provide a covert communication channel. On the other hand, various audio steganalysis schemes have been developed to detect the presence of any secret messages. Basically, audio steganography methods attempt to hide their messages in areas of the time or frequency domain that the human auditory system (HAS) does not perceive. Considering this fact, we propose a reliable audio steganalysis system based on the reversed Mel-frequency cepstral coefficients (R-MFCC), which aims to provide a model with maximum deviation from the HAS model. A genetic algorithm is deployed to optimize the dimension of the R-MFCC-based features. This both speeds up feature extraction and reduces the complexity of classification. The final decision is made by a trained support vector machine (SVM) to detect suspicious audio files. The proposed method achieves detection rates of 97.8% and 94.4% in the targeted (Steghide@1.563%) and universal scenarios. These results are respectively 17.3% and 20.8% higher than those of the previous D2-MFCC based method.