共查询到18条相似文献,搜索用时 93 毫秒
1.
2.
动态时间规整方法是一种以动态规整为基础的模式匹配方法,在模式识别特别是语音识别领域中有着广泛的应用。本文介绍将动态时间规整方法应用于说话人识别的方案,实验研究的结果表明,动态时间规整方法应用于说话人识别是行之有效的,而且实验的错误率可以降低到静态时间规整说话人识别实验的错误率的一半。 相似文献
3.
4.
5.
通过电能质量监测系统(power quality monitoring system, PQMS)中蕴含的电网历史故障变化、趋势等重要信息,对未来电压暂降进行预测,可为用户和电网公司合理规划生产,避免经济损失提供有力帮助。该文提出一种基于隐马尔可夫模型的电压暂降发生时间(occurrence time of voltage sag, OTVS)预测方法。首先对电压暂降发生时间的变量可预测性、数据冗余性、事件混沌性进行分析,揭示电压暂降监测数据特性;然后针对这三种特性,提出基于模糊C-均值聚类算法(fuzzy C-means algorithm, FCMA)和赤池信息准则(Akaike information criterion, AIC)的电压暂降历史状态识别与划分方法,以区间型变量刻画监测数据中的历史变化信息;建立考虑暂降历史变化信息和电网扰动变化信息的隐马尔可夫模型,实现对未来电压暂降的预测。最后,利用中部某省10个监测点的历史数据进行验证,所提方法的预测准确率最高可达92.85%,所提方法的预测性能较其他典型预测方法约高5%~30%。 相似文献
6.
7.
8.
提出了一种改进型隐马尔可夫模型/神经网络混合分类器,该分类器将隐马尔可夫模型的时间校正能力与神经网络的静态区分能力结合在一起。它首先利用循环无跳转HMM模型对第一测试特征序列进行全状态分割。将T帧特征序列按时间演化顺序校正成N帧平均状态序列。然后 交其作为RBF网络的输入矢量进行分类。实验结果表明,该分类器比单纯的神经网络或隐马尔可夫模型分类器具有更限的分类效果。 相似文献
9.
10.
11.
Abstract In this paper, the performance of several speech recognition techniques applied on the highly confusing Mandarin syllables were carefully compared, including dynamic time warping (DTW), the newly proposed DTW with superimposed weighting function (DTWW), the discrete hidden Markov models (DHMM) and the continuous hidden Markov models (CHMM). The vocabulary used here consists of 409 first tone isolated Mandarin syllables. Due to the fact that many confusing sets exist in this vocabulary, the accurate recognition of these syllables is relatively difficult, and all the recognition experiments were performed in the speaker dependent mode. After a series of 13 experiments, it was found that the recognition rate of the newly proposed DTWW (88.3) is higher than that of DTW (85.1), DHMM (65.0) and CHMM (83.9), and that the CPU time used for DTWW is 1.03 times that for DTW, 24 times that for DHMM and 4.3 times that for CHMM. In addition, the memory space required for DTWW and DTW is 3.4 times that of DHMM and 8.5 times that of CHMM. Therefore, DTWW has the highest recognition rate, DHMM has the fastest recognition speed, whereas CHMM appears to be very attractive when all the different factors including recognition rate, recognition speed and memory space requirement are considered. 相似文献
12.
针对基于"视频指纹"特征的视频检索算法在实际应用中存在视频亮度整体漂移、突变干扰以及视频再编辑的问题,提出了一种基于动态时间规划的视频特征检索改进算法.该算法在原"视频指纹"算法的基础上采用了新的视频检索策略:首先,对视频帧进行区域分割,将"视频指纹"由单分量扩展到多分量;然后,采用基于粒度的相似性比较算法,用比对每帧视频指纹的变化,取代比较视频指纹本身;最后,采用改进的动态时间规划算法进行视频特征匹配,定位目标视频.在算法评估中,建立了由74段广告视频和154段再编辑视频组成的测试集.通过试验证明,改进算法可以有效应对视频检索过程中的亮度整体漂移、突变干扰和视频再编辑问题,检索算法具有很好的鲁棒性. 相似文献
13.
14.
Jun BH 《Journal of hazardous materials》2011,185(1):262-268
This paper proposes a diagnosis system using dynamic time warping (DTW) and discriminant analysis with oxidation-reduction potential (ORP) and dissolved oxygen (DO) values for swine wastewater treatment. A full-scale sequencing batch reactor (SBR), which has an effective volume of 20 m(3), was auto-controlled, and the reaction phase was performed by a sub-cycle operation consisting of a repeated short cycle of the anoxic-aerobic step. Using ORP and DO profiles, SBR status was divided into four categories of normal and abnormal cases; these were influent disturbance, aeration controller fault, instrument trouble and inadequate raw wastewater feeding. Through the DTW process, difference values (D) were determined and classified into seven cases. In spite of the misclassification of high loading rates, the ORP profile provided good diagnosis results. However, the DO profiles detected five misclassifications that indicated different statuses. After the DTW process, several statistical values, including maximum value, minimum value, average value, standard deviation value and three quartile values, were extracted and applied to establish the discriminant function. The discriminant analysis allows one to classify seven cases with a percentage of 100% and 92.7% for ORP and DO profiles, respectively. Consequently, the study showed that ORP profiles are more efficient than DO profiles as diagnosis parameters and DTW diagnosis algorithms and discriminants. 相似文献
15.
口腔运动与人们的饮食规律息息相关,该文通过对口腔运动状态的分析识别来监测人们的饮食规律,以此来指导人们的饮食习惯。借助语音识别技术的思想和方法,分析识别口腔运动产生的骨导音,为提升识别效率,采用了传统的隐马尔可夫模型。基于隐马尔可夫模型建立了一套骨导音识别系统,在进行骨导音识别之前,通过分帧加窗、提取梅尔频率倒谱系数,对其进行模型训练;在识别过程中,找出与待测音频信号和模板库中匹配度最高的模型,以其模型输出结果作为最后的识别结果。该方法的识别结果可以达到 84%,实验结果表明该方法具有一定的可行性。 相似文献
16.
FCSS算法和FCM算法具有相同的优缺点,而与FCM和PcM相结合的聚类算法能很好的解决FCM算法容易陷入局部极值点的缺点。在本文中,首先证明了用此FCM-PCM模型来对曲线进行识别的可行性。数值实验表明:新方法对球壳型数据有令人满意的效果。 相似文献
17.
鲁棒语音识别技术在人机交互、智能家居、语音翻译系统等方面有重要应用。为了提高在噪声和语音干扰等复杂声学环境下的语音识别性能,基于人耳听觉系统的掩蔽效应和鸡尾酒效应,利用不同声源的空间方位,提出了基于双耳声源分离和丢失数据技术的鲁棒语音识别算法。该算法首先根据目标语音的空间方位信息,在双耳声信号的等效矩形带宽(Equivalent Rectangular Bandwidth,ERB)子带内进行混合语音信号的分离,从而得到目标语音的数据流。针对分离后目标语音在频域存在频谱数据丢失的问题,利用丢失数据技术修正基于隐马尔科夫模型的概率计算,再进行语音识别。仿真实验表明,由于双耳声源分离方法得到的目标语音数据去除了噪声和干扰的影响,所提出的算法显著提高了复杂声学环境下的语音识别性能。 相似文献
18.
Hsien-Cheng Liao 《中国工程学刊》2016,39(5):574-577
This paper proposes a maximum confidence measure-based closed-loop dual-microphone beamforming direction and beamwidth steering algorithm to facilitate robust speech recognition. This technique involves feeding back the confidence measure reported through a back-end speech recognizer, automatically steering a front-end microphone array to optimally identify the correct speaker direction and array beamwidth. The technique enables users to move around freely and directly improves overall system performance. The experimental results from a voice command task show that the proposed approach demonstrated superior performance. 相似文献