排序方式: 共有45条查询结果,搜索用时 15 毫秒
41.
42.
受到模型规模大和计算量大的限制,经典的基于高斯混合模型的说话人识别方法不适合于资源有限的PDA平台实时说话人自动识别要求。以Mel倒谱系数为说话人特征,运用主成分分类技术,结合定点数计算技术实现实时说话人自动识别。在19个用户的语音库上进行系统识别实验,此新型分类技术的训练时间缩短为基线系统的1/50,测试时间缩短为1/12,模型规模缩小为1/6,同时识别性能达到94.7%。 相似文献
43.
借鉴人耳听觉原理和特征学习的优势,提出了梅尔(Mel)频率谱提取和稀疏非负矩阵分解相结合的方法用于低空飞行目标声信号识别。首先,以不同目标的Mel频率谱为特征矩阵,利用稀疏非负矩阵分解方法学习得到各自的模板矩阵;然后,利用按列合并后的模板矩阵对训练/测试样本进行特征分解获得编码系数,该系数可作为分类特征;最后,结合不同目标的特点,采用分频段特征提取和顺序二类分类的方法进行多目标分类,并与Mel频率倒谱系数进行性能比较。结果显示,无论在单类目标辨识还是在多类目标分类中,稀疏非负矩阵分解方法均取得了更好的效果。 相似文献
44.
45.
Speaker recognition is a major challenge in various languages for researchers. For programmed speaker recognition structure prepared by utilizing ordinary speech, shouting creates a confusion between the enlistment and test, henceforth minimizing the identification execution as extreme vocal exertion is required during shouting. Speaker recognition requires more time for classification of data, accuracy is optimized, and the low root-mean-square error rate is the major problem. The objective of this work is to develop an efficient system of speaker recognition. In this work, an improved method of Wiener filter algorithm is applied for better noise reduction. To obtain the essential feature vector values, Mel-frequency cepstral coefficient feature extraction method is used on the noise-removed signals. Furthermore, input samples are created by using these extracted features after the dimensions have been reduced using probabilistic principal component analysis. Finally, recurrent neural network-bidirectional long-short-term memory is used for the classification to improve the prediction accuracy. For checking the effectiveness, the proposed work is compared with the existing methods based on accuracy, sensitivity, and error rate. The results obtained with the proposed method demonstrate an accuracy of 95.77%. 相似文献