首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 20 毫秒
1.
This paper considers the blind separation of nonstationary sources in the underdetermined convolutive mixture case. We introduce, two methods based on the sparsity assumption of the sources in the time-frequency (TF) domain. The first one assumes that the sources are disjoint in the TF domain, i.e., there is at most one source signal present at a given point in the TF domain. In the second method, we relax this assumption by allowing the sources to be TF-nondisjoint to a certain extent. In particular, the number of sources present (active) at a TF point should be strictly less than the number of sensors. In that case, the separation can be achieved thanks to subspace projection which allows us to identify the active sources and to estimate their corresponding time-frequency distribution (TFD) values. Another contribution of this paper is a new estimation procedure for the mixing channel in the underdetermined case. Finally, numerical performance evaluations and comparisons of the proposed methods are provided highlighting their effectiveness.  相似文献   

2.
提出一种基于时频分析的卷积混合盲分离算法.由于信号源与各传感器的距离不同,在传播的过程中会产生不同的幅度衰减和时间延迟.该算法用短时傅里叶变换对语音信号进行时频分析,将其中一个传感器信号作为参考信号,构造了源信号的幅度衰减向量和时间延迟向量.根据语音信号的时频域稀疏性,以这两个向量为特征,在时频域上对传感器信号进行聚类,再通过估计参考信号混合系数来获得源信号时频域表示,进一步得到源信号.该方法可以用于源信号数目大于传感器信号数目的情况.仿真实验证明,算法可以完成欠定情况下卷积混合信号的盲分离,分离结果令人满意.  相似文献   

3.
Looking at the speaker's face can be useful to better hear a speech signal in noisy environment and extract it from competing sources before identification. This suggests that the visual signals of speech (movements of visible articulators) could be used in speech enhancement or extraction systems. In this paper, we present a novel algorithm plugging audiovisual coherence of speech signals, estimated by statistical tools, on audio blind source separation (BSS) techniques. This algorithm is applied to the difficult and realistic case of convolutive mixtures. The algorithm mainly works in the frequency (transform) domain, where the convolutive mixture becomes an additive mixture for each frequency channel. Frequency by frequency separation is made by an audio BSS algorithm. The audio and visual informations are modeled by a newly proposed statistical model. This model is then used to solve the standard source permutation and scale factor ambiguities encountered for each frequency after the audio blind separation stage. The proposed method is shown to be efficient in the case of 2 times 2 convolutive mixtures and offers promising perspectives for extracting a particular speech source of interest from complex mixtures  相似文献   

4.
5.
研究了一种新的线性卷积混合信号的盲分离算法。该算法通过计算预白化观测数据的零时延和多时延自相关协方差矩阵,获得了多时延处理的二阶解相关统计信息。利用得到的二阶统计信息构建了两个对称正定矩阵,通过使用Cholesky分解和奇异值分解等一系列变换,得出了惟一存在的矩阵。理论分析表明,该矩阵可以使两个正定矩阵同时精确对角化。计算机仿真表明,该算法与已有算法相比,运算时间短,盲分离性能更优。  相似文献   

6.
In this paper, a novel solution is developed to solve blind source separation of postnonlinear convolutive mixtures. The proposed model extends the conventional linear instantaneous mixture model to include both convolutive mixing and postnonlinear distortion. The maximum-likelihood (ML) approach solution based on the expectation-maximization (EM) algorithm is developed to estimate the source signals and the parameters in the proposed nonlinear model. In the proposed solution, the sufficient statistics associated with the source signals are estimated in the E-step, while the model parameters are optimized through these statistics in the M-step. However, the complication resulted from the postnonlinear function associated with the mixture renders these statistics difficult to be formulated in a closed form and hence causes intractability in the parameter optimization. A computationally efficient algorithm is proposed which uses the extended Kalman smoother (EKS) to facilitate the E-step tractable and a set of self-updated polynomials is used as the nonlinearity estimator to facilitate closed form estimations of the parameters in the M-step. The theoretical foundation of the proposed solution has been rigorously developed and discussed in details. Both simulations and recorded speech signals have been carried out to verify the success and efficacy of the proposed algorithm. Remarkable improvement has been obtained when compared with the existing algorithms.  相似文献   

7.
解元  邹涛  孙为军  谢胜利 《自动化学报》2023,49(5):1062-1072
卷积混叠环境下的盲源分离(Blind source separation, BSS)是一个极具挑战性和实际意义的问题. 本文在独立分量分析框架下, 建立非负矩阵分解(Nonnegative matrix factorization, NMF)模型, 设计新的优化目标函数, 通过严格的数学理论推导, 得到新的模型参数更新规则; 并对解混叠矩阵进行标准化处理, 避免幅度歧义性问题; 在源信号的重构阶段, 通过实时更新非负矩阵分解模型参数, 避免源信号的排序歧义性问题. 实验结果验证了所提算法在分离中英文语音混叠信号、音乐混叠信号时的有效性和优越性.  相似文献   

8.
基于FastICA算法的盲源分离   总被引:2,自引:0,他引:2  
近年来,ICA(Independent Component Analysis,独立成分分析)已成为处理BSS(Blind Source Separation,盲源分离)问题的主要手段,同时也受到人们越来越多的关注,为此讨论ICA的原理及其优越性.首先介绍ICA,然后引入FastICA算法的推导过程,最后通过MATLAB仿真将其与梯度算法、PCA(Principal Component Analysis,主成分分析)算法所得的仿真结果进行对比分析.通过算法验证,经FastICA处理得到的分离信号与源信号相关系数的绝对值不小于0.99,与其他两种算法比较可以明显地得到FastICA是一种更为有效的盲源分离方法.  相似文献   

9.
In this paper, we consider the problem of separation of unknown number of sources from their underdetermined convolutive mixtures via time-frequency (TF) masking. We propose two algorithms, one for the estimation of the masks which are to be applied to the mixture in the TF domain for the separation of signals in the frequency domain, and the other for solving the permutation problem. The algorithm for mask estimation is based on the concept of angles in complex vector space. Unlike the previously reported methods, the algorithm does not require any estimation of the mixing matrix or the source positions for mask estimation. The algorithm clusters the mixture samples in the TF domain based on the Hermitian angle between the sample vector and a reference vector using the well known k -means or fuzzy c -means clustering algorithms. The membership functions so obtained from the clustering algorithms are directly used as the masks. The algorithm for solving the permutation problem clusters the estimated masks by using k-means clustering of small groups of nearby masks with overlap. The effectiveness of the algorithm in separating the sources, including collinear sources, from their underdetermined convolutive mixtures obtained in a real room environment, is demonstrated.  相似文献   

10.
We consider inference in a general data-driven object-based model of multichannel audio data, assumed generated as a possibly underdetermined convolutive mixture of source signals. We work in the short-time Fourier transform (STFT) domain, where convolution is routinely approximated as linear instantaneous mixing in each frequency band. Each source STFT is given a model inspired from nonnegative matrix factorization (NMF) with the Itakura–Saito divergence, which underlies a statistical model of superimposed Gaussian components. We address estimation of the mixing and source parameters using two methods. The first one consists of maximizing the exact joint likelihood of the multichannel data using an expectation-maximization (EM) algorithm. The second method consists of maximizing the sum of individual likelihoods of all channels using a multiplicative update algorithm inspired from NMF methodology. Our decomposition algorithms are applied to stereo audio source separation in various settings, covering blind and supervised separation, music and speech sources, synthetic instantaneous and convolutive mixtures, as well as professionally produced music recordings. Our EM method produces competitive results with respect to state-of-the-art as illustrated on two tasks from the international Signal Separation Evaluation Campaign (SiSEC 2008).   相似文献   

11.
Identification and verification of a video clip via its fingerprint find applications in video browsing, database search and security. For this purpose, the video sequence must be collapsed into a short fingerprint using a robust hash function based on signal processing operations. We propose two robust hash algorithms for video based both on the discrete cosine transform (DCT), one on the classical basis set and the other on a novel randomized basis set (RBT). The robustness and randomness properties of the proposed hash functions are investigated in detail. It is found that these hash functions are resistant to signal processing and transmission impairments, and therefore can be instrumental in building database search, broadcast monitoring and watermarking applications for video. The DCT hash is more robust, but lacks security aspect, as it is easy to find different video clips with the same hash value. The RBT based hash, being secret key based, does not allow this and is more secure at the cost of a slight loss in the receiver operating curves  相似文献   

12.
基于稳健联合分块对角化的卷积盲分离   总被引:1,自引:0,他引:1  
汤辉  王殊 《自动化学报》2013,39(9):1502-1510
针对卷积盲分离问题,提出一种新的矩阵联合分块对角化(Joint block diagonalization, JBD)算法. 现有的迭代非正交联合分块对角化算法都存在不收敛的情况,本文利用分离矩阵的特殊结构确保其可逆性,使得算法的迭代过程稳定. 在已知矩阵分块结构的条件下,首先,将卷积盲分离模型写成瞬时形式,并说明其满足联合分块对角化结构; 然后,提出联合分块对角化的代价函数,依据代价函数的最小化等价于矩阵中每个分块的范数最小化, 将整个分离矩阵的迭代更新转化成每个分块的迭代更新;最后,利用最小化条件得到迭代算法. 实数和复数两种情况下的算法都进行了推导.基本实验验证了新算法在不同条件下的性能; 仿真实验中对在时域和频域都重叠的信号的卷积混合进行盲分离,实验结果验证了新算法具有更好的分离性能和更稳定的分离能力.  相似文献   

13.
We present here a theory of motion from a topological point of view, in a symbolic perspective. Taking space–time histories of objects as primitive entities, we introduce temporal and topological relations on the thus defined space–time to characterize classes of spatial changes. The theory thus accounts for qualitative spatial information, dealing with underspecified, symbolic information when accurate data are not available or unnecessary. We show that these structures give a basis for commonsense spatio–temporal reasoning by presenting a number of significant deductions in the theory. This can serve as a formal basis for languages describing motion events in a qualitative way.  相似文献   

14.
文威  张杭 《系统仿真技术》2011,7(4):318-323
频域方法可以有效地解决卷积混合盲源分离问题.针对频域方法中存在排序模糊,基于分离信号相邻频点功率谱密度的相关性较高的原理,提出1种改进的排序模糊消除算法.相比于原算法,扩展了参考频点的取值范围,同时还采用了1种置信度量方法,能够获得更准确的排序估计.仿真实验表明所提算法有效地消除了排序模糊,并且能够纠正某一频点排序的突...  相似文献   

15.
Various techniques have previously been proposed for the separation of convolutive mixtures. These techniques can be classified as stochastic, adaptive, and deterministic. Stochastic methods are computationally expensive since they require an iterative process for the calculation of the demixing filters based on a separation criterion that usually assumes that the source signals are statistically independent. Adaptive methods, such as the adaptive beamformers, also exploit signal properties in order to optimize a multichannel filter structure. However, these algorithms need initialization and time to converge. Deterministic methods, on the other hand, provide a closed-form solution based on the deterministic aspects of the problem, such as the channel characteristics and the source directions. This paper presents a technique that exploits the intensity vector statistics to achieve a nearly closed-form solution for the separation of the convolutive mixtures as recorded with a coincident microphone array. No assumptions are made on the signals, but it is assumed that the source directions are known a priori. Directivity functions based on von Mises functions are designed for beamforming depending on the circular statistics of the calculated intensity vectors. Numerical evaluation results were presented for various speech and instrument sounds and source positions in two reverberant rooms.  相似文献   

16.
欠定条件下的盲分离算法   总被引:8,自引:0,他引:8  
盲信号分离中当源信号个数大于观测信号个数,且源信号不是足够稀疏时,如果利用聚类算法进行分离,分离效果将会变差。为此提出一种在此欠定条件下新的盲信号分离算法。利用源信号的“稀疏性”估计混合矩阵,然后简化混合矩阵构造新的混合模型。由于源信号间具有的独立性,使得可以在新的混合模型中从观察信号的自相关函数中估计出源信号的频谱,从而达到分离出源信号的目的,且分离效果优于聚类算法。最后给出仿真试验实例,试验结果验证了算法的有效性。  相似文献   

17.
18.
Convolutive blind source separation (CBSS) that exploits the sparsity of source signals in the frequency domain is addressed in this paper. We assume the sources follow complex Laplacian-like distribution for complex random variable, in which the real part and imaginary part of complex-valued source signals are not necessarily independent. Based on the maximum a posteriori (MAP) criterion, we propose a novel natural gradient method for complex sparse representation. Moreover, a new CBSS method is further developed based on complex sparse representation. The developed CBSS algorithm works in the frequency domain. Here, we assume that the source signals are sufficiently sparse in the frequency domain. If the sources are sufficiently sparse in the frequency domain and the filter length of mixing channels is relatively small and can be estimated, we can even achieve underdetermined CBSS. We illustrate the validity and performance of the proposed learning algorithm by several simulation examples.  相似文献   

19.
基于修正离散傅里叶变换的频域卷积混合盲分离   总被引:1,自引:0,他引:1  
针对频域卷积混合盲分离,依据所导出的卷积混合信号每帧的频域表示模型,提出了一种最小均方误差意义下的最优变换--修正离散傅里叶变换,用于代替频域卷积混合盲分离中常用的离散傅里叶变换.在每个频率片上,卷积混合信号的修正离散傅里叶变换系数在最小均方误差意义下最接近于源信号频谱的瞬时混合.相对于离散傅里叶变换系数,现有瞬时混合盲分离算法能从修正离散傅里叶变抉系数中更精确地估计各频率片上分离矩阵,从而提高现有频域卷积混合盲分离算法的分离性能.仿真结果证明了修正离散傅里叶变换对现有频域卷积混合盲分离算法的有效性.  相似文献   

20.
解卷积混合语音频域盲分离的次序问题新方法   总被引:1,自引:0,他引:1  
多通道语音信号的混合往往是卷积混合,瞬时盲分离方法不能获得好的分离效果,而频域方法由于频率次序的问题使性能下降.本文采用时频掩模的方法得到各频点上具有确定次序的、但带有失真的分离信号,将其作为参考,与频域上解得的次序不定信号进行相关,从而获得精确的语音分离信号.实验表明:本文提出的方法能有效地解决频域盲分离的次序不确定性问题,得到精度更高的分离卷积混舍的语音信号.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号