Abstract
In this paper, we propose a noble algorithm for mixing matrix estimation in Underdetermined blind source separation. A concept of confidence measure and that of being able to be single source points at all the points in time frequency plane are introduced in the proposed algorithm. At first, we can detect the single source points from real parts and imaginary parts of time frequency coefficients of mixture signals and calculate principal vectors and its confidence measures through principal component analysis at the single source points using this algorithm. Finally, mixing matrix is obtained by clustering principal vectors according to its confidence measure. Experimental results show that the proposed algorithm is very suitable for actual situation of UBSS.







Similar content being viewed by others
References
A. Aïssa-El-Bey, N. Linh-Trung, K. Abed-Meraim, A. Belouchrani, Y. Grenier, Underdetermined blind separation of non-disjoint sources in the time-frequency domain. IEEE Trans. Signal Process. 55(3), 897–907 (2007)
S. Arberet, R. Gribonval, F. Bimbot, A robust method to count and locate audio sources in a stereophonic linear instantaneous mixture, in ICA, pp. 536-543 (2006)
S. Arberet, R. Gribonval, F. Bimbot, A robust method to count and locate audio sources in a stereophonic linear anechoic mixture. ICASSP 2007, 745–748 (2007)
S. Arberet, R. Gribonval, F. Bimbot, A robust method to count and locate audio sources in a multichannel underdetermined mixture. IEEE Trans. Signal Process. 58(1), 121–133 (2010)
P. Bofill, M. Zibulevsky, Underdetermined blind source separation using sparse representations. Signal Process. 81, 2353–2362 (2001)
M. Davies, N. Mitianoudis, Simple mixture model for sparse overcomplete ICA. IEEE Proc. Vis. Image Signal Process. 151(1), 35–43 (2004)
Y. Deville, Sparse component analysis: a general framework for linear and nonlinear blind source separation and mixture identification, in Blind Source Sep. ed. by G.R. Naik, W. Wang (Springer, Berlin, Heidelberg, 2014), pp. 151–196
Q. Guo, G. Ruan, Y. Liao, A time-frequency domain underdetermined blind source separation algorithm for MIMO radar signals. Symmetry 9, 104 (2017)
W.K. Härdle, L. Simar (eds.) Applied Multivariate Statistical Analysis. Springer, Berlin, Heidelberg (2007)
X. He, F. He, W. Cai, Underdetermined BSS based on K-means and AP clustering. Circuits Syst. Signal Process. 35, 2881–2913 (2016)
Y. Li, W. Nie, F. Ye, Y. Lin, A mixing matrix estimation algorithm for underdetermined blind source separation. Circuits Syst. Signal Process. 35, 3367–3379 (2016)
W. Li, H. Yang, A non-linear blind source separation method based on perceptron structure and conjugate gradient algorithm. Circuits Syst. Signal Process. 33, 3573–3590 (2014)
C. Liu, Y. Li, W. Nie, A new underdetermined blind source separation algorithm under the anechoic mixing model. In: Proceedings of ICSP, pp. 1799-1803 (2016)
J. Lu, W. Cheng, Y. Zi, A novel underdetermined blind source separation method and its application to source contribution quantitative estimation. Sensors 19, 1413 (2019)
T. Peng, Y. Chen, Z. Liu, A time-frequency domain blind source separation method for underdetermined instantaneous mixtures. Circuits Syst. Signal Process. 34, 3883–3895 (2015)
R. Qi, Y. Zhang, H. Li, Overcomplete blind source separation based on generalized Gaussian function and SL0 norm. Circuits Syst. Signal Process. 34, 2255–2270 (2015)
V.G. Reju, S.N. Koh, I.Y. Soon, An algorithm for mixing matrix estimation in instantaneous blind source separation. Signal Process. 89, 1762–1773 (2009)
J.J. Thiagarajan, K.N. Ramamurthy, A. Spanias, Mixing matrix estimation using discriminative clustering for blind source. Digit. Signal Process. 23, 9–18 (2013)
E. Vincent, S. Araki, P. Bofill, Signal separation evaluation campaign. in (SiSEC 2008)/Underdetermined speech and music mixtures task results (2008), http://www.irisa.fr/metiss/SiSEC08/SiSEC_underdetermined/dev2_eval.html
S. Xie, L. Yang, J.M. Yang, G. Zhou, Y. Xiang, Time-frequency approach to underdetermined blind source separation. IEEE Trans. Neural Netw. Learn. Syst. 23(2), 306–316 (2012)
Q. Yi, Blind source separation by weighted K-means clustering. J. Syst. Eng. Electron. 19(5), 882–887 (2008)
L. Zhen, D. Peng, Z. Yi, Y. Xiang, P. Chen, Underdetermined blind source separation using sparse coding. IEEE Trans. Neural Netw. Learn. Syst. 28, 3102–3108 (2017)
Acknowledgements
The authors thank the researchers, Simon Arberet, Rémi Gribonval, Frédéric Bimbot, V. G. Reju, Soo Ngee Koh, Ing Yann Soon and their papers. The authors thank also the editor in chief and the anonymous referees for their very strict comments and valuable suggestions to improve the quality of this paper.
Author information
Authors and Affiliations
Contributions
JGR, SHJ contributed to conceptualization; JGR helped in methodology; JGR, KSK contributed to formal analysis and investigation; SHJ contributed to resources; JGR contributed to writing; JGR, WCK contributed to editing.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Consent to participate
All authors read and approved the final manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix: Statistical Analysis
Appendix: Statistical Analysis
From assumption that the mixture has some SSPs in TF plane and at every SSP, in neighborhood of this point only one signal is dominant, all columns of matrix \(\mathbf{X} _{\varOmega _{t,f}}\) must be same column of mixing matrix \(\mathbf{A} \). However, the absolute direction of matrix \(\mathbf{X} _{\varOmega _{t,f}}\) are different. Hence, at SSP (t, f), we can choose the principal vector \(\hat{\theta }(t,f)\) with a unit norm as the vector that is collinear with the column of mixing matrix \(\mathbf{A} \). Now we model the STFT coefficients of the dominant source in the TF region \(\varOmega _{t,f}\) with a centered normal distribution of large variance \(\sigma ^2_1\), and the rest of the sources as centered normal distribution with variances \(\sigma ^2_2\ge \cdots \ge \sigma ^2_M\), respectively. Suppose that noise \(\mathbf{n} \) has centered normal distribution with variance \(\sigma ^2_{noise}\), i.e., \(\mathbf{n} \sim N(0,\sigma ^2_{noise}{} \mathbf{I} _M)\). For point \((t',f') \in \varOmega _{t,f}\), we have the following expression:
where \(s_i(t',f') \sim N(0,\sigma ^2_i)\) and \(\mathbf{a} _i\) is column of mixing matrix \(\mathbf{A} \) corresponding to i-th strong source of mixture. We’d like to stress that \(\mathbf{a} _i\) may be not i-th column of \(\mathbf{A} \). Therefore, \(\mathbf{X} (t',f') \sim N(0,\varSigma ), ~\varSigma =\sum ^{M}_{i=1}\sigma ^2_i \mathbf{a} _i \mathbf{a} _i^T+\sigma ^2_{noise}{} \mathbf{I} _M\). Let the eigenvalues of the covariance matrix \(\varSigma \) be \(\lambda _1\ge \cdots \ge \lambda _M\) and a unit eigenvectors corresponding with \(\lambda _i\) be \(\mathbf{u} _i\). If \(\sigma ^2_2= \cdots \sigma ^2_M=\sigma ^2_{noise}=0\), then \(\mathbf{a} _1\) is collinear with \(\mathbf{u} _1\). At this point, we can regard the ratio \(\frac{\lambda _1}{\sum ^M_{i=1}\lambda _i}\) as a measure of how accurately the first principal component represents a space spanned by columns of the mixing matrix \(\mathbf{A} \). Since this is relatively measured, we can replace the ratio \(\frac{\lambda _1}{\sum ^M_{i=1}\lambda _i}\) with the ratio \(T=\frac{\lambda _1}{\sum ^M_{i=2}\lambda _i}\). On the other hand, according to [9, Theorem 5.7], the matrix \(\mathbf{X} _{\varOmega _{t,f}}{} \mathbf{X} _{\varOmega _{t,f}}^T\) has a Wishart distribution \(W_M(\varSigma ,card(\varOmega _{t,f})-1)\) and its expectation is \((card(\varOmega _{t,f})-1)\varSigma \), i.e.,
Hence, \(\hat{T}(t,f)\) in Eq. (7) is an estimator of the ratio \(\frac{\lambda _1}{\sum ^M_{i=2}\lambda _i}\). Then, from [9, Theorem 9.4], we can conclude that \(\sqrt{card(\varOmega _{t,f})-1}(\hat{\theta }(t,f)-\theta (t,f))\) has asymptotically normal distribution
and \(\sqrt{card(\varOmega _{t,f})-1}(\hat{\lambda }_1-\lambda _1)\) has asymptotically normal distribution
where \(\mathbf{V} _1=\lambda _1\sum ^M_{k=2}\lambda _k\mathbf{u} _k\mathbf{u} _k^T/(\lambda _k-\lambda _1)^2, ~\theta (t,f)=\mathbf{u} _1\) and \(\hat{\lambda }_1\) is an estimator of \(\lambda _1\). Now taking \(f(\lambda )=\lambda _1/\sum ^M_{i=2}\lambda _i=T\), according to [9, Theorem 4.11]
where \(\mathbf{D} =(\partial f/\partial t_i)|_{t=\lambda }\) is the M-dimensional vector. From this we have Eq. (8). In Eqs. (9) and (11), we have \(q_1\) and \(q_2\) as \(q_1=q_2=\sqrt{2}\times erfinv(0.98)=2.3263 \approx 2.33\), where erfinv is an inverse error function in MATLAB.
Rights and permissions
About this article
Cite this article
Ri, JG., Jang, SH., Kim, WC. et al. An Algorithm for Mixing Matrix Estimation in Underdetermined Blind Source Separation. Circuits Syst Signal Process 41, 5049–5064 (2022). https://doi.org/10.1007/s00034-022-02020-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00034-022-02020-8