P. Comon, Independent component analysis, a new concept?Signal Process.36(3), 287–314 (1994).
P. Smaragdis, Blind separation of convolved mixtures in the frequency domain. Neurocomputing. 22:, 21–34 (1998).
S. Kurita, H. Saruwatari, S. Kajita, K. Takeda, F. Itakura, in Proc. ICASSP. Evaluation of blind signal separation method using directivity pattern under reverberant conditions, vol. 5 (IEEE, 2000), pp. 3140–3143.
N. Murata, S. Ikeda, A. Ziehe, An approach to blind source separation based on temporal structure of speech signals. Neurocomputing. 41(1–4), 1–24 (2001).
H. Saruwatari, T. Kawamura, T. Nishikawa, A. Lee, K. Shikano, Blind source separation based on a fast-convergence algorithm combining ICA and beamforming. IEEE Trans. ASLP. 14(2), 666–678 (2006).
H. Sawada, R. Mukai, S. Araki, S. Makino, A robust and precise method for solving the permutation problem of frequency-domain blind source separation. IEEE Trans. SAP. 12(5), 530–538 (2004).
A. Hiroe, in Proc. ICA. Solution of permutation problem in frequency domain ICA using multivariate probability density functions (SpringerBerlin, Heidelberg, 2006), pp. 601–608.
T. Kim, T. Eltoft, T.-W. Lee, in Proc. ICA. Independent vector analysis: an extension of ICA to multivariate components (SpringerBerlin, Heidelberg, 2006), pp. 165–172.
T. Kim, H.T. Attias, S.-Y. Lee, T.-W. Lee, Blind source separation exploiting higher-order frequency dependencies. IEEE Trans. ASLP. 15(1), 70–79 (2007).
N. Ono, in Proc. WASPAA. Stable and fast update rules for independent vector analysis based on auxiliary function technique (IEEE, 2011), pp. 189–192.
D. Kitamura, N. Ono, H. Sawada, H. Kameoka, H. Saruwatari, Determined blind source separation unifying independent vector analysis and nonnegative matrix factorization. IEEE/ACM Trans. ASLP. 24(9), 1626–1641 (2016).
D. Kitamura, N. Ono, H. Sawada, H. Kameoka, H. Saruwatari, in Audio Source Separation, ed. by S. Makino. Determined blind source separation with independent low-rank matrix analysis (SpringerCham, 2018), pp. 125–155.
T. Tachikawa, K. Yatabe, Y. Oikawa, in Proc. IWAENC. Underdetermined source separation with simultaneous DOA estimation without initial value dependency (IEEE, 2018), pp. 161–165.
D.D. Lee, H.S. Seung, Learning the parts of objects by non-negative matrix factorization. Nature. 401(6755), 788–791 (1999).
D.D. Lee, H.S. Seung, in Proc. NIPS. Algorithms for non-negative matrix factorization, (2000), pp. 556–562.
C. Févotte, N. Bertin, J.-L. Durrieu, Nonnegative matrix factorization with the Itakura-Saito divergence. With application to music analysis. Neural Comput.21(3), 793–830 (2009).
Y. Mitsui, D. Kitamura, S. Takamichi, N. Ono, H. Saruwatari, in Proc. ICASSP. Blind source separation based on independent low-rank matrix analysis with sparse regularization for time-series activity (IEEE, 2017), pp. 21–25.
H. Kagami, H. Kameoka, M. Yukawa, in Proc. ICASSP. Joint separation and dereverberation of reverberant mixtures with determined multichannel non-negative matrix factorization (IEEE, 2018), pp. 31–35.
R. Ikeshita, Y. Kawaguchi, in Proc. ICASSP. Independent low-rank matrix analysis based on multivariate complex exponential power distribution (IEEE, 2018), pp. 741–745.
D. Kitamura, S. Mogami, Y. Mitsui, N. Takamune, H. Saruwatari, N. Ono, Y. Takahashi, K. Kondo, Generalized independent low-rank matrix analysis using heavy-tailed distributions for blind source separation. EURASIP J. Adv. Signal Process.2018:, 28 (2018).
K. Yoshii, K. Kitamura, Y. Bando, E. Nakamura, T. Kawahara, in EUSIPCO. Independent low-rank tensor analysis for audio source separation (IEEE, 2018), pp. 1657–1661.
R. Ikeshita, in EUSIPCO. Independent positive semidefinite tensor analysis in blind source separation (IEEE, 2018), pp. 1652–1656.
R. Ikeshita, N. Ito, T. Nakatani, H. Sawada, in WASPAA. Independent low-rank matrix analysis with decorrelation learning (IEEE, 2019), pp. 288–292.
N. Makishima, S. Mogami, N. Takamune, D. Kitamura, H. Sumino, S. Takamichi, H. Saruwatari, N. Ono, Independent deeply learned matrix analysis for determined audio source separation. IEEE/ACM Trans. ASLP. 27(10), 1601–1615 (2019).
K. Sekiguchi, Y. Bando, A.A. Nugraha, K. Yoshii, T. Kawahara, Semi-supervised multichannel speech enhancement with a deep speech prior. IEEE/ACM Trans. ASLP. 27(12), 2197–2212 (2019).
S. Mogami, N. Takamune, D. Kitamura, H. Saruwatari, Y. Takahashi, K. Kondo, N. Ono, Independent low-rank matrix analysis based on time-variant sub-Gaussian source model for determined blind source separation. IEEE/ACM Trans. ASLP. 28:, 503–518 (2019).
Y. Takahashi, D. Kitahara, K. Matsuura, A. Hirabayashi, in Proc. ICASSP. Determined source separation using the sparsity of impulse responses (IEEE, 2020), pp. 686–690.
M. Togami, in Proc. ICASSP. Multi-channel speech source separation and dereverberation with sequential integration of determined and underdetermined models (IEEE, 2020), pp. 231–235.
S. Kanoga, T. Hoshino, H. Asoh, Independent low-rank matrix analysis-based automatic artifact reduction technique applied to three BCI paradigms. Front. Hum. Neurosci.14:, 17 (2020).
D. Kitamura, N. Ono, H. Saruwatari, in Proc. EUSIPCO. Experimental analysis of optimal window length for independent low-rank matrix analysis, (2017), pp. 1210–1214.
Y. Liang, S.M. Naqvi, J. Chambers, Overcoming block permutation problem in frequency domain blind source separation when using AuxIVA algorithm. Electron. Lett.48(8), 460–462 (2012).
K. Yatabe, Consistent ICA: determined BSS meets spectrogram consistency. IEEE Signal Process. Lett.27:, 870–874 (2020).
T. Gerkmann, M. Krawczyk-Becker, J. Le Roux, Phase processing for single-channel speech enhancement: history and recent advances. IEEE Signal Process. Mag.32(2), 55–66 (2015).
P. Mowlaee, R. Saeidi, Y. Stylianou, Advances in phase-aware signal processing in speech communication. Speech Commun.81:, 1–29 (2016).
P. Mowlaee, J. Kulmer, J. Stahl, F. Mayer, Single channel phase-aware signal processing in speech communication: theory and practice (Wiley, 2016).
K. Yatabe, Y. Oikawa, in Proc. ICASSP. Phase corrected total variation for audio signals (IEEE, 2018), pp. 656–660.
K. Yatabe, Y. Masuyama, Y. Oikawa, in Proc. IWAENC. Rectified linear unit can assist Griffin–Lim phase recovery (IEEE, 2018), pp. 555–559.
Y. Masuyama, K. Yatabe, Y. Oikawa, in Proc. IWAENC. Model-based phase recovery of spectrograms via optimization on Riemannian manifolds (IEEE, 2018), pp. 126–130.
Y. Masuyama, K. Yatabe, Y. Oikawa, Griffin–Lim like phase recovery via alternating direction method of multipliers. IEEE Signal Process. Lett.26(1), 184–188 (2019).
Y. Masuyama, K. Yatabe, Y. Koizumi, Y. Oikawa, N. Harada, in Proc. ICASSP. Deep Griffin–Lim iteration (IEEE, 2019), pp. 61–65.
Y. Masuyama, K. Yatabe, Y. Oikawa, in Proc. ICASSP. Phase-aware harmonic/percussive source separation via convex optimization (IEEE, 2019), pp. 985–989.
Y. Masuyama, K. Yatabe, Y. Oikawa, in Proc. ICASSP. Low-rankness of complex-valued spectrogram and its application to phase-aware audio processing (IEEE, 2019), pp. 855–859.
Y. Masuyama, K. Yatabe, Y. Koizumi, Y. Oikawa, N. Harada, in Proc. ICASSP. Phase reconstruction based on recurrent phase unwrapping with deep neural networks (IEEE, 2020), pp. 826–830.
J.L. Roux, H. Kameoka, N. Ono, S. Sagayama, in Proc. DAFx. Fast signal reconstruction from magnitude STFT spectrogram based on spectrogram consistency, (2010).
J. Le Roux, E. Vincent, Consistent Wiener filtering for audio source separation. IEEE Signal Process. Lett.20(3), 217–220 (2013).
N. Perraudin, P. Balazs, P.L. Søndergaard, in Proc. WASPAA. A fast Griffin–Lim algorithm (IEEE, 2013), pp. 1–4.
K. Yatabe, Y. Masuyama, T. Kusano, Y. Oikawa, Representation of complex spectrogram via phase conversion. Acoust. Sci. Tech.40(3), 170–177 (2019).
M. Kowalski, E. Vincent, R. Gribonval, Beyond the narrowband approximation: wideband convex methods for under-determined reverberant audio source separation. IEEE Trans. ASLP. 18(7), 1818–1829 (2010).
K. Matsuoka, S. Nakashima, in Proc. ICA. Minimal distortion principle for blind source separation, (2001), pp. 722–727.
K. Yatabe, D. Kitamura, in Proc. ICASSP. Determined blind source separation via proximal splitting algorithm (IEEE, 2018), pp. 776–780.
K. Yatabe, D. Kitamura, in Proc. ICASSP. Time-frequency-masking-based determined BSS with application to sparse IVA (IEEE, 2019), pp. 715–719.
K. Yatabe, D. Kitamura, Determined BSS based on time-frequency masking and its application to harmonic vector analysis. arXiv:2004.14091 (2020).
M. Brandstein, D. Ward, Microphone arrays: signal processing techniques and applications (Springer Science & Business Media, 2013).
S. Araki, S. Makino, Y. Hinamoto, R. Mukai, T. Nishikawa, H. Saruwatari, Equivalence between frequency-domain blind source separation and frequency-domain adaptive beamforming for convolutive mixtures. EURASIP J. Adv. Signal Process.2003(11), 1157–1166 (2003).
D. Griffin, J. Lim, Signal estimation from modified short-time Fourier transform. IEEE Trans. Acoust. Speech Signal Process.32(2), 236–243 (1984).
D. Gunawan, D. Sen, Iterative phase estimation for the synthesis of separated sources from single-channel mixtures. IEEE Signal Process. Lett.17(5), 421–424 (2010).
N. Sturmel, L. Daudet, L. Girin, in Proc. DAFx. Phase-based informed source separation of music, (2012).
M. Watanabe, P. Mowlaee, in Proc. INTERSPEECH. Iterative sinusoidal-based partial phase reconstruction in single-channel source separation, (2013).
F. Mayer, D. Williamson, P. Mowlaee, D.L. Wang, Impact of phase estimation on single-channel speech separation based on time-frequency masking. J. Acoust. Soc. Am.141:, 4668–4679 (2017).
S. Araki, F. Nesta, E. Vincent, Z. Koldovsky, G. Nolte, A. Ziehe, A. Benichoux, in Proc. LVA/ICA. The 2011 signal separation evaluation campaign (SiSEC2011): -Audio source separation, (2012), pp. 414–422.
S. Nakamura, K. Hiyane, F. Asano, T. Nishiura, T. Yamada, in Proc. LREC. Acoustical sound database in real environments for sound scene understanding and hands-free speech recognition, (2000), pp. 965–968.
E. Vincent, R. Gribonval, C. Févotte, Performance measurement in blind audio source separation. IEEE Trans. ASLP. 14(4), 1462–1469 (2006).
W.H. Press, S.A. Teukolsky, W.T. Vetterling, B.P. Flannery, Numerical Recipes in C: The Art of Scientific Computing (Cambridge University Press, New York, 1992).
I. Andrianakis, P. White, Speech spectral amplitude estimators using optimally shaped gamma and chi priors. Speech Comm.51(1), 1–14 (2009).
P. Mowlaee, J. Stahl, Single-channel speech enhancement with correlated spectral components: limits-potential. Speech Comm.121:, 58–69 (2020).