| ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2005 The Institute of Electronics, Information and Communication Engineers
Special Section on Multi-channel Acoustic Signal Processing -- Papers |
Blind Source Separation of Convolutive Mixtures of Speech in Frequency Domain
1 The authors are with NTT Communication Science Laboratories, NTT Corporation, Kyoto-fu, 619-0237 Japan. E-mail: maki{at}cslab.kecl.ntt.co.jp
This paper overviews a total solution for frequency-domain blind source separation (BSS) of convolutive mixtures of audio signals, especially speech. Frequency-domain BSS performs independent component analysis (ICA) in each frequency bin, and this is more efficient than time-domain BSS. We describe a sophisticated total solution for frequency-domain BSS, including permutation, scaling, circularity, and complex activation function solutions. Experimental results of 2 x 2, 3 x 3, 4 x 4, 6 x 8, and 2 x 2 (moving sources), (#sources x #microphones) in a room are promising.
Key Words: blind source separation, convolutive mixtures, independent component analysis, frequency-domain BSS, microphone array, adaptive beamformer
Manuscript received February 16, 2005. Final manuscript received March 9, 2005.