Open Access

Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions

  • Hamid Reza Abutalebi (EURASIP Member)1, 2Email author and
  • Hossein Momenzadeh1
EURASIP Journal on Advances in Signal Processing20112011:621390

https://doi.org/10.1155/2011/621390

Received: 30 April 2010

Accepted: 14 January 2011

Published: 24 January 2011

Abstract

TDOA- (time difference of arrival-) based algorithms are common methods for speech source localization. The generalized cross correlation (GCC) method is the most important approach for estimating TDOA between microphone pairs. The performance of this method significantly degrades in the presence of noise and reverberation. This paper addresses the problem of 3D localization in joint noisy and reverberant conditions and a single-speaker scenario. We first propose a modification to make the GCC-PHAse transform (GCC-PHAT) method robust against environment noise. Then, we use an iterative technique that employs location estimation to improve TDOAs accuracy. Extensive experiments on both simulated and real (practical) data (in a single-source scenario) show the capability of the proposed methods to significantly improve TDOA accuracy and, consequently, source location estimates.

Publisher note

To access the full article, please see PDF.

Authors’ Affiliations

(1)
Speech Processing Research Lab (SPRL), Electrical and Computer Engineering Department, Yazd University
(2)
Idiap Research Institute

Copyright

© H. R. Abutalebi and H. Momenzadeh. 2011

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.