Pitch Correlogram Clustering for Fast Speaker Identification

Jhanwar, Nitin; Raina, Ajay K.

doi:10.1155/S1110865704408026

Research Article
Published: 27 December 2004

Pitch Correlogram Clustering for Fast Speaker Identification

Nitin Jhanwar¹ &
Ajay K. Raina^1,2

EURASIP Journal on Advances in Signal Processing volume 2004, Article number: 372807 (2004) Cite this article

971 Accesses
8 Citations
Metrics details

Abstract

Gaussian mixture models (GMMs) are commonly used in text-independent speaker identification systems. However, for large speaker databases, their high computational run-time limits their use in online or real-time speaker identification situations. Two-stage identification systems, in which the database is partitioned into clusters based on some proximity criteria and only a single-cluster GMM is run in every test, have been suggested in literature to speed up the identification process. However, most clustering algorithms used have shown limited success, apparently because the clustering and GMM feature spaces used are derived from similar speech characteristics. This paper presents a new clustering approach based on the concept of a pitch correlogram that captures frame-to-frame pitch variations of a speaker rather than short-time spectral characteristics like cepstral coefficient, spectral slopes, and so forth. The effectiveness of this two-stage identification process is demonstrated on the IVIE corpus of 110 speakers. The overall system achieves a run-time advantage of 500% as well as a 10% reduction of error in overall speaker identification.

Author information

Authors and Affiliations

Research and Development Division, Danlaw Technologies India Limited, Hyderabad, 500 034, India
Nitin Jhanwar & Ajay K. Raina
Department of Electrical and Electronic Engineering, The University of Melbourne, Victoria, 3010, Australia
Ajay K. Raina

Authors

Nitin Jhanwar
View author publications
You can also search for this author in PubMed Google Scholar
Ajay K. Raina
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nitin Jhanwar.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jhanwar, N., Raina, A.K. Pitch Correlogram Clustering for Fast Speaker Identification. EURASIP J. Adv. Signal Process. 2004, 372807 (2004). https://doi.org/10.1155/S1110865704408026

Download citation

Received: 24 June 2003
Revised: 25 June 2004
Published: 27 December 2004
DOI: https://doi.org/10.1155/S1110865704408026

Pitch Correlogram Clustering for Fast Speaker Identification

Abstract

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords and phrases