On the Use of Complementary Spectral Features for Speaker Recognition

Hosseinzadeh, Danoush; Krishnan, Sridhar

doi:10.1155/2008/258184

Research Article
Open access
Published: 24 October 2007


EURASIP Journal on Advances in Signal Processing, volume 2008, Article number: 258184 (2007)


Abstract

The most popular features for speaker recognition are Mel frequency cepstral coefficients (MFCCs) and linear prediction cepstral coefficients (LPCCs). These features are used extensively because they characterize the vocal tract configuration, which is known to be highly speaker-dependent. In this work, several features are introduced that characterize the vocal system in order to complement the traditional features and produce better speaker recognition models. The spectral centroid (SC), spectral bandwidth (SBW), spectral band energy (SBE), spectral crest factor (SCF), spectral flatness measure (SFM), Shannon entropy (SE), and Renyi entropy (RE) were utilized for this purpose. This work demonstrates that these features are robust in noisy conditions by simulating some common distortions found in the speaker's environment and in a typical telephone channel. Babble noise, additive white Gaussian noise (AWGN), and a bandpass channel with 1 dB of ripple were used to simulate these noisy conditions. The results show significant improvements in classification performance for all noise conditions when these features were used to complement the MFCC and ΔMFCC features. In particular, the SC and SCF improved performance in almost all noise conditions within the examined SNR range (10–40 dB). For example, in cases where there was only one source of distortion, classification improvements of up to 8% and 10% were achieved under babble noise and AWGN, respectively, using the SCF feature.
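The abstract names the seven complementary features but does not define them. The following is a minimal sketch only, assuming common textbook definitions evaluated over the full band of a single frame; the article itself may compute them per subband or with different normalizations. The function names, the Renyi order of 3, and the `eps` guard are illustrative assumptions, not taken from the article.

```python
import cmath
import math


def power_spectrum(frame, sample_rate):
    """Naive DFT power spectrum for bins 0..N/2; returns (freqs_hz, power)."""
    n = len(frame)
    freqs, power = [], []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        freqs.append(k * sample_rate / n)
        power.append(abs(acc) ** 2)
    return freqs, power


def spectral_features(freqs, power, renyi_order=3.0, eps=1e-12):
    """Full-band spectral descriptors (assumed textbook definitions)."""
    total = sum(power) + eps              # eps guards against silent frames
    q = [p / total for p in power]        # spectrum normalized like a PMF
    mean_p = total / len(power)

    # Spectral centroid: power-weighted mean frequency.
    sc = sum(f * w for f, w in zip(freqs, q))
    # Spectral bandwidth: power-weighted spread around the centroid.
    sbw = math.sqrt(sum((f - sc) ** 2 * w for f, w in zip(freqs, q)))
    # Spectral band energy: total power in the analysed band.
    sbe = sum(power)
    # Spectral crest factor: peak power relative to mean power.
    scf = max(power) / mean_p
    # Spectral flatness: geometric mean over arithmetic mean of the power.
    geo = math.exp(sum(math.log(p + eps) for p in power) / len(power))
    sfm = geo / mean_p
    # Shannon entropy of the normalized spectrum, in bits.
    se = -sum(w * math.log2(w) for w in q if w > 0)
    # Renyi entropy of the given order (order must differ from 1), in bits.
    re = math.log2(sum(w ** renyi_order for w in q) + eps) / (1 - renyi_order)

    return {"sc": sc, "sbw": sbw, "sbe": sbe, "scf": scf,
            "sfm": sfm, "se": se, "re": re}


# Usage: a 500 Hz tone sampled at 8 kHz (64 samples, exactly bin 4).
frame = [math.sin(2 * math.pi * 4 * t / 64) for t in range(64)]
freqs, power = power_spectrum(frame, 8000)
tone = spectral_features(freqs, power)
```

For the pure tone, the centroid sits at the tone frequency while flatness and both entropies are close to zero; a flat, noise-like spectrum pushes flatness toward 1 and the entropies toward log2 of the number of bins. In a setup like the one the abstract describes, such per-frame values would be appended to the MFCC vector of the same frame, which is again an assumption about the pipeline rather than a detail stated here.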


Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Ryerson University, 350 Victoria Street, Toronto, ON, M5B 2K3, Canada
Danoush Hosseinzadeh & Sridhar Krishnan


Corresponding author

Correspondence to Sridhar Krishnan.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


About this article

Cite this article

Hosseinzadeh, D., Krishnan, S. On the Use of Complementary Spectral Features for Speaker Recognition. EURASIP J. Adv. Signal Process. 2008, 258184 (2007). https://doi.org/10.1155/2008/258184


Received: 29 November 2006
Revised: 07 May 2007
Accepted: 29 September 2007
Published: 24 October 2007
DOI: https://doi.org/10.1155/2008/258184
