Data-Model Relationship in Text-Independent Speaker Recognition

Mason, John S. D.; Evans, Nicholas W. D.; Stapert, Robert; Auckenthaler, Roland

doi:10.1155/ASP.2005.471

Research Article
Open access
Published: 30 March 2005

Data-Model Relationship in Text-Independent Speaker Recognition

John S. D. Mason¹,
Nicholas W. D. Evans¹,
Robert Stapert² &
…
Roland Auckenthaler¹

EURASIP Journal on Advances in Signal Processing volume 2005, Article number: 582548 (2005) Cite this article

839 Accesses
3 Citations
Metrics details

Abstract

Text-independent speaker recognition systems such as those based on Gaussian mixture models (GMMs) do not include time sequence information (TSI) within the model itself. The level of importance of TSI in speaker recognition is an interesting question and one addressed in this paper. Recent works has shown that the utilisation of higher-level information such as idiolect, pronunciation, and prosodics can be useful in reducing speaker recognition error rates. In accordance with these developments, the aim of this paper is to show that as more data becomes available, the basic GMM can be enhanced by utilising TSI, even in a text-independent mode. This paper presents experimental work incorporating TSI into the conventional GMM. The resulting system, known as the segmental mixture model (SMM), embeds dynamic time warping (DTW) into a GMM framework. Results are presented on the 2000-speaker SpeechDat Welsh database which show improved speaker recognition performance with the SMM.

Author information

Authors and Affiliations

School of Engineering, University of Wales Swansea, Swansea, SA2 8 PP, UK
John S. D. Mason, Nicholas W. D. Evans & Roland Auckenthaler
Aculab, Milton Keynes, MK1 1PT, UK
Robert Stapert

Authors

John S. D. Mason
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas W. D. Evans
View author publications
You can also search for this author in PubMed Google Scholar
Robert Stapert
View author publications
You can also search for this author in PubMed Google Scholar
Roland Auckenthaler
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to John S. D. Mason.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Mason, J.S.D., Evans, N.W.D., Stapert, R. et al. Data-Model Relationship in Text-Independent Speaker Recognition. EURASIP J. Adv. Signal Process. 2005, 582548 (2005). https://doi.org/10.1155/ASP.2005.471

Download citation

Received: 12 December 2002
Revised: 23 September 2004
Published: 30 March 2005
DOI: https://doi.org/10.1155/ASP.2005.471

Data-Model Relationship in Text-Independent Speaker Recognition

Abstract

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords and phrases