Likelihood-Maximizing-Based Multiband Spectral Subtraction for Robust Speech Recognition

BabaAli, Bagher; Sameti, Hossein; Safayani, Mehran

doi:10.1155/2009/878105

Research Article
Open access
Published: 01 March 2009

Likelihood-Maximizing-Based Multiband Spectral Subtraction for Robust Speech Recognition

Bagher BabaAli¹,
Hossein Sameti¹ &
Mehran Safayani¹

EURASIP Journal on Advances in Signal Processing volume 2009, Article number: 878105 (2009) Cite this article

1318 Accesses
7 Citations
Metrics details

Abstract

Automatic speech recognition performance degrades significantly when speech is affected by environmental noise. Nowadays, the major challenge is to achieve good robustness in adverse noisy conditions so that automatic speech recognizers can be used in real situations. Spectral subtraction (SS) is a well-known and effective approach; it was originally designed for improving the quality of speech signal judged by human listeners. SS techniques usually improve the quality and intelligibility of speech signal while speech recognition systems need compensation techniques to reduce mismatch between noisy speech features and clean trained acoustic model. Nevertheless, correlation can be expected between speech quality improvement and the increase in recognition accuracy. This paper proposes a novel approach for solving this problem by considering SS and the speech recognizer not as two independent entities cascaded together, but rather as two interconnected components of a single system, sharing the common goal of improved speech recognition accuracy. This will incorporate important information of the statistical models of the recognition engine as a feedback for tuning SS parameters. By using this architecture, we overcome the drawbacks of previously proposed methods and achieve better recognition accuracy. Experimental evaluations show that the proposed method can achieve significant improvement of recognition rates across a wide range of signal to noise ratios.

Publisher note

To access the full article, please see PDF.

Author information

Authors and Affiliations

Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
Bagher BabaAli, Hossein Sameti & Mehran Safayani

Authors

Bagher BabaAli
View author publications
You can also search for this author in PubMed Google Scholar
Hossein Sameti
View author publications
You can also search for this author in PubMed Google Scholar
Mehran Safayani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bagher BabaAli.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

BabaAli, B., Sameti, H. & Safayani, M. Likelihood-Maximizing-Based Multiband Spectral Subtraction for Robust Speech Recognition. EURASIP J. Adv. Signal Process. 2009, 878105 (2009). https://doi.org/10.1155/2009/878105

Download citation

Received: 12 May 2008
Revised: 17 December 2008
Accepted: 19 January 2009
Published: 01 March 2009
DOI: https://doi.org/10.1155/2009/878105

Likelihood-Maximizing-Based Multiband Spectral Subtraction for Robust Speech Recognition

Abstract

Publisher note

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords