Exploiting Temporal Feature Integration for Generalized Sound Recognition

Ntalampiras, Stavros; Potamitis, Ilyas; Fakotakis, Nikos

doi:10.1155/2009/807162

Research Article
Open access
Published: 28 December 2009

Stavros Ntalampiras¹,
Ilyas Potamitis² &
Nikos Fakotakis¹

EURASIP Journal on Advances in Signal Processing volume 2009, Article number: 807162 (2009)

Abstract

This paper presents a methodology that incorporates temporal feature integration for automated generalized sound recognition. Such a system can be of great use for scene analysis and understanding based on the acoustic modality. We assess the performance of three feature sets based on the Mel filterbank, the MPEG-7 audio protocol, and wavelet decomposition. Furthermore, we explore the application of temporal integration using three strategies: (a) short-term statistics, (b) spectral moments, and (c) autoregressive models. The experimental setup is explained in detail and is based on the concurrent usage of professional sound effects collections. In this way, we try to form a representative picture of the characteristics of ten sound classes. During the first phase of our implementation, audio classification is performed with statistical models (hidden Markov models, HMMs), while a fusion scheme that exploits the models constructed from the various feature sets provided the highest average recognition rate. The proposed system not only uses diverse groups of sound parameters but also exploits the advantages of temporal feature integration.
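As an illustration of strategy (a), the following is a minimal sketch of temporal integration by short-term statistics. It is not the authors' implementation. The function name `integrate_short_term_statistics`, the choice of mean and variance as the statistics, and the `window` and `hop` values are all assumptions made for the example, since the abstract does not specify them. The input is assumed to be a sequence of frame-level feature vectors (for instance, Mel filterbank outputs), and each output vector summarizes one longer window of frames.

```python
from statistics import fmean, pvariance


def integrate_short_term_statistics(frames, window=30, hop=15):
    """Summarize frame-level features over sliding windows.

    frames: list of equal-length feature vectors, one per short-time frame.
    window: number of frames per integration window (assumed value).
    hop:    number of frames between window starts (assumed value).

    Returns one vector per window: per-feature means followed by
    per-feature population variances.
    """
    if window < 1 or hop < 1:
        raise ValueError("window and hop must be positive")
    if len(frames) < window:
        raise ValueError("need at least `window` frames")

    integrated = []
    for start in range(0, len(frames) - window + 1, hop):
        segment = frames[start:start + window]
        # Transpose so each tuple holds one feature's values over the window.
        columns = list(zip(*segment))
        means = [fmean(column) for column in columns]
        variances = [pvariance(column) for column in columns]
        integrated.append(means + variances)
    return integrated


if __name__ == "__main__":
    # Ten synthetic frames with two features each (illustrative data only).
    toy_frames = [[float(2 * i), float(2 * i + 1)] for i in range(10)]
    for vector in integrate_short_term_statistics(toy_frames, window=4, hop=2):
        print(vector)
```

The integrated vectors have twice the dimensionality of the frame-level features and a lower rate, which is the general trade-off of this kind of integration. Strategies (b) and (c) would replace the mean and variance with spectral moments or autoregressive coefficients computed over the same windows.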

Author information

Authors and Affiliations

Electrical and Computer Engineering Department, University of Patras, 26500, Rio-Patras, Greece
Stavros Ntalampiras & Nikos Fakotakis
Department of Music Technology and Acoustics, Technological Educational Institute of Crete, Daskalaki-Perivolia, Crete, 74100, Greece
Ilyas Potamitis

Corresponding author

Correspondence to Stavros Ntalampiras.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Ntalampiras, S., Potamitis, I. & Fakotakis, N. Exploiting Temporal Feature Integration for Generalized Sound Recognition. EURASIP J. Adv. Signal Process. 2009, 807162 (2009). https://doi.org/10.1155/2009/807162

Received: 13 July 2009
Revised: 25 September 2009
Accepted: 18 November 2009
Published: 28 December 2009
DOI: https://doi.org/10.1155/2009/807162
