Audio Classification in Speech and Music: A Comparison between a Statistical and a Neural Approach

Bugatti, Alessandro; Flammini, Alessandra; Migliorati, Pierangelo

doi:10.1155/S1110865702000720

Research Article
Published: 30 April 2002

Alessandro Bugatti¹,
Alessandra Flammini¹ &
Pierangelo Migliorati¹

EURASIP Journal on Advances in Signal Processing, volume 2002, Article number: 980905 (2002)

Abstract

We address the problem of classifying audio into speech and music for multimedia applications. In particular, we compare two techniques for speech/music discrimination. The first method is based on the zero-crossing rate and Bayesian classification. It is computationally very simple and gives good results for pure music or pure speech. Simulation results show, however, that performance degrades when the music segment also contains superimposed speech or strong rhythmic components. To overcome these problems, we propose a second method that uses more features and is based on neural networks (specifically, a multilayer perceptron). This method achieves better performance at the expense of a limited increase in computational complexity. In practice, the proposed neural network is simple to implement if a suitable polynomial is used as the activation function, and a real-time implementation is possible even on low-cost embedded systems.
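As an illustration of the feature behind the first method, the following is a minimal sketch of a per-frame zero-crossing rate computation. The abstract does not give the frame length, the hop size, or the statistics that are fed to the Bayesian classifier, so the function name `zero_crossing_rate` and the parameters `frame_len` and `hop` are assumptions made for this example only, not the authors' implementation.

```python
import numpy as np

def zero_crossing_rate(signal, frame_len=512, hop=256):
    """Fraction of adjacent-sample sign changes in each frame.

    frame_len and hop are illustrative defaults (assumptions),
    not values taken from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    rates = []
    # Slide a fixed-length window over the signal; drop the incomplete tail.
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        signs = np.signbit(frame)  # True where the sample is negative
        crossings = np.count_nonzero(signs[1:] != signs[:-1])
        rates.append(crossings / (frame_len - 1))
    return np.array(rates)
```

A classifier in the spirit of the first method would then be trained on statistics of these per-frame rates; which statistics to use is left open here, since the abstract does not specify them.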

Author information

Authors and Affiliations

Department of Electronics for Automation, University of Brescia, Via Branze 38, Brescia, 25123, Italy
Alessandro Bugatti, Alessandra Flammini & Pierangelo Migliorati

Authors

Alessandro Bugatti
Alessandra Flammini
Pierangelo Migliorati

Corresponding author

Correspondence to Alessandro Bugatti.

About this article

Cite this article

Bugatti, A., Flammini, A. & Migliorati, P. Audio Classification in Speech and Music: A Comparison between a Statistical and a Neural Approach. EURASIP J. Adv. Signal Process. 2002, 980905 (2002). https://doi.org/10.1155/S1110865702000720

Received: 27 July 2001
Revised: 08 January 2002
Published: 30 April 2002
DOI: https://doi.org/10.1155/S1110865702000720
