Open Access

Wavelets in Recognition of Bird Sounds

EURASIP Journal on Advances in Signal Processing20062007:051806

https://doi.org/10.1155/2007/51806

Received: 9 September 2005

Accepted: 22 June 2006

Published: 11 October 2006

Abstract

This paper presents a novel method to recognize inharmonic and transient bird sounds efficiently. The recognition algorithm consists of feature extraction using wavelet decomposition and recognition using either supervised or unsupervised classifier. The proposed method was tested on sounds of eight bird species of which five species have inharmonic sounds and three reference species have harmonic sounds. Inharmonic sounds are not well matched to the conventional spectral analysis methods, because the spectral domain does not include any visible trajectories that computer can track and identify. Thus, the wavelet analysis was selected due to its ability to preserve both frequency and temporal information, and its ability to analyze signals which contain discontinuities and sharp spikes. The shift invariant feature vectors calculated from the wavelet coefficients were used as inputs of two neural networks: the unsupervised self-organizing map (SOM) and the supervised multilayer perceptron (MLP). The results were encouraging: the SOM network recognized 78% and the MLP network 96% of the test sounds correctly.

[12345678910111213141516171819202122232425262728293031323334353637]

Authors’ Affiliations

(1)
Department of Information Technology, Tampere University of Technology

References

  1. Catchpole CK, Slater PJB: Bird Song: Biological Themes and Variations. Cambridge University Press, Cambridge, UK; 1995.Google Scholar
  2. Kroodsma DE: The Singing Life of Birds: The Art and Science of Listening Birdsong. Houghton Miflin, Boston, Mass, USA; 2005.Google Scholar
  3. Greenewalt CH: Bird Song: Acoustics and Physiology. Smithsonian Institution Press, Washington, DC, USA; 1968.Google Scholar
  4. Zollinger SA, Riede T, Suthers RA: Production of nonlinear phenomena in the Northern Mockingbirds ( Minus polyglottos ). Proceedings of the 1st International Conference on Acoustic Communication by Animals, July 2003, College Park, Md, USA 283-284.Google Scholar
  5. Suthers RA, Beckers G, Zollinger SA, Vallet E, Kreuzer M: Mechanisms of vocal complexity in birds. Proceedings of the 1st International Conference on Acoustic Communication by Animals, July 2003, College Park, Md, USA 237-238.Google Scholar
  6. Bradbury JW: Parrots and technology. Proceedings of the 1st International Conference on Acoustic Communication by Animals, July 2003, College Park, Md, USA 29-30.Google Scholar
  7. Baker MC, Logue DM: Population differentiation in a complex bird sound: a comparison of three bioacoustical analysis procedures. Ethology 2003,109(3):223-242. 10.1046/j.1439-0310.2003.00866.xView ArticleGoogle Scholar
  8. Groth JG: Call matching and positive assortative mating in red crossbills. The Auk 1993,110(2):398-401.Google Scholar
  9. Robb MS: Introduction to vocalizations of crossbills in Northwestern Europe. Dutch Birding 2000,22(2):61-107.Google Scholar
  10. Deecke VB, Janik VM: Automated categorization of bioacoustic signals: avoiding perceptual pitfalls. Journal of the Acoustical Society of America 2006,119(1):645-653. 10.1121/1.2139067View ArticleGoogle Scholar
  11. Elowson AM, Hailman JP: Analysis of complex variation: dichotomous sorting of predator-elicited calls of the Florida scrub jay. Bioacoustics 1991,3(4):295-320.View ArticleGoogle Scholar
  12. Groth JG: Resolution of cryptic species in appalachian red crossbills. The Condor 1988,90(4):745-760. 10.2307/1368832View ArticleGoogle Scholar
  13. Lovell SF, Lein MR: Song variation in a population of Alder Flycatchers. Journal of Field Ornithology 2004,75(2):146-151.View ArticleGoogle Scholar
  14. Härmä A: Automatic identification of bird species based on sinusoidal modelling of syllables. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03), April 2003, Hong Kong 5: 545-548.Google Scholar
  15. Härmä A, Somervuo P: Classification of the harmonic structure in bird vocalization. Proceedings IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), May 2004, Montreal, Quebec, Canada 5: 701-704.Google Scholar
  16. Mesgarani N, Shamma S: Bird call classification using multiresolution spectrotemporal auditory model. Proceedings of the 1st International Conference on Acoustic Communication by Animals, July 2003, College Park, Md, USA 155-156.Google Scholar
  17. Tanttu JT, Turunen J, Selin A, Ojanen M: Automatic feature extraction and classification of crossbill ( Loxia spp. ) flight calls. Bioacoustics 2006,15(3):251-269.View ArticleGoogle Scholar
  18. Somervuo P, Härmä A: Bird song recognition based on syllable pair histograms. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), May 2004, Montreal, Quebec, Canada 5: 825-828.Google Scholar
  19. Fagerlund S, Härmä A: Parametrization of inharmonic bird sounds for automatic recognition. proceedings of the 13th European Signal Processing Conference (EUSIPCO '05), September 2005, Antalya, Turkey Proceedings on CD-ROMGoogle Scholar
  20. Rioul O, Vetterli M: Wavelets and signal processing. IEEE Signal Processing Magazine 1991,8(4):14-38. 10.1109/79.91217View ArticleGoogle Scholar
  21. Soman AK, Vaidyanathan PP: Paraunitary filter banks and wavelet packets. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '92), March 1992, San Francisco, Calif, USA 397-400.Google Scholar
  22. Pittner S, Kamarthi SV: Feature extraction from wavelet coefficients for pattern recognition tasks. IEEE Transactions on Pattern Analysis and Machine Intelligence 1999,21(1):83-88. 10.1109/34.745739View ArticleGoogle Scholar
  23. Learned R: Wavelet packet based transient signal classification, M.S. thesis.Google Scholar
  24. Phelps SM, Ryan MJ: Neural networks predict response biases of female tungara frogs. Proceedings of the Royal Society—Biological Sciences (Series B) 1998,265(1393):279-285. 10.1098/rspb.1998.0293View ArticleGoogle Scholar
  25. Deecke VB, Ford JKB, Spong P: Quantifying complex patterns of bioacoustic variation: use of a neural network to compare killer whale (Orcinus orca) dialects. The Journal of the Acoustical Society of America 1999,105(4):2499-2507. 10.1121/1.426853View ArticleGoogle Scholar
  26. Placer J, Slobodchikoff CN: A fuzzy-neural system for identification of species-specific alarm calls of Gunnison's prairie dogs. Behavioural Processes 2000,52(1):1-9. 10.1016/S0376-6357(00)00105-4View ArticleGoogle Scholar
  27. Thorn A: Artificial neural networks for vocal repertoire analysis. Proceedings of the 1st International Conference on Acoustic Communication by Animals, July 2003, College Park, Md, USA 245-246.Google Scholar
  28. McIlraith AL, Card HC: Birdsong recognition using backpropagation and multivariate statistics. IEEE Transactions on Signal Processing 1997,45(11):2740-2748. 10.1109/78.650100View ArticleGoogle Scholar
  29. Terry AMR, McGregor PK: Census and monitoring based on individually identifiable vocalizations: the role of neural networks. Animal Conservation 2002,5(2):103-111. 10.1017/S1367943002002147View ArticleGoogle Scholar
  30. Somervuo P, Härmä A: Analyzing bird song syllables on the self-organizing map. Proceedings of the Workshop on Self-Organizing Maps (WSOM '03), September 2003, Hibikino, Japan Proceedings on CD-ROMGoogle Scholar
  31. Boggess A, Narcowich FJ: A First Course in Wavelets with Fourier Analysis. Prentice-Hall, Upper Saddle River, NJ, USA; 2001.MATHGoogle Scholar
  32. Daubechies I: Ten Lectures on Wavelets. SIAM, Philadelphia, Pa, USA; 1992.View ArticleMATHGoogle Scholar
  33. Akansu AN, Haddad RA: Multiresolution Signal Decomposition: Transforms, Subbands, and Wavelets. Academic Press, Boston, Mass, USA; 1992.MATHGoogle Scholar
  34. Misiti M, Misiti Y, Oppenheim G, Poggi J-M: Wavelet Toolbox for Use with Matlab. MathWorks, Natick, Mass, USA; 2000.MATHGoogle Scholar
  35. Kohonen T: Self-Organizing Maps. Springer, Berlin, Germany; 2001.View ArticleMATHGoogle Scholar
  36. Haykin S: Neural Networks: A Comprehensive Foundation. Macmillan College, New York, NY, USA; 1994.MATHGoogle Scholar
  37. MathWorks : Matlab Software Homepage. June 2005, http://www.mathworks.com

Copyright

© Arja Selin et al. 2007

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.