Skip to main content

Bird Species Recognition Using Support Vector Machines


Automatic identification of bird species by their vocalization is studied in this paper. Bird sounds are represented with two different parametric representations: (i) the mel-cepstrum parameters and (ii) a set of low-level signal parameters, both of which have been found useful for bird species recognition. Recognition is performed in a decision tree with support vector machine (SVM) classifiers at each node that perform classification between two species. Recognition is tested with two sets of bird species whose recognition has been previously tested with alternative methods. Recognition results with the proposed method suggest better or equal performance when compared to existing reference methods.


  1. 1.

    Selouani S-A, Kardouchi M, Hervet E, Roy D: Automatic birdsong recognition based on autoregressive time-delay neural networks. Congress on Computational Intelligence Methods and Applications (CIMA '05), December 2005, Istanbul, Turkey 1–6.

    Google Scholar 

  2. 2.

    Kwan C, Ho KC, Mei G, et al.: An automated acoustic system to monitor and classify birds. EURASIP Journal on Applied Signal Processing 2006, 2006: 19 pages.

    Google Scholar 

  3. 3.

    Vilches E, Escobar IA, Vallejo EE, Taylor CE: Data mining applied to acoustic bird species recognition. Proceedings of the 18th International Conference on Pattern Recognition (ICPR '06), August 2006, Hong Kong 3: 400–403.

    Article  Google Scholar 

  4. 4.

    Tyagi H, Hegde RM, Murthy HA, Prabhakar A: Automatic identification of bird calls using spectral ensemble average voiceprints. Proceedings of the 13th European Signal Processing Conference (EUSIPCO '06), September 2006, Florence, Italy

    Google Scholar 

  5. 5.

    Fox EJS, Roberts JD, Bennamoun M: Text-independent speaker identification in birds. Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP '06), September 2006, Pittsburgh, Pa, USA

    Google Scholar 

  6. 6.

    Anderson SE, Dave AS, Margoliash D: Template-based automatic recognition of birdsong syllables from continuous recordings. The Journal of the Acoustical Society of America 1996,100(2):1209-1219. 10.1121/1.415968

    Article  Google Scholar 

  7. 7.

    Kogan JA, Margoliash D: Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study. The Journal of the Acoustical Society of America 1998,103(4):2185-2196. 10.1121/1.421364

    Article  Google Scholar 

  8. 8.

    Mcllraith AL, Card HC: Birdsong recognition using backpropagation and multivariate statistics. IEEE Transactions on Signal Processing 1997,45(11):2740-2748. 10.1109/78.650100

    Article  Google Scholar 

  9. 9.

    Fagerlund S: Avesound project web-site. 2006.

    Google Scholar 

  10. 10.

    Catchpole CK, Slater PJB: Bird Song: Biological Themes and Variations. Cambridge University Press, Cambridge, UK; 1995.

    Google Scholar 

  11. 11.

    Härmä A: Automatic identification of bird species based on sinusoidal modelling of syllables. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03), April 2003, Hong Kong 5: 545–548.

    Google Scholar 

  12. 12.

    Härmä A, Somervuo P: Classification of the harmonic structure in bird vocalization. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), May 2004, Montreal, Canada 5: 701–704.

    Google Scholar 

  13. 13.

    Somervuo P, Härmä A: Bird song recognition based on syllable pair histograms. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), May 2004, Montreal, Canada 5: 825–828.

    Google Scholar 

  14. 14.

    Fagerlund S: Automatic recognition of bird species by their sounds, M.S. thesis. Helsinki University of Technology, Espoo, Finland; 2004.

    Google Scholar 

  15. 15.

    Fagerlund S, Härmä A: Parametrization of inharmonic bird sounds for automatic recognition. Proceedings of the 13th European Signal Processing Conference (EUSIPCO '05), September 2005, Antalya, Turkey

    Google Scholar 

  16. 16.

    Selin A, Turunen J, Tanttu JT: Wavelets in recognition of bird sounds. EURASIP Journal on Advances in Signal Processing 2007, 2007: 9 pages.

    MATH  Google Scholar 

  17. 17.

    Somervuo P, Härmä A, Fagerlund S: Parametric representations of bird sounds for automatic species recognition. IEEE Transactions on Audio, Speech and Language Processing 2006,14(6):2252-2263.

    Article  Google Scholar 

  18. 18.

    Li D, Sethi IK, Dimitrova N, McGee T: Classification of general audio data for content-based retrieval. Pattern Recognition Letters 2001,22(5):533-544. 10.1016/S0167-8655(00)00119-7

    Article  Google Scholar 

  19. 19.

    Davis SB, Mermelstein P: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing 1980,28(4):357-366. 10.1109/TASSP.1980.1163420

    Article  Google Scholar 

  20. 20.

    Juneja A, Espy-Wilson C: Speech segmentation using probabilistic phonetic feature hierarchy and support vector machines. Proceedings of the International Joint Conference on Neural Networks (IJCNN '03), July 2003, Portland, Ore, USA 1: 675–679.

    Google Scholar 

  21. 21.

    Ganapathiraju A, Hamaker JE, Picone J: Applications of support vector machines to speech recognition. IEEE Transactions on Signal Processing 2004,52(8):2348-2355. 10.1109/TSP.2004.831018

    Article  Google Scholar 

  22. 22.

    Lin C-C, Chen S-H, Truong T-K, Chang Y: Audio classification and categorization based on wavelets and support vector machine. IEEE Transactions on Speech and Audio Processing 2005,13(5, part 1):644-651.

    Article  Google Scholar 

  23. 23.

    Vapnik V: Statistical Learning Theory. John Wiley & Sons, New York, NY, USA; 1998.

    MATH  Google Scholar 

  24. 24.

    Burges CJC: A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 1998,2(2):121-167. 10.1023/A:1009715923555

    Article  Google Scholar 

  25. 25.

    Schwenker F: Hierarchical support vector machines for multi-class pattern recognition. Proceedings of the 4th International Conference on Knowledge-Based Intellingent Engineering Systems and Allied Technologies (KES '00), August-September 2000, Brighton, UK 2: 561–565.

    Google Scholar 

  26. 26.

    Platt JC, Cristianini N, Shawe-Taylor J: Large margin dags for multiclass classification. In Advances in Neural Information Processing Systems 12. MIT Press, Cambridge, Mass, USA; 2000:547-553.

    Google Scholar 

  27. 27.

    Platt JC: Fast training of support vector machines using sequential minimal optimization. In Advances in Kernel Methods - Support Vector Learning. Edited by: Scholkopf B, Burges C, Smola AJ. MIT Press, Cambridge, Mass, USA; 1999:185-208. chapter 12

    Google Scholar 

  28. 28.

    Cawley GC:MATLAB support vector machine toolbox (v0.55 ). School of Information Systems, University of East Anglia, Norwich, Norfolk, UK. NR4 7TJ, 2000,

    Google Scholar 

Download references

Author information



Corresponding author

Correspondence to Seppo Fagerlund.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Fagerlund, S. Bird Species Recognition Using Support Vector Machines. EURASIP J. Adv. Signal Process. 2007, 038637 (2007).

Download citation


  • Support Vector Machine
  • Information Technology
  • Decision Tree
  • Support Vector
  • Quantum Information