Open Access

Music Genre Classification Using MIDI and Audio Features

EURASIP Journal on Advances in Signal Processing20072007:036409

https://doi.org/10.1155/2007/36409

Received: 1 December 2005

Accepted: 19 October 2006

Published: 4 January 2007

Abstract

We report our findings on using MIDI files and audio features from MIDI, separately and combined together, for MIDI music genre classification. We use McKay and Fujinaga's 3-root and 9-leaf genre data set. In order to compute distances between MIDI pieces, we use normalized compression distance (NCD). NCD uses the compressed length of a string as an approximation to its Kolmogorov complexity and has previously been used for music genre and composer clustering. We convert the MIDI pieces to audio and then use the audio features to train different classifiers. MIDI and audio from MIDI classifiers alone achieve much smaller accuracies than those reported by McKay and Fujinaga who used not NCD but a number of domain-based MIDI features for their classification. Combining MIDI and audio from MIDI classifiers improves accuracy and gets closer to, but still worse, accuracies than McKay and Fujinaga's. The best root genre accuracies achieved using MIDI, audio, and combination of them are 0.75, 0.86, and 0.93, respectively, compared to 0.98 of McKay and Fujinaga. Successful classifier combination requires diversity of the base classifiers. We achieve diversity through using certain number of seconds of the MIDI file, different sample rates and sizes for the audio file, and different classification algorithms.

[123456789101112131415161718192021222324]

Authors’ Affiliations

(1)
Computer Engineering Department, Faculty of Electrical and Electronic Engineering, Istanbul Technical University, Maslak

References

  1. Lippens S, Martens JP, De Mulder T: A comparison of human and automatic musical genre classification. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '04), May 2004, Montreal, Quebec, Canada 4: 233-236.Google Scholar
  2. Basili R, Serafini A, Stellato A: Classification of musical genre: a machine learning approach. Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, SpainGoogle Scholar
  3. Jarvinen T, Toiviainen P, Louhivuori J: Classification and categorization of musical styles with statistical analysis and self-organizing maps. Proceedings of the AISB Symposium on Musical Creativity, April 1999, Edinburgh, Scotland 54-57.Google Scholar
  4. McKay C, Fujinaga I: Automatic genre classification using large high-level musical feature sets. Proceedings of 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, SpainGoogle Scholar
  5. Tzanetakis G, Ermolinskyi A, Cook P: Pitch histograms in audio and symbolic music information retrieval. Journal of New Music Research 2003,32(2):143-152. 10.1076/jnmr.32.2.143.16743View ArticleGoogle Scholar
  6. Cilibrasi R, Vitányi PMB, de Wolf R: Algorithmic clustering of music based on string compression. Computer Music Journal 2004,28(4):49-67. 10.1162/0148926042728449View ArticleGoogle Scholar
  7. Li M, Chen X, Li X, Ma B, Vitányi PMB: The similarity metric. IEEE Transactions on Information Theory 2004,50(12):3250-3264. 10.1109/TIT.2004.838101View ArticleMATHMathSciNetGoogle Scholar
  8. Keogh E, Lonardi S, Rtanamahatana CA: Towards parameter-free data mining. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '04), August 2004, Seattle, Wash, USA 206-215.Google Scholar
  9. Pan D: A tutorial on MPEG/audio compression. IEEE Multimedia 1995,2(2):60-74. 10.1109/93.388209View ArticleGoogle Scholar
  10. Aucouturier JJ, Pachet F: Representing musical genre: a state of the art. Journal of New Music Research 2003,32(1):83-93. 10.1076/jnmr.32.1.83.16801View ArticleGoogle Scholar
  11. Lidy T, Rauber A: Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR '05), September 2005, London, UKGoogle Scholar
  12. Tzanetakis G, Cook P: Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing 2002,10(5):293-302. 10.1109/TSA.2002.800560View ArticleGoogle Scholar
  13. Xu C, Maddage NC, Shao X, Cao F, Tian Q: Musical genre classification using support vector machines. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '03), April 2003, Hong Kong 5: 429-432.Google Scholar
  14. Gouyon F, Dixon S, Pampalk E, Widmer G: Evaluating rhythmic descriptors for musical genre classification. Proceedings of the 25th International AES Conference, June 2004, London, UKGoogle Scholar
  15. West K, Cox S: Features and classifiers for the automatic classification of musical audio signals. Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, SpainGoogle Scholar
  16. Sonmez A: Music genre and composer identification by using Kolmogorov distance, M. Sc. thesis. Computer Engineering Department, Istanbul Technical University, Istanbul, Turkey, 2005.Google Scholar
  17. Cataltepe Z, Sonmez A, Adali E: Music classification using Kolmogorov distance. Representation in Music/Musical Representation Congress, October 2005, Istanbul, TurkeyGoogle Scholar
  18. Li T, Ogihara M, Li Q: A comparative study on content-based music genre classification. Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '03), July-August 2003, Toronto, Ontario, Canada 282-289.View ArticleGoogle Scholar
  19. Turnbull D, Elkan C: Fast recognition of musical genres using RBF networks. IEEE Transactions on Knowledge and Data Engineering 2005,17(4):580-584.View ArticleGoogle Scholar
  20. Duda RO, Hart PE, Stork DG: Pattern Classification. John Wiley & Sons, New York, NY, USA; 2000.MATHGoogle Scholar
  21. Bergstra J, Casagrande N, Eck D: Genre classification: timbre and rhythm-based multiresolution audio classification. Proceedings of 1st Annual Music Information Retrieval Evaluation eXchange (MIREX) Genre Classification Contest, September 2005, London, UKGoogle Scholar
  22. Li T, Tzanetakis G: Factors in automatic musical genre classification of audio signals. Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA '03), October 2003, New Paltz, NY, USAGoogle Scholar
  23. Uitdenbogerd L, Zobel J: Music ranking techniques evaluated. Australian Computer Science Communications 2002,24(1):275-283.Google Scholar
  24. Kuncheva LI: Combining Pattern Classifiers. John Wiley & Sons, New York, NY, USA; 2004.View ArticleMATHGoogle Scholar

Copyright

© Cataltepe et al. 2007