Music Genre Classification Using MIDI and Audio Features

Cataltepe, Zehra; Yaslan, Yusuf; Sonmez, Abdullah

doi:10.1155/2007/36409

Research Article
Open access
Published: 01 December 2007

Music Genre Classification Using MIDI and Audio Features

Zehra Cataltepe¹,
Yusuf Yaslan¹ &
Abdullah Sonmez¹

EURASIP Journal on Advances in Signal Processing volume 2007, Article number: 036409 (2007) Cite this article

5690 Accesses
39 Citations
Metrics details

Abstract

We report our findings on using MIDI files and audio features from MIDI, separately and combined together, for MIDI music genre classification. We use McKay and Fujinaga's 3-root and 9-leaf genre data set. In order to compute distances between MIDI pieces, we use normalized compression distance (NCD). NCD uses the compressed length of a string as an approximation to its Kolmogorov complexity and has previously been used for music genre and composer clustering. We convert the MIDI pieces to audio and then use the audio features to train different classifiers. MIDI and audio from MIDI classifiers alone achieve much smaller accuracies than those reported by McKay and Fujinaga who used not NCD but a number of domain-based MIDI features for their classification. Combining MIDI and audio from MIDI classifiers improves accuracy and gets closer to, but still worse, accuracies than McKay and Fujinaga's. The best root genre accuracies achieved using MIDI, audio, and combination of them are 0.75, 0.86, and 0.93, respectively, compared to 0.98 of McKay and Fujinaga. Successful classifier combination requires diversity of the base classifiers. We achieve diversity through using certain number of seconds of the MIDI file, different sample rates and sizes for the audio file, and different classification algorithms.

References

Lippens S, Martens JP, De Mulder T: A comparison of human and automatic musical genre classification. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '04), May 2004, Montreal, Quebec, Canada 4: 233–236.
Google Scholar
Basili R, Serafini A, Stellato A: Classification of musical genre: a machine learning approach. Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, Spain
Google Scholar
Jarvinen T, Toiviainen P, Louhivuori J: Classification and categorization of musical styles with statistical analysis and self-organizing maps. Proceedings of the AISB Symposium on Musical Creativity, April 1999, Edinburgh, Scotland 54–57.
Google Scholar
McKay C, Fujinaga I: Automatic genre classification using large high-level musical feature sets. Proceedings of 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, Spain
Google Scholar
Tzanetakis G, Ermolinskyi A, Cook P: Pitch histograms in audio and symbolic music information retrieval. Journal of New Music Research 2003,32(2):143-152. 10.1076/jnmr.32.2.143.16743
Article Google Scholar
Cilibrasi R, Vitányi PMB, de Wolf R: Algorithmic clustering of music based on string compression. Computer Music Journal 2004,28(4):49-67. 10.1162/0148926042728449
Article Google Scholar
Li M, Chen X, Li X, Ma B, Vitányi PMB: The similarity metric. IEEE Transactions on Information Theory 2004,50(12):3250-3264. 10.1109/TIT.2004.838101
Article MathSciNet Google Scholar
Keogh E, Lonardi S, Rtanamahatana CA: Towards parameter-free data mining. Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '04), August 2004, Seattle, Wash, USA 206–215.
Google Scholar
Pan D: A tutorial on MPEG/audio compression. IEEE Multimedia 1995,2(2):60-74. 10.1109/93.388209
Article Google Scholar
Aucouturier JJ, Pachet F: Representing musical genre: a state of the art. Journal of New Music Research 2003,32(1):83-93. 10.1076/jnmr.32.1.83.16801
Article Google Scholar
Lidy T, Rauber A: Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR '05), September 2005, London, UK
Google Scholar
Tzanetakis G, Cook P: Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing 2002,10(5):293-302. 10.1109/TSA.2002.800560
Article Google Scholar
Xu C, Maddage NC, Shao X, Cao F, Tian Q: Musical genre classification using support vector machines. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '03), April 2003, Hong Kong 5: 429–432.
Google Scholar
Gouyon F, Dixon S, Pampalk E, Widmer G: Evaluating rhythmic descriptors for musical genre classification. Proceedings of the 25th International AES Conference, June 2004, London, UK
Google Scholar
West K, Cox S: Features and classifiers for the automatic classification of musical audio signals. Proceedings of the 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, Spain
Google Scholar
Sonmez A: Music genre and composer identification by using Kolmogorov distance, M. Sc. thesis. Computer Engineering Department, Istanbul Technical University, Istanbul, Turkey, 2005.
Google Scholar
Cataltepe Z, Sonmez A, Adali E: Music classification using Kolmogorov distance. Representation in Music/Musical Representation Congress, October 2005, Istanbul, Turkey
Google Scholar
Li T, Ogihara M, Li Q: A comparative study on content-based music genre classification. Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '03), July-August 2003, Toronto, Ontario, Canada 282–289.
Google Scholar
Turnbull D, Elkan C: Fast recognition of musical genres using RBF networks. IEEE Transactions on Knowledge and Data Engineering 2005,17(4):580-584.
Article Google Scholar
Duda RO, Hart PE, Stork DG: Pattern Classification. John Wiley & Sons, New York, NY, USA; 2000.
MATH Google Scholar
Bergstra J, Casagrande N, Eck D: Genre classification: timbre and rhythm-based multiresolution audio classification. Proceedings of 1st Annual Music Information Retrieval Evaluation eXchange (MIREX) Genre Classification Contest, September 2005, London, UK
Google Scholar
Li T, Tzanetakis G: Factors in automatic musical genre classification of audio signals. Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA '03), October 2003, New Paltz, NY, USA
Google Scholar
Uitdenbogerd L, Zobel J: Music ranking techniques evaluated. Australian Computer Science Communications 2002,24(1):275-283.
Google Scholar
Kuncheva LI: Combining Pattern Classifiers. John Wiley & Sons, New York, NY, USA; 2004.
Book Google Scholar

Download references

Author information

Authors and Affiliations

Computer Engineering Department, Faculty of Electrical and Electronic Engineering, Istanbul Technical University, Maslak, Sariyer, Istanbul, 34469, Turkey
Zehra Cataltepe, Yusuf Yaslan & Abdullah Sonmez

Authors

Zehra Cataltepe
View author publications
You can also search for this author in PubMed Google Scholar
Yusuf Yaslan
View author publications
You can also search for this author in PubMed Google Scholar
Abdullah Sonmez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zehra Cataltepe.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Cataltepe, Z., Yaslan, Y. & Sonmez, A. Music Genre Classification Using MIDI and Audio Features. EURASIP J. Adv. Signal Process. 2007, 036409 (2007). https://doi.org/10.1155/2007/36409

Download citation

Received: 01 December 2005
Revised: 17 October 2006
Accepted: 19 October 2006
Published: 01 December 2007
DOI: https://doi.org/10.1155/2007/36409

Music Genre Classification Using MIDI and Audio Features

Abstract

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords