Audio Key Finding: Considerations in System Design and Case Studies on Chopin's 24 Preludes

Chuan, Ching-Hua; Chew, Elaine

doi:10.1155/2007/56561

Research Article
Open access
Published: 01 December 2006

Audio Key Finding: Considerations in System Design and Case Studies on Chopin's 24 Preludes

Ching-Hua Chuan¹ &
Elaine Chew²

EURASIP Journal on Advances in Signal Processing volume 2007, Article number: 056561 (2006) Cite this article

1512 Accesses
15 Citations
3 Altmetric
Metrics details

Abstract

We systematically analyze audio key finding to determine factors important to system design, and the selection and evaluation of solutions. First, we present a basic system, fuzzy analysis spiral array center of effect generator algorithm, with three key determination policies: nearest-neighbor (NN), relative distance (RD), and average distance (AD). AD achieved a 79% accuracy rate in an evaluation on 410 classical pieces, more than 8% higher RD and NN. We show why audio key finding sometimes outperforms symbolic key finding. We next propose three extensions to the basic key finding system—the modified spiral array (mSA), fundamental frequency identification (F0), and post-weight balancing (PWB)—to improve performance, with evaluations using Chopin's Preludes (Romantic repertoire was the most challenging). F0 provided the greatest improvement in the first 8 seconds, while mSA gave the best performance after 8 seconds. Case studies examine when all systems were correct, or all incorrect.

References

Chew E: Towards a mathematical model of tonality, Doctoral dissertation.
Chew E: Modeling tonality: applications to music cognition. Proceedings of the 23rd Annual Meeting of the Cognitive Science Society (CogSci '01), August 2001, Edinburgh, Scotland, UK 206–211.
Google Scholar
Chuan C-H, Chew E: Fuzzy analysis in pitch-class determination for polyphonic audio key finding. Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR '05), September 2005, London, UK 296–303.
Google Scholar
Longuet-Higgins HC, Steedman MJ: On interpreting bach. In Machine Intelligence. Volume 6. Edinburgh University Press, Edinburgh, Scotland, UK; 1971:221–241.
Google Scholar
Krumhansl CL: Quantifying tonal hierarchies and key distances. In Cognitive Foundations of Musical Pitch. Oxford University Press, New York, NY, USA; 1990:16–49. chapter 2
Google Scholar
Temperley D: What's key for key? the Krumhansl-Schmuckler key-finding algorithm reconsidered. Music Perception 1999,17(1):65–100.
Article Google Scholar
Chuan C-H, Chew E: Polyphonic audio key finding using the spiral array CEG algorithm. Proceedings of IEEE International Conference on Multimedia and Expo (ICME '05), July 2005, Amsterdam, The Netherlands 21–24.
Google Scholar
Gómez E, Herrera P: Estimating the tonality of polyphonic audio files: cognitive versus machine learning modelling strategies. Proceedings of 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, Spain 92–95.
Google Scholar
Pauws S: Musical key extraction from audio. Proceedings of 5th International Conference on Music Information Retrieval (ISMIR '04), October 2004, Barcelona, Spain 96–99.
Google Scholar
1st Annual Music Information Retrieval Evaluation eXchange, MIREX 2005, https://doi.org/www.music-ir.org/mirex2005/index.php/Main_Page
Chuan C-H, Chew E: Audio key finding using FACEG: fuzzy analysis with the CEG algorithm. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Google Scholar
Gómez E: Key estimation from polyphonic audio. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Google Scholar
İzmirli Ö: An algorithm for audio key finding. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Google Scholar
Pauws S: KEYEX: audio key extraction. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Google Scholar
Purwins H, Blankertz B: Key finding in audio. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Google Scholar
Zhu Y: An audio key finding algorithm. Abstract of the 1st Annual Music Information Retrieval Evaluation eXchange (MIREX '05), September 2005, London, UK
Google Scholar
Chew E, François ARJ: Interactive multi-scale visualizations of tonal evolution in MuSA.RT Opus 2. Computers in Entertainment 2005,3(4):1–16. special issue on Music Visualization
Article Google Scholar
Chew E, Chen Y-C: Mapping MIDI to the spiral array: disambiguating pitch spellings. Proceedings of the 8th INFORMS Computing Society Conference (ICS '03), January 2003, Chandler, Ariz, USA 259–275.
Google Scholar
Chew E, Chen Y-C: Real-time pitch spelling using the spiral array. Computer Music Journal 2005,29(2):61–76. 10.1162/0148926054094378
Article Google Scholar
İzmirli Ö: Template based key finding from audio. Proceedings of the International Computer Music Conference (ICMC '05), September 2005, Barcelona, Spain
Google Scholar
Electronic Music Studios in the University of Iowa, https://doi.org/theremin.music.uiowa.edu/MIS.html
Klapuri AP: Multiple fundamental frequency estimation based on harmonicity and spectral smoothness. IEEE Transactions on Speech and Audio Processing 2003,11(6):804–816. 10.1109/TSA.2003.815516
Article Google Scholar
Klapuri A: A perceptually motivated multiple-F0 estimation method. Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, October 2005, New Paltz, NY, USA
Google Scholar

Download references

Author information

Authors and Affiliations

Integrated Media Systems Center, Department of Computer Science, USC Viterbi School of Engineering, University of Southern California, Los Angeles, CA, 90089-0781, USA
Ching-Hua Chuan
Integrated Media Systems Center, Epstein Department of Industrial and Systems Engineering, USC Viterbi School of Engineering, University of Southern California, Los Angeles, CA, 90089-0193, USA
Elaine Chew

Authors

Ching-Hua Chuan
View author publications
You can also search for this author in PubMed Google Scholar
Elaine Chew
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ching-Hua Chuan.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Chuan, CH., Chew, E. Audio Key Finding: Considerations in System Design and Case Studies on Chopin's 24 Preludes. EURASIP J. Adv. Signal Process. 2007, 056561 (2006). https://doi.org/10.1155/2007/56561

Download citation

Received: 08 December 2005
Revised: 31 May 2006
Accepted: 22 June 2006
Published: 01 December 2006
DOI: https://doi.org/10.1155/2007/56561

Audio Key Finding: Considerations in System Design and Case Studies on Chopin's 24 Preludes

Abstract

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords