A Robust Formant Extraction Algorithm Combining Spectral Peak Picking and Root Polishing

Kim, Chanwoo; Seo, Kwang-deok; Sung, Wonyong

doi:10.1155/ASP/2006/67960

Research Article
Open access
Published: 01 December 2006

A Robust Formant Extraction Algorithm Combining Spectral Peak Picking and Root Polishing

Chanwoo Kim¹,
Kwang-deok Seo² &
Wonyong Sung³

EURASIP Journal on Advances in Signal Processing volume 2006, Article number: 067960 (2006) Cite this article

2850 Accesses
4 Citations
Metrics details

Abstract

We propose a robust formant extraction algorithm that combines the spectral peak picking, formants location examining for peak merger checking, and the root extraction methods. The spectral peak picking method is employed to locate the formant candidates, and the root extraction is used for solving the peak merger problem. The location and the distance between the extracted formants are also utilized to efficiently find out suspected peak mergers. The proposed algorithm does not require much computation, and is shown to be superior to previous formant extraction algorithms through extensive tests using TIMIT speech database.

References

Rabiner LR, Schafer RW: Digital Processing of Speech Signals. Prentice-Hall, Englewood Cliffs, NJ, USA; 1978.
Google Scholar
Snell RC, Milinazzo F: Formant location from LPC analysis data. IEEE Transactions on Speech and Audio Processing 1993, 1(2):129–134. 10.1109/89.222882
Article Google Scholar
McCandless SS: An algorithm for automatic formant extraction using linear prediction spectra. IEEE Transactions on Acoustics, Speech, and Signal Processing 1974, 22(2):135–141. 10.1109/TASSP.1974.1162559
Article Google Scholar
Welling L, Ney H: Formant estimation for speech recognition. IEEE Transactions on Speech and Audio Processing 1998, 6(1):36–48. 10.1109/89.650308
Article Google Scholar
Dellar JR Jr., Proakis JG, Hansen JHL: Discrete-Time Processing of Speech Signals. Macmillan, New York, NY, USA; 1993.
Google Scholar
Garofolo JS, Lamel LF, Fisher WM, Fiscus JG, Pallett DS, Dahlgren NL: Darpa TIMIT acoustic-phonetic continuous speech corpus. In Tech. Rep. NISTIR 4930. U.S. Department of Commerce, National Institute of Standards and Technology, Gaithersburg, Md, USA; 1993.
Google Scholar
Peterson GE, Barney HL: Control methods used in a study of the vowels. Journal of the Acoustical Society of America 1952, 24(2):175–194. 10.1121/1.1906875
Article Google Scholar
Kim C, Sung W: Vowel pronunciation accuracy checking system based on phoneme segmentation and formants extraction. Proceedings of International Conference on Speech Processing, August 2001, Daejeon, Korea 447–452.
Google Scholar
Markel JD: Digital inverse filtering: a new tool for formant trajectory estimation. IEEE Transactions on Audio and Electroacoustics 1972, 20(2):129–137. 10.1109/TAU.1972.1162367
Article MathSciNet Google Scholar
Atal BS, Hanauer SL: Speech analysis and synthesis by linear prediction of the speech wave. Journal of the Acoustical Society of America 1971, 50(2B):637–655. 10.1121/1.1912679
Article Google Scholar
Kang GS, Coulter DC: 600 bits per second voice digitizer (linear predictive formant vocoder). Naval Research Laboratory Report 8043 November 1976.
Google Scholar
Bell CG, Fujisaki H, Heinz JM, Stevens KN, House AS: Reduction of speech spectra by analysis-by-synthesis techniques. Journal of the Acoustical Society of America 1961, 33(12):1725–1736. 10.1121/1.1908556
Article Google Scholar
Press WH, Teukolsky SA, Vetterling WT, Flannery BP: Numerical Recipes in C. Cambridge University Press, Cambridge, UK; 1992. pp. 376
MATH Google Scholar
Burden RL, Faires JD: Numerical Analysis. Brooks/Cole, Pacific Grove, Calif, USA; 1997.
MATH Google Scholar
Dunn HK: Methods of measuring vowel formant bandwidths. Journal of the Acoustical Society of America 1961, 33(12):1737-1746. 10.1121/1.1908558
Article Google Scholar
WaveSurfer Center for Speech Technology (CTT) at KTH, Stockholm, Sweden, available at https://doi.org/www.speech.kth.se/wavesurfer/

Download references

Author information

Authors and Affiliations

School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 15213-3891, USA
Chanwoo Kim
Computer and Telecommunications Engineering Division, Yonsei University, Wonju, Gangwon, 220-710, Korea
Kwang-deok Seo
School of Electrical Engineering and Computer Science, Seoul National University, Gwanak-gu, Seoul, 151-744, Korea
Wonyong Sung

Authors

Chanwoo Kim
View author publications
You can also search for this author in PubMed Google Scholar
Kwang-deok Seo
View author publications
You can also search for this author in PubMed Google Scholar
Wonyong Sung
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Kim, C., Seo, Kd. & Sung, W. A Robust Formant Extraction Algorithm Combining Spectral Peak Picking and Root Polishing. EURASIP J. Adv. Signal Process. 2006, 067960 (2006). https://doi.org/10.1155/ASP/2006/67960

Download citation

Received: 22 September 2004
Revised: 27 July 2005
Accepted: 22 August 2005
Published: 01 December 2006
DOI: https://doi.org/10.1155/ASP/2006/67960

A Robust Formant Extraction Algorithm Combining Spectral Peak Picking and Root Polishing

Abstract

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords