A Support Vector Machine-Based Dynamic Network for Visual Speech Recognition Applications

Gordan, Mihaela; Kotropoulos, Constantine; Pitas, Ioannis

doi:10.1155/S1110865702207039

Research Article
Published: 28 November 2002

Mihaela Gordan¹,
Constantine Kotropoulos¹ &
Ioannis Pitas¹

EURASIP Journal on Advances in Signal Processing volume 2002, Article number: 427615 (2002)

Abstract

Visual speech recognition is an emerging research field. In this paper, we examine the suitability of support vector machines for visual speech recognition. Each word is modeled as a temporal sequence of visemes corresponding to the different phones realized. One support vector machine is trained to recognize each viseme and its output is converted to a posterior probability through a sigmoidal mapping. To model the temporal character of speech, the support vector machines are integrated as nodes into a Viterbi lattice. We test the performance of the proposed approach on a small visual speech recognition task, namely the recognition of the first four digits in English. The word recognition rate obtained is at the level of the previous best reported rates.
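The pipeline summarized in the abstract (per-viseme SVM outputs, a sigmoidal mapping to posterior probabilities, and a Viterbi lattice over the viseme sequence of a word) can be illustrated with a minimal sketch. This is not the authors' implementation. The sigmoid parameters `a` and `b`, the strictly left-to-right topology, the uniform (omitted) transition probabilities, and the function names `log_posterior` and `word_log_score` are all assumptions made for illustration; the abstract does not specify them.

```python
import math


def sigmoid_posterior(f, a=-1.0, b=0.0):
    """Map a raw SVM output f to a pseudo-posterior 1 / (1 + exp(a*f + b)).

    The parameters a and b are hypothetical; in practice they would be
    fitted on held-out data.
    """
    return 1.0 / (1.0 + math.exp(a * f + b))


def log_posterior(f, a=-1.0, b=0.0):
    """Numerically stable log of sigmoid_posterior(f, a, b)."""
    z = a * f + b
    # log(1 / (1 + exp(z))) = -log(1 + exp(z)), computed without overflow
    if z > 0:
        return -(z + math.log1p(math.exp(-z)))
    return -math.log1p(math.exp(z))


def word_log_score(svm_outputs, a=-1.0, b=0.0):
    """Best-path log-score of one word model through a Viterbi lattice.

    svm_outputs[t][k] is the raw output at frame t of the SVM trained for
    the k-th viseme of the word. Assumed topology: the path starts in the
    first viseme, ends in the last, and at each frame either stays in the
    current viseme or advances to the next. Transitions are treated as
    equally likely, so they do not appear in the score.
    """
    n_states = len(svm_outputs[0])
    neg_inf = float("-inf")

    best = [neg_inf] * n_states
    best[0] = log_posterior(svm_outputs[0][0], a, b)

    for frame in svm_outputs[1:]:
        new = [neg_inf] * n_states
        for k in range(n_states):
            stay = best[k]
            advance = best[k - 1] if k > 0 else neg_inf
            prev = max(stay, advance)
            if prev > neg_inf:
                new[k] = prev + log_posterior(frame[k], a, b)
        best = new

    return best[-1]


# Made-up SVM outputs for a hypothetical two-viseme word over three frames.
matching = [[2.0, -2.0], [2.0, -2.0], [-2.0, 2.0]]
mismatched = [[-2.0, 2.0], [-2.0, 2.0], [2.0, -2.0]]
```

Recognition would then amount to scoring the same frame sequence against each word model and choosing the word with the highest `word_log_score`; here the `matching` sequence scores higher than the `mismatched` one. Working in the log domain avoids underflow when many per-frame probabilities are multiplied along a path.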

Author information

Authors and Affiliations

Department of Informatics, Aristotle University of Thessaloniki, Box 451, Thessaloniki, 54006, Greece
Mihaela Gordan, Constantine Kotropoulos & Ioannis Pitas

Authors

Mihaela Gordan
Constantine Kotropoulos
Ioannis Pitas

Corresponding author

Correspondence to Mihaela Gordan.

About this article

Cite this article

Gordan, M., Kotropoulos, C. & Pitas, I. A Support Vector Machine-Based Dynamic Network for Visual Speech Recognition Applications. EURASIP J. Adv. Signal Process. 2002, 427615 (2002). https://doi.org/10.1155/S1110865702207039

Received: 26 November 2001
Revised: 26 July 2002
Published: 28 November 2002
DOI: https://doi.org/10.1155/S1110865702207039
