Skip to main content

Three-Dimensional Facial Adaptation for MPEG-4 Talking Heads

Abstract

This paper studies a new method for three-dimensional (3D) facial model adaptation and its integration into a text-to-speech (TTS) system. The 3D facial adaptation requires a set of two orthogonal views of the user′s face with a number of feature points located on both views. Based on the correspondences of the feature points′ positions, a generic face model is deformed nonrigidly treating every facial part as a separate entity. A cylindrical texture map is then built from the two image views. The generated head models are compared to corresponding models obtained by the commonly used adaptation method that utilizes 3D radial bases functions. The generated 3D models are integrated into a talking head system, which consists of two distinct parts: a multilingual text to speech sub-system and an MPEG-4 compliant facial animation sub-system. Support for the Greek language has been added, while preserving lip and speech synchronization.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Nikos Grammalidis.

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Grammalidis, N., Sarris, N., Deligianni, F. et al. Three-Dimensional Facial Adaptation for MPEG-4 Talking Heads. EURASIP J. Adv. Signal Process. 2002, 764808 (2002). https://doi.org/10.1155/S1110865702206113

Download citation

Keywords

  • MPEG-4
  • 3D model based coding
  • text to speech
  • facial adaptation
  • talking face