Open Access

Three-Dimensional Facial Adaptation for MPEG-4 Talking Heads

  • Nikos Grammalidis1Email author,
  • Nikos Sarris2,
  • Fani Deligianni1 and
  • Michael G. Strintzis1
EURASIP Journal on Advances in Signal Processing20022002:764808

https://doi.org/10.1155/S1110865702206113

Received: 31 August 2001

Published: 22 October 2002

Abstract

This paper studies a new method for three-dimensional (3D) facial model adaptation and its integration into a text-to-speech (TTS) system. The 3D facial adaptation requires a set of two orthogonal views of the user′s face with a number of feature points located on both views. Based on the correspondences of the feature points′ positions, a generic face model is deformed nonrigidly treating every facial part as a separate entity. A cylindrical texture map is then built from the two image views. The generated head models are compared to corresponding models obtained by the commonly used adaptation method that utilizes 3D radial bases functions. The generated 3D models are integrated into a talking head system, which consists of two distinct parts: a multilingual text to speech sub-system and an MPEG-4 compliant facial animation sub-system. Support for the Greek language has been added, while preserving lip and speech synchronization.

Keywords

MPEG-4 3D model based coding text to speech facial adaptation talking face

Authors’ Affiliations

(1)
Informatics and Telematics Institute, Centre for Research and Technology Hellas
(2)
Information Processing Laboratory, Electrical and Computer Engineering Department, Aristotle University of Thessaloniki

Copyright

© Grammalidis et al. 2002