Skip to main content

Talking-Face Identity Verification, Audiovisual Forgery, and Robustness Issues


The robustness of a biometric identity verification (IV) system is best evaluated by monitoring its behavior under impostor attacks. Such attacks may include the transformation of one, many, or all of the biometric modalities. In this paper, we present the transformation of both speech and visual appearance of a speaker and evaluate its effects on the IV system. We propose MixTrans, a novel method for voice transformation. MixTrans is a mixture-structured bias voice transformation technique in the cepstral domain, which allows a transformed audio signal to be estimated and reconstructed in the temporal domain. We also propose a face transformation technique that allows a frontal face image of a client speaker to be animated. This technique employs principal warps to deform defined MPEG-4 facial feature points based on determined facial animation parameters (FAPs). The robustness of the IV system is evaluated under these attacks.

Publisher note

To access the full article, please see PDF.

Author information



Corresponding author

Correspondence to Walid Karam.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Karam, W., Bredin, H., Greige, H. et al. Talking-Face Identity Verification, Audiovisual Forgery, and Robustness Issues. EURASIP J. Adv. Signal Process. 2009, 746481 (2009).

Download citation


  • Face Image
  • Audio Signal
  • Transformation Technique
  • Identity Verification
  • Facial Animation