CUED Publications database

Expressive visual text-to-speech using active appearance models

Anderson, R and Stenger, B and Wan, V and Cipolla, R (2013) Expressive visual text-to-speech using active appearance models. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. pp. 3382-3389. ISSN 1063-6919

Full text not available from this repository.

Abstract

This paper presents a complete system for expressive visual text-to-speech (VTTS), which is capable of producing expressive output, in the form of a 'talking head', given an input text and a set of continuous expression weights. The face is modeled using an active appearance model (AAM), and several extensions are proposed which make it more applicable to the task of VTTS. The model allows for normalization with respect to both pose and blink state which significantly reduces artifacts in the resulting synthesized sequences. We demonstrate quantitative improvements in terms of reconstruction error over a million frames, as well as in large-scale user studies, comparing the output of different systems. © 2013 IEEE.

Item Type: Article
Subjects: UNSPECIFIED
Divisions: Div F > Machine Intelligence
Depositing User: Cron Job
Date Deposited: 17 Jul 2017 19:13
Last Modified: 26 Oct 2017 01:49
DOI: