Real-time vision and speech driven avatars for multimedia applications

Oliver Schreer, Roman Englert, Peter Eisert, Ralf Tanger

Research output: Contribution to journalArticlepeer-review

17 Scopus citations

Abstract

Recent progress in advanced video communication services and multimedia applications is grounded on novel human machine interfaces, improved usability, and user friendliness driven by user centric research and development. In this paper, we describe a complete system concept and algorithmic details of an example application within this area. The key features of the system are vision and speech based interfaces, which are used to animate an avatar for an audio-visual representation of a communication partner. The system is applied in two application scenarios, namely video chat and customer care services. Both applications are mass-market oriented and therefore careful design and development of robust and supporting user interfaces are required. The presented approach is integrated into a complete real-time prototype system, which is permanently demonstrated in the showcase at the head quarter of Deutsche Telekom, Bonn, Germany.

Original languageEnglish
Article number4456693
Pages (from-to)352-360
Number of pages9
JournalIEEE Transactions on Multimedia
Volume10
Issue number3
DOIs
StatePublished - 1 Apr 2008

Keywords

  • Avatar
  • Multimodality
  • Real-time tracking
  • Segmentation

ASJC Scopus subject areas

  • Signal Processing
  • Media Technology
  • Computer Science Applications
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Real-time vision and speech driven avatars for multimedia applications'. Together they form a unique fingerprint.

Cite this