Embodied conversational agents in Wizard-of-Oz and multimodal interaction applications

Authors:
Matej Rojc;Tomaž Rotovnik;Mišo Brus;Dušan Jan;Zdravko Kačič
Affiliations:
University of Maribor, Faculty of Electrical Engineering and Computer Science, Maribor, Slovenia;University of Maribor, Faculty of Electrical Engineering and Computer Science, Maribor, Slovenia;Agito d.o.o., Ljubljana, Slovenia;Agito d.o.o., Ljubljana, Slovenia;University of Maribor, Faculty of Electrical Engineering and Computer Science, Maribor, Slovenia
Venue:
COST 2102'07 Proceedings of the 2007 COST action 2102 international conference on Verbal and nonverbal communication behaviours
Year:
2007

Abstract

Embodied conversational agents employed in multimodal interaction applications have the potential to achieve properties similar to those of humans in face-to-face conversation. They enable the inclusion of both verbal and nonverbal communication, so the degree of personalization of the user interface is much higher than in other human-computer interfaces. This contributes greatly to the naturalness and user-friendliness of the interface and opens up a wide range of possible applications. This paper presents two implementations of embodied conversational agents in human-computer interaction: the first in a Wizard-of-Oz application and the second in a dialogue system. In the Wizard-of-Oz application, the embodied conversational agent conveys the operator's spoken information to the user with whom the operator communicates. Depending on the application scenario, the user may or may not be aware of the operator's involvement. The operator can communicate with the user over an audio/visual or audio-only channel. The paper describes an application setup that enables remote communication with the user, in which the user is unaware of the operator's involvement. A real-time viseme recognizer is needed to ensure a proper response from the agent. In addition, the implementation of the embodied conversational agent Lili, which hosts an entertainment show broadcast by RTV Slovenia, is described in more detail. The use of an embodied conversational agent as a virtual major-domo named Maja within an intelligent ambience, using a speech recognition system and the PLATTOS TTS system, is also described.