Generating Referring Expressions in a Multimodal Environment
Proceedings of the 6th International Workshop on Natural Language Generation: Aspects of Automated Natural Language Generation
Pointing combined with verbal referring is one of the most paradigmatic human multimodal behaviours. The aim of this paper is foundational: to uncover the central notions required for a computational model of multimodal referring acts that include a pointing gesture. The paper draws on existing work on the generation of referring expressions and shows that, in order to extend that work with pointing, the notion of salience needs to play a pivotal role. The paper starts by investigating the role of salience in the generation of referring expressions and introduces a distinction between two opposing approaches: salience-first and salience-last accounts. It then argues that these approaches not only differ in computational efficiency, as has been pointed out previously, but also make incompatible empirical predictions. The second half of the paper shows how a salience-first account meshes well with a range of existing empirical findings on multimodal reference. A novel account of the circumstances under which speakers choose to point is proposed that directly links salience with pointing. Finally, this account is placed within a multi-dimensional model of salience for multimodal reference.
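The salience-first/salience-last contrast mentioned in the abstract can be made concrete with a toy sketch. In the sketch below (all names, the two-object domain, and the numeric salience scores are invented for illustration, not taken from the paper), a salience-first generator first restricts the set of distractors to objects at least as salient as the target and then builds a distinguishing description, whereas a salience-last generator distinguishes the target against the full domain. The two strategies can yield different expressions for the same target, which is one way the "incompatible empirical predictions" could surface:

```python
# Toy illustration of salience-first vs. salience-last referring-expression
# generation. Domain, attributes, and salience scores are hypothetical.
from typing import Dict, List

Obj = Dict[str, object]

DOMAIN: List[Obj] = [
    {"id": "d1", "type": "dog", "color": "brown", "salience": 3},
    {"id": "d2", "type": "dog", "color": "black", "salience": 1},
    {"id": "c1", "type": "cat", "color": "brown", "salience": 1},
]


def distinguishing_props(target: Obj, distractors: List[Obj]) -> Dict[str, object]:
    """Greedily add attribute-value pairs until no distractors remain."""
    desc: Dict[str, object] = {}
    remaining = list(distractors)
    for attr in ("type", "color"):
        if not remaining:
            break
        value = target[attr]
        if any(o[attr] != value for o in remaining):
            desc[attr] = value
            remaining = [o for o in remaining if o[attr] == value]
    return desc


def salience_first(target: Obj, domain: List[Obj]) -> Dict[str, object]:
    """Restrict the context to objects at least as salient as the target,
    then distinguish the target only within that reduced context."""
    context = [o for o in domain
               if o["id"] != target["id"] and o["salience"] >= target["salience"]]
    return distinguishing_props(target, context)


def salience_last(target: Obj, domain: List[Obj]) -> Dict[str, object]:
    """Distinguish the target against the full domain; salience plays no
    role in selecting the descriptive content."""
    distractors = [o for o in domain if o["id"] != target["id"]]
    return distinguishing_props(target, distractors)


target = DOMAIN[0]  # the highly salient brown dog
print(salience_first(target, DOMAIN))  # no properties needed: "it" suffices
print(salience_last(target, DOMAIN))   # full description: "the brown dog"
```

For the highly salient target, the salience-first account licenses a minimal expression (no distractors survive the salience filter), while the salience-last account still produces a fully distinguishing description; that divergence is the kind of testable difference the paper attributes to the two accounts.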