ACADI showcase - automatic character indexing in audiovisual document

Authors:
Frédéric Gianni;Julien Pinquier;Ewa Kijak Irisa
Affiliations:
IRIT - Université Paul Sabatier, Toulouse Cedex, France;IRIT - Université Paul Sabatier, Toulouse Cedex, France;Campus de Beaulieu, Rennes Cedex, France
Venue:
Proceedings of the 6th ACM international conference on Image and video retrieval
Year:
2007

Citing 4
Cited 0

Detecting Faces in Images: A Survey

IEEE Transactions on Pattern Analysis and Machine Intelligence
Adaptive speaker identification with audiovisual cues for movie content analysis

Pattern Recognition Letters - Video computing
Multimodal Video Indexing: A Review of the State-of-the-art

Multimedia Tools and Applications
Segregation of speakers for speech recognition and speaker identification

ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a system which permits to describe and structure audiovisual documents without training, nor corpus knowledge, and to visualize with an interface the principal interventions. It posts the most significant person list of the processed documents (news, TV games, variety programs, film, etc.). A person will be considered as significant if she/he speaks or appears on the screen during a minimum time lapse. This list is then presented with representative labels of the character (face or/and sound extract for example). Thanks to this person list, it is possible to listen and/or to view all interventions of each character by clicking on the representation of the selected one. The system, developped in the framework of the Network of Excellence (NoE) MUSCLE, is based on a face detection tool and speaker and costume segmentation tools. The interface allows to visualize (and/or to listen) the only segments where the character of interest appears, without a priori knowledge. We also have the statistics over the speaking time and appearance time of each character.