Detecting Faces in Images: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence
Adaptive speaker identification with audiovisual cues for movie content analysis
Pattern Recognition Letters - Video computing
Multimodal Video Indexing: A Review of the State-of-the-art
Multimedia Tools and Applications
Segregation of speakers for speech recognition and speaker identification
ICASSP '91 Proceedings of the Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference
Hi-index | 0.00 |
We propose a system which permits to describe and structure audiovisual documents without training, nor corpus knowledge, and to visualize with an interface the principal interventions. It posts the most significant person list of the processed documents (news, TV games, variety programs, film, etc.). A person will be considered as significant if she/he speaks or appears on the screen during a minimum time lapse. This list is then presented with representative labels of the character (face or/and sound extract for example). Thanks to this person list, it is possible to listen and/or to view all interventions of each character by clicking on the representation of the selected one. The system, developped in the framework of the Network of Excellence (NoE) MUSCLE, is based on a face detection tool and speaker and costume segmentation tools. The interface allows to visualize (and/or to listen) the only segments where the character of interest appears, without a priori knowledge. We also have the statistics over the speaking time and appearance time of each character.