Lessons for the future from a decade of Informedia video analysis research

Authors:
Alexander G. Hauptmann
Affiliations:
School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
Venue:
CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
Year:
2005

Abstract

The overarching goal of the Informedia Digital Video Library project has been to achieve machine understanding of video media, including all aspects of search, retrieval, visualization and summarization in both contemporaneous and archival content collections. The base technology developed by the Informedia project combines speech, image and natural language understanding to automatically transcribe, segment and index broadcast video for intelligent search and image retrieval. While speech processing has been the most influential component in the success of the Informedia project, other modalities can be critical in various situations. Evaluations done in the context of the TRECVID benchmarks show that while some progress has been made, there is still a lot of work ahead. The fundamental “semantic gap” still exists, but there are a number of promising approaches to bridging it.