An ever-growing amount of digitized content urges libraries and archives to integrate new media types from a large number of origins, such as publishers, record labels and film archives, into their existing collections. This is a challenging task, since the multimedia content itself as well as the associated metadata is inherently heterogeneous: the different sources lead to different data structures, data quality and trustworthiness. This paper presents the contentus approach towards an automated media processing chain for cultural heritage organizations and content holders. Our workflow allows for unattended processing from media ingest to availability through our search and retrieval interface. We aim to provide a set of tools for the processing of digitized print media, audio/visual, speech and musical recordings. Media-specific functionalities include quality control for the digitization of still image and audio/visual media and restoration of the most common quality issues encountered with these media. Furthermore, the contentus tools include modules for content analysis such as segmentation of printed, audio and audio/visual media, optical character recognition (OCR), speech-to-text transcription, speaker recognition and the extraction of musical features from audio recordings, all aimed at a textual representation of the information inherent in the media assets. Once the information is extracted and transcribed in textual form, media-independent processing modules offer extraction and disambiguation of named entities and text classification. All contentus modules are designed to be flexibly recombined within a scalable workflow environment using cloud computing techniques. In the next step, analyzed media assets can be retrieved and consumed through a search interface using all available metadata.
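To make the idea of recombinable modules more concrete, the following is a minimal sketch of how media-specific steps might feed a shared, media-independent step. It is not the contentus implementation: the names `MediaAsset`, `fake_ocr`, `fake_transcribe`, `tag_entities` and `process` are hypothetical, the OCR and transcription steps are stand-ins that copy text from metadata, and the entity tagger is a toy heuristic based on capitalization.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class MediaAsset:
    """Hypothetical container for one ingested media item."""
    asset_id: str
    media_type: str  # "print", "audio" or "video"
    text: str = ""   # textual representation produced by analysis steps
    metadata: Dict[str, object] = field(default_factory=dict)


Step = Callable[[MediaAsset], MediaAsset]


def fake_ocr(asset: MediaAsset) -> MediaAsset:
    # Stand-in for an OCR module: copies pre-supplied text.
    asset.text = str(asset.metadata.get("scanned_text", ""))
    return asset


def fake_transcribe(asset: MediaAsset) -> MediaAsset:
    # Stand-in for a speech-to-text module: copies pre-supplied text.
    asset.text = str(asset.metadata.get("spoken_text", ""))
    return asset


def tag_entities(asset: MediaAsset) -> MediaAsset:
    # Toy media-independent step: treat capitalized tokens as entities.
    tokens = [t.strip(".,") for t in asset.text.split()]
    asset.metadata["entities"] = sorted({t for t in tokens if t[:1].isupper()})
    return asset


# Media-specific steps produce text; shared steps then run on any asset.
MEDIA_STEPS: Dict[str, List[Step]] = {
    "print": [fake_ocr],
    "audio": [fake_transcribe],
    "video": [fake_transcribe],
}
SHARED_STEPS: List[Step] = [tag_entities]


def process(asset: MediaAsset) -> MediaAsset:
    """Run the media-specific steps, then the media-independent ones."""
    for step in MEDIA_STEPS[asset.media_type] + SHARED_STEPS:
        asset = step(asset)
    return asset


page = process(MediaAsset("a1", "print",
                          metadata={"scanned_text": "Goethe lived in Weimar."}))
print(page.metadata["entities"])
```

Because every step shares one signature, the chain can be reordered or extended by editing the step lists alone, which is one simple way to obtain the kind of flexibility the abstract describes.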
The search engine combines Semantic Web technologies for representing relations between the media and entities such as persons, locations and organizations with a full-text approach for searching within the transcribed information gathered through the preceding processing steps. The contentus unified search interface integrates text, images, audio and audio/visual content. Queries can be narrowed and expanded in an exploratory manner, and search results can be refined by disambiguating entities and topics. Furthermore, semantic relationships not only become apparent but can also be navigated.
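The combination of full-text search with entity-based refinement can be illustrated with a small sketch. Everything here is an assumption made for illustration: the in-memory `INDEX`, the entity identifiers such as `person:goethe`, and the functions `search` and `entity_facets` are hypothetical and do not reflect the actual contentus search engine or its Semantic Web data model.

```python
from typing import Dict, List, Optional

# Hypothetical toy index: asset id -> transcribed text and linked entity ids.
INDEX = {
    "book-1": {"text": "letters written in weimar",
               "entities": {"person:goethe", "place:weimar"}},
    "film-7": {"text": "documentary about weimar theatre",
               "entities": {"place:weimar"}},
    "tape-3": {"text": "radio interview on theatre",
               "entities": {"person:brecht"}},
}


def search(query: str, entity: Optional[str] = None) -> List[str]:
    """Return ids of assets whose text contains all query terms.

    If `entity` is given, keep only assets linked to that entity,
    which narrows the full-text result set.
    """
    terms = query.lower().split()
    hits = []
    for asset_id, doc in INDEX.items():
        words = doc["text"].split()
        if not all(term in words for term in terms):
            continue
        if entity is not None and entity not in doc["entities"]:
            continue
        hits.append(asset_id)
    return sorted(hits)


def entity_facets(asset_ids: List[str]) -> Dict[str, int]:
    """Count linked entities across a result set, as refinement options."""
    counts: Dict[str, int] = {}
    for asset_id in asset_ids:
        for ent in INDEX[asset_id]["entities"]:
            counts[ent] = counts.get(ent, 0) + 1
    return counts


broad = search("weimar")                            # full-text only
narrow = search("weimar", entity="person:goethe")   # refined by entity
print(broad, narrow, entity_facets(broad))
```

In this sketch the facet counts suggest which entities a user could select to narrow a query, and dropping the entity argument expands it again, mirroring the exploratory narrowing and expanding described above.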