CONTENTUS--technologies for next generation multimedia libraries

Authors:
Jan Nandzik;Berenike Litz;Nicolas Flores-Herr;Aenne Löhden;Iuliu Konya;Doris Baum;André Bergholz;Dirk Schönfuβ;Christian Fey;Johannes Osterhoff;Jörg Waitelonis;Harald Sack;Ralf Köhler;Patrick Ndjiki-Nya
Affiliations:
Acosta Consult GmbH, Frankfurt am Main, Germany 60318;Deutsche Nationalbibliothek, Informationstechnik, Frankfurt am Main, Germany 60322;Acosta Consult GmbH, Frankfurt am Main, Germany 60318;Deutsche Nationalbibliothek, Informationstechnik, Frankfurt am Main, Germany 60322;Fraunhofer IAIS, Sankt Augustin, Germany 53754;Fraunhofer IAIS, Sankt Augustin, Germany 53754;Fraunhofer IAIS, Sankt Augustin, Germany 53754;mufin GmbH, Büro Dresden, Dresden, Germany 01219;Institut für Rundfunktechnik GmbH, Production Systems TV, München, Germany 80939;Hasso-Plattner-Institut für Softwaresystemtechnik GmbH, Potsdam, Germany 14482;Hasso-Plattner-Institut für Softwaresystemtechnik GmbH, Potsdam, Germany 14482;Hasso-Plattner-Institut für Softwaresystemtechnik GmbH, Potsdam, Germany 14482;Technicolor - Corporate Research Division, Hanover Image Processing Lab, Deutsche Thomson OHG, Hannover, Germany 30625;Fraunhofer-Institut für Nachrichtentechnik Heinrich-Hertz-Institut, Berlin, Germany 10587
Venue:
Multimedia Tools and Applications
Year:
2013

Citing 34
Cited 0

Using collaborative filtering to weave an information tapestry

Communications of the ACM - Special issue on information filtering
Document Representation and Its Application to Page Decomposition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Two Geometric Algorithms for Layout Analysis

DAS '02 Proceedings of the 5th International Workshop on Document Analysis Systems V
Semantic search

WWW '03 Proceedings of the 12th international conference on World Wide Web
Introduction to MPEG-7: Multimedia Content Description Interface

Introduction to MPEG-7: Multimedia Content Description Interface
Form Frame Line Detection with Directional Single-Connected Chain

ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Evaluation campaigns and TRECVid

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Semantic Multimedia: First International Conference on Semantic and Digital Media Technologies, SAMT 2006Athens, Greece, December 6-8, 2006Proceedings (Lecture Notes in Computer Science)

Semantic Multimedia: First International Conference on Semantic and Digital Media Technologies, SAMT 2006Athens, Greece, December 6-8, 2006Proceedings (Lecture Notes in Computer Science)
A review of text and image retrieval approaches for broadcast news video

Information Retrieval
Enabling MPEG-7 structural and semantic descriptions in retrieval applications

Journal of the American Society for Information Science and Technology
Semantic Multimedia and Ontologies: Theory and Applications

Semantic Multimedia and Ontologies: Theory and Applications
The MultiMatch Prototype: Multilingual/Multimedia Search for Cultural Heritage Objects

ECDL '08 Proceedings of the 12th European conference on Research and Advanced Technology for Digital Libraries
Flickr distance

MM '08 Proceedings of the 16th ACM international conference on Multimedia
The MIR flickr retrieval evaluation

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
From MPEG-7 user interaction tools to hanging basket models: bridging the gap

Multimedia Tools and Applications
Concept-Based Video Retrieval

Foundations and Trends in Information Retrieval
Design challenges and misconceptions in named entity recognition

CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Constant-Time Locally Optimal Adaptive Binarization

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
ICDAR 2009 Page Segmentation Competition

ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
A no-reference objective image sharpness metric based on the notion of just noticeable blur (JNB)

IEEE Transactions on Image Processing
A survey of collaborative filtering techniques

Advances in Artificial Intelligence
A system that learns to tag videos by watching youtube

ICVS'08 Proceedings of the 6th international conference on Computer vision systems
How to Build a Digital Library, Second Edition

How to Build a Digital Library, Second Edition
Restoration of digitized video sequences: an efficient drop-out detection and removal framework

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Scratch detection supported by coherency analysis of motion vector fields

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Temporal video structuring for preservation and annotation of video content

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
BIC-based speaker segmentation using divide-and-conquer strategies with application to speaker diarization

IEEE Transactions on Audio, Speech, and Language Processing
Visual-Concept Search Solved?

Computer
Shiatsu: semantic-based hierarchical automatic tagging of videos by segmentation using cuts

Proceedings of the 3rd international workshop on Automated information extraction in media production
Exploratory Semantic Video Search with yovisto

ICSC '10 Proceedings of the 2010 IEEE Fourth International Conference on Semantic Computing
A survey of semantic image and video annotation tools

Knowledge-driven multimedia information extraction and ontology evolution
Automatic table detection in document images

ICAPR'05 Proceedings of the Third international conference on Advances in Pattern Recognition - Volume Part I
Semantic Image and Video Indexing in Broad Domains

IEEE Transactions on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

An ever-growing amount of digitized content urges libraries and archives to integrate new media types from a large number of origins such as publishers, record labels and film archives, into their existing collections. This is a challenging task, since the multimedia content itself as well as the associated metadata is inherently heterogeneous--the different sources lead to different data structures, data quality and trustworthiness. This paper presents the contentus approach towards an automated media processing chain for cultural heritage organizations and content holders. Our workflow allows for unattended processing from media ingest to availability thorough our search and retrieval interface. We aim to provide a set of tools for the processing of digitized print media, audio/visual, speech and musical recordings. Media specific functionalities include quality control for digitization of still image and audio/visual media and restoration of the most common quality issues encountered with these media. Furthermore, the contentus tools include modules for content analysis like segmentation of printed, audio and audio/visual media, optical character recognition (OCR), speech-to-text transcription, speaker recognition and the extraction of musical features from audio recordings, all aimed at a textual representation of information inherent within the media assets. Once the information is extracted and transcribed in textual form, media independent processing modules offer extraction and disambiguation of named entities and text classification. All contentus modules are designed to be flexibly recombined within a scalable workflow environment using cloud computing techniques. In the next step analyzed media assets can be retrieved and consumed through a search interface using all available metadata. The search engine combines Semantic Web technologies for representing relations between the media and entities such as persons, locations and organizations with a full-text approach for searching within transcribed information gathered through the preceding processing steps. The contentus unified search interface integrates text, images, audio and audio/visual content. Queries can be narrowed and expanded in an exploratory manner, search results can be refined by disambiguating entities and topics. Further, semantic relationships become not only apparent, but can also be navigated.