Boosting retrieval of digital spoken content

Authors:
Bernardo Pereira Nunes;Alexander Mera;Marco A. Casanova;Ricardo Kawase
Affiliations:
Department of Informatics, PUC-Rio, Rio de Janeiro, RJ, Brazil,L3S Research Center, Leibniz University Hannover, Germany;Department of Informatics, PUC-Rio, Rio de Janeiro, RJ, Brazil;Department of Informatics, PUC-Rio, Rio de Janeiro, RJ, Brazil;L3S Research Center, Leibniz University Hannover, Germany
Venue:
KES'12 Proceedings of the 16th international conference on Knowledge Engineering, Machine Learning and Lattice Computing with Applications
Year:
2012

Citing 8
Cited 0

Google's Deep Web crawl

Proceedings of the VLDB Endowment
An audio indexing system for election video material

ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Learning Deep Web Crawling with Diverse Features

WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Automatic Extraction of Semantic Descriptions from the Lecturer's Speech

ICSC '09 Proceedings of the 2009 IEEE International Conference on Semantic Computing
Augmenting Europeana content with linked data resources

Proceedings of the 6th International Conference on Semantic Systems
Automatic tagging and geotagging in video collections and communities

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
DBpedia spotlight: shedding light on the web of documents

Proceedings of the 7th International Conference on Semantic Systems
Automatic Video Classification: A Survey of the Literature

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews

Quantified Score

Hi-index	0.00

Visualization

Abstract

Every day, the Internet expands as millions of new multimedia objects are uploaded in the form of audio, video and images. While traditional text-based content is indexed by search engines, this indexing cannot be applied to audio and video objects, resulting in a plethora of multimedia content that is inaccessible to a majority of online users. To address this issue, we introduce a technique of automatic, semantically enhanced, description generation for multimedia content. The objective is to facilitate indexing and retrieval of the objects with the help of traditional search engines. Essentially, the technique generates static Web pages automatically, which describe the content of the digital audio and video objects. These descriptions are then organized in such a way as to facilitate locating corresponding audio and video segments. The technique employs a combination of Web services and concurrently provides description translation and semantic enhancement. Thorough analysis of the click-data, comparing accesses to the digital content before and after automatic description generation, suggests a significant increase in the number of retrieval items. This outcome, however is not limited to the terms of visibility, but in supporting multilingual access, additionally decreases the number of language barriers.