LSA-based automatic acquisition of semantic image descriptions

  • Authors:
  • Roberto Basili;Riccardo Petitti;Dario Saracino

  • Affiliations:
  • University of Rome "Tor Vergata", Department of Computer Science, Systems and Production, Roma, Italy;Exprivia S.p.A, Roma, Italy;Exprivia S.p.A, Roma, Italy

  • Venue:
  • SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Web multimedia documents are characterized by visual and linguistic information expressed by structured pages of images and texts. The suitable combinations able to generalize semantic aspects of the over-all multimedia information clearly depend on applications. In this paper, an unsupervised image classification technique combining features from different media levels is proposed. In particular linguistic descriptions derived through Information Extraction from Web pages are here integrated with visual features by means of Latent Semantic Analysis. Although the higher expressivity increases the complexity of the learning process, the dimensionality reduction implied by LSA makes it largely applicable. The evaluation over an image classification task confirms that the proposed model outperforms other methods acting on the individual levels. The resulting method is cost-effective and can be easily applied to semi-automatic image semantic labeling tasks as foreseen in collaborative annotation scenarios.