Computational linguistics for metadata building (CLiMB): using text mining for the automatic identification, categorization, and disambiguation of subject terms for image metadata

  • Authors:
  • Judith L. Klavans;Carolyn Sheffield;Eileen Abels;Jimmy Lin;Rebecca Passonneau;Tandeep Sidhu;Dagobert Soergel

  • Affiliations:
  • iSchool, University of Maryland, College Park, USA and Computational Linguistics and Information Processing Laboratory (CLIP), University of Maryland, College Park, USA and University of Maryland ...;iSchool, University of Maryland, College Park, USA;College of Information Science and Technology, Drexel University, Philadelphia, USA;iSchool, University of Maryland, College Park, USA and Computational Linguistics and Information Processing Laboratory (CLIP), University of Maryland, College Park, USA and University of Maryland ...;Center for Computational Learning Systems, Columbia University, New York, USA;iSchool, University of Maryland, College Park, USA;iSchool, University of Maryland, College Park, USA

  • Venue:
  • Multimedia Tools and Applications
  • Year:
  • 2009

Abstract

In this paper, we present a system that uses computational linguistic techniques to extract metadata for image access. We discuss the implementation, functionality, and evaluation of an image catalogers' toolkit developed in the Computational Linguistics for Metadata Building (CLiMB) research project. We have tested components of the system, including phrase finding for the art and architecture domain, functional semantic labeling using machine learning, and disambiguation of terms in domain-specific text vis-à-vis a rich thesaurus of subject terms, geographic names, and artist names. We present specific results on disambiguation techniques and on the nature of the ambiguity problem given the thesaurus, the resources, and the domain-specific text, along with a comparison against domain-general resources and text. Our primary user group for evaluation has been expert catalogers with specific expertise in painting, sculpture, and vernacular and landscape architecture.