Knowledge-Based Linguistic Annotation of Digital Cultural Heritage Collections

  • Authors: Tuukka Ruotsalo; Lora Aroyo; Guus Schreiber

  • Affiliations: Helsinki University of Technology; Vrije Universiteit Amsterdam; Vrije Universiteit Amsterdam

  • Venue: IEEE Intelligent Systems
  • Year: 2009

Abstract

The authors present a method for automatic annotation of objects in digital cultural heritage collections. Given a set of objects, each accompanied by a text description, together with a set of structured vocabularies, a metadata schema, and a training set of annotations of the text descriptions, the method produces annotations for the objects. Each annotation consists of a structured vocabulary concept or named entity (for example, Paris as a city) and the metadata schema role that the concept plays in the annotation (for example, Paris as a subject matter). The method focuses on identifying the metadata schema roles. The authors evaluated the method on the ARIA collection from Rijksmuseum Amsterdam, using four structured vocabularies, an artwork annotation schema, and a collection of natural language descriptions of artworks. The method achieved 61.2 percent accuracy in role identification, outperforming the baseline method without background knowledge (p
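
To make the inputs and outputs described in the abstract concrete, the following is a minimal Python sketch of the task setup only. It is not the authors' method: the `Annotation` class, the role names ("subject", "creator"), the toy vocabulary, and the majority-role lookup are all assumptions introduced for illustration. The sketch matches vocabulary labels in a description and assigns each concept the role most often seen for its type in a small training set.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    concept: str       # vocabulary concept or named entity, e.g. "Paris"
    concept_type: str  # type taken from the vocabulary, e.g. "city"
    role: str          # metadata schema role, e.g. "subject" (hypothetical name)


def train_role_counts(training):
    """Count how often each concept type fills each role in the training set."""
    counts = defaultdict(Counter)
    for ann in training:
        counts[ann.concept_type][ann.role] += 1
    return counts


def annotate(description, vocabulary, role_counts, default_role="subject"):
    """Find vocabulary labels in the text and assign each the most frequent
    training role for its concept type (an illustrative heuristic only)."""
    text = description.lower()
    found = []
    for label, concept_type in vocabulary.items():
        if label.lower() in text:
            roles = role_counts.get(concept_type)
            role = roles.most_common(1)[0][0] if roles else default_role
            found.append(Annotation(label, concept_type, role))
    return found


# Toy data, invented for this example.
vocabulary = {"Paris": "city", "Rembrandt": "person"}
training = [
    Annotation("Amsterdam", "city", "subject"),
    Annotation("Delft", "city", "subject"),
    Annotation("Vermeer", "person", "creator"),
]
role_counts = train_role_counts(training)
result = annotate("A view of Paris, in the manner of Rembrandt.", vocabulary, role_counts)
for ann in result:
    print(ann.concept, ann.concept_type, ann.role)
```

A lookup like this ignores the sentence context entirely, which is exactly where role identification becomes hard: the same city can be a subject matter in one description and a place of creation in another.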