Cerno: Light-weight tool support for semantic annotation of textual documents

  • Authors:
  • Nadzeya Kiyavitskaya;Nicola Zeni;James R. Cordy;Luisa Mich;John Mylopoulos

  • Affiliations:
  • Department of Information Engineering and Computer Science, University of Trento, Italy;Department of Computer and Management Sciences, University of Trento, Italy;School of Computing, Queens University, Kingston, ON, Canada;Department of Computer and Management Sciences, University of Trento, Italy;Department of Information Engineering and Computer Science, University of Trento, Italy and Department of Computer Science, University of Toronto, ON, Canada

  • Venue:
  • Data & Knowledge Engineering
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

Enrichment of text documents with semantic metadata reflecting their meaning facilitates document organization, indexing and retrieval. However, most web data remain unstructured because of the difficulty and the cost of manually annotating text. In this work, we present Cerno, a framework for semi-automatic semantic annotation of textual documents according to a domain-specific semantic model. The proposed framework is founded on light-weight techniques and tools intended for legacy code analysis and markup. To illustrate the feasibility of our proposal, we report experimental results of its application to two different domains. These results suggest that light-weight semi-automatic techniques for semantic annotation are feasible, require limited human effort for adaptation to a new domain, and demonstrate markup quality comparable with state-of-the-art methods.