Scalable semantic annotation of text using lexical and web resources

  • Authors:
  • Elias Zavitsanos;George Tsatsaronis;Iraklis Varlamis;Georgios Paliouras

  • Affiliations:
  • Institute of Informatics & Telecommunications, NCSR “Demokritos”;Department of Computer and Information Science, Norwegian University of Science and Technology;Department of Informatics and Telematics, Harokopio University;Institute of Informatics & Telecommunications, NCSR “Demokritos”

  • Venue:
  • SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
  • Year:
  • 2010

Abstract

This paper addresses the task of adding domain-specific semantic tags to a document, based solely on the domain ontology and generic lexical and Web resources. In this manner, we avoid the need for trained domain-specific lexical resources, which hinder the scalability of semantic annotation. More specifically, the proposed method maps the content of the document to concepts of the ontology, using the WordNet lexicon and Wikipedia. The method comprises a novel combination of measures of semantic relatedness and word sense disambiguation techniques to identify the ontology concepts most related to the document. We test the method on two case studies: (a) a set of summaries accompanying environmental news videos, and (b) a set of medical abstracts. The results in both cases show that the proposed method achieves reasonable performance, thus pointing to a promising path for scalable semantic annotation of documents.
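
The mapping step described in the abstract, scoring ontology concepts by their semantic relatedness to the document's terms, might be sketched roughly as follows. This is only an illustration and not the authors' method: the `TOY_RELATEDNESS` table, its scores, and the functions `relatedness` and `rank_concepts` are all hypothetical, the hand-written table stands in for the WordNet- and Wikipedia-based measures, and word sense disambiguation is omitted entirely.

```python
# Hypothetical toy relatedness scores in [0, 1]. They stand in for the
# WordNet- and Wikipedia-based measures; the values are invented.
TOY_RELATEDNESS = {
    ("river", "water"): 0.9,
    ("pollution", "water"): 0.5,
    ("pollution", "emission"): 0.7,
    ("factory", "emission"): 0.6,
}


def relatedness(a: str, b: str) -> float:
    """Symmetric lookup: identical terms score 1.0, unknown pairs 0.0."""
    if a == b:
        return 1.0
    return TOY_RELATEDNESS.get((a, b), TOY_RELATEDNESS.get((b, a), 0.0))


def rank_concepts(doc_terms, concepts):
    """Rank ontology concepts by mean relatedness to the document terms."""
    scores = {}
    for concept in concepts:
        total = sum(relatedness(term, concept) for term in doc_terms)
        scores[concept] = total / len(doc_terms) if doc_terms else 0.0
    # Highest score first; ties are broken alphabetically by concept label.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Assumed inputs: terms extracted from a document and ontology concept labels.
doc = ["river", "pollution", "factory"]
ontology = ["water", "emission", "energy"]
ranking = rank_concepts(doc, ontology)

for concept, score in ranking:
    print(f"{concept}: {score:.2f}")
```

In a sketch like this, the top-ranked concepts would serve as candidate semantic tags for the document. A realistic version would replace the lookup table with lexicon-based relatedness measures and disambiguate word senses before scoring.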