Can we use linked data semantic annotators for the extraction of domain-relevant expressions?

  • Authors:
  • Michel Gagnon;Amal Zouaq;Ludovic Jean-Louis

  • Affiliations:
  • Ecole Polytechnique de Montréal, Montréal, PQ, Canada;Royal Military College of Canada, Kingston, ON, Canada;Ecole Polytechnique de Montréal, Montréal, PQ, Canada

  • Venue:
  • Proceedings of the 22nd international conference on World Wide Web companion
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Semantic annotation is the process of identifying expressions in texts and linking them to some semantic structure. In particular, Linked data-based Semantic Annotators are now becoming the new Holy Grail for meaning extraction from unstructured documents. This paper presents an evaluation of the main linked data-based annotators available with a focus on domain topics and named entities. In particular, we compare the ability of each tool to annotate relevant domain expressions in text. The paper also proposes a combination of annotators through voting methods and machine learning. Our results show that some linked-data annotators, especially Alchemy, can be considered as a useful resource for topic extraction. They also show that a substantial increase in recall can be achieved by combining the annotators with a weighted voting scheme. Finally, an interesting result is that by removing Alchemy from the combination, or by combining only the more precise annotators, we get a significant increase in precision, at the cost of a lower recall.