Semi-automatic ontology extraction to create draft topic maps

  • Authors:
  • Steven Roberson;Darina Dicheva

  • Affiliations:
  • Winston-Salem State University, Winston-Salem, NC;Winston-Salem State University, Winston-Salem, NC

  • Venue:
  • ACM-SE 45 Proceedings of the 45th annual southeast regional conference
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Topic maps are a Semantic Web technology that provides a human-oriented mechanism to encode knowledge by organizing web information around topics. Studies have shown, however, that authors face major difficulties in constructing topic maps. This paper discusses an approach to automatic construction of a "draft" topic map for the authors to start with. The idea is to extract topic map constructs by crawling a website and parsing its pages. We propose a set of heuristics that can be used for extracting semantic information from the HTML markup of the web pages. We have used this approach to design and implement a plug-in for the topic map editor TM4L that automatically extracts topics and relationships from a website specified by the author. An evaluation of the proposed approach in terms of Recall and Precision of the extracted data is presented.