Do HTML Tags Flag Semantic Content?
IEEE Internet Computing
Hyperlink Analysis for the Web
IEEE Internet Computing
Mining the Web: Discovering Knowledge from HyperText Data
Mining the Web: Discovering Knowledge from HyperText Data
Information Retrieval on the World Wide Web
IEEE Internet Computing
Mining topic-specific concepts and definitions on the web
WWW '03 Proceedings of the 12th international conference on World Wide Web
Authoring Educational Topic Maps: Can We Make It Easier?
ICALT '05 Proceedings of the Fifth IEEE International Conference on Advanced Learning Technologies
CITOM: An incremental construction of multilingual topic maps
Data & Knowledge Engineering
Hi-index | 0.00 |
Topic maps are a Semantic Web technology that provides a human-oriented mechanism to encode knowledge by organizing web information around topics. Studies have shown, however, that authors face major difficulties in constructing topic maps. This paper discusses an approach to automatic construction of a "draft" topic map for the authors to start with. The idea is to extract topic map constructs by crawling a website and parsing its pages. We propose a set of heuristics that can be used for extracting semantic information from the HTML markup of the web pages. We have used this approach to design and implement a plug-in for the topic map editor TM4L that automatically extracts topics and relationships from a website specified by the author. An evaluation of the proposed approach in terms of Recall and Precision of the extracted data is presented.