WikiOnto: A System for Semi-automatic Extraction and Modeling of Ontologies Using Wikipedia XML Corpus

Authors:
Lalindra Niranjan De Silva;Lakshman Jayaratne
Affiliations:
-;-
Venue:
ICSC '09 Proceedings of the 2009 IEEE International Conference on Semantic Computing
Year:
2009

Citing 0
Cited 3

Building ontological models from Arabic Wikipedia: a proposed hybrid approach

Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services
Using SOA governance design methodologies to augment enterprise service descriptions

CAiSE'11 Proceedings of the 23rd international conference on Advanced information systems engineering
Construction of Domain Ontologies: Sourcing the World Wide Web

International Journal of Intelligent Information Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper introduces WikiOnto: a system that assists in the extraction and modeling of topic ontologies in a semi-automatic manner using a preprocessed document corpus of one of the largest knowledge bases in the world - the Wikipedia. Based on the Wikipedia XML Corpus, we present a three-tiered framework for extracting topic ontologies in quick time and a modeling environment to refine these ontologies. Using Natural Language Processing (NLP) and other Machine Learning (ML) techniques along with a very rich document corpus, this system proposes a solution to a task that is generally considered extremely cumbersome. The initial results of the prototype suggest strong potential of the system to become highly successful in ontology extraction and modeling and also inspire further research on extracting ontologies from other semi-structured document corpora as well.