Conceptual clustering of documents for automatic ontology generation

  • Authors:
  • Reshmy Krishnan;Amir Hussain;Sherimon P.C.

  • Affiliations:
  • Department of Computing, Muscat College, Muscat, Sultanate of Oman;Department of Computing Science and Mathematics, University of Stirling, Scotland, UK;Faculty of Computer Studies, Arab Open University, Muscat, Sultanate of Oman

  • Venue:
  • BICS'13 Proceedings of the 6th international conference on Advances in Brain Inspired Cognitive Systems
  • Year:
  • 2013

Abstract

In information retrieval, keyword-based retrieval often fails to satisfy user needs because it cannot always retrieve results that are relevant to the intended concept. Because different words can represent the same concept (synonymy) and one word can represent different concepts (homonymy), mapping keywords to concepts raises the problem of word sense disambiguation. Concept-based information retrieval (IR) can be achieved by implementing a domain-dependent ontology. Since semantic concept extraction from keywords is the initial phase of automatic ontology construction, this paper proposes an effective method for it. The Reuters-21578 collection is used as the input to this process, followed by indexing, training, and clustering using a self-organizing map (SOM). Based on the feature vectors, documents are clustered using automatic concept selection in order to build the hierarchy. Clusters are represented hierarchically based on the topics assigned to them, and an ontology is generated automatically for each cluster based on its assigned topic.
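
The indexing, training, and clustering steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state the term weighting scheme, the map size, or the training schedule, so the smoothed TF-IDF weighting, the 2x2 grid, the linear decay, and all function names (tfidf_vectors, train_som, assign_clusters, top_terms) are assumptions chosen for illustration. The three toy documents stand in for Reuters-21578.

```python
import math
import random
from collections import Counter


def tfidf_vectors(docs):
    """Index documents as L2-normalised TF-IDF feature vectors (assumed weighting)."""
    tokenised = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenised for tok in toks})
    position = {tok: i for i, tok in enumerate(vocab)}
    doc_freq = Counter(tok for toks in tokenised for tok in set(toks))
    vectors = []
    for toks in tokenised:
        vec = [0.0] * len(vocab)
        for tok, count in Counter(toks).items():
            # Smoothed inverse document frequency; an illustrative choice.
            idf = math.log((1 + len(docs)) / (1 + doc_freq[tok])) + 1.0
            vec[position[tok]] = count * idf
        norm = math.sqrt(sum(v * v for v in vec)) or 1.0
        vectors.append([v / norm for v in vec])
    return vectors, vocab


def best_matching_unit(weights, x):
    """Return the index of the map unit whose weight vector is closest to x."""
    def dist2(w):
        return sum((wi - xi) ** 2 for wi, xi in zip(w, x))
    return min(range(len(weights)), key=lambda u: dist2(weights[u]))


def train_som(data, rows=2, cols=2, epochs=100, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small self-organizing map; grid size and schedule are hypothetical."""
    rng = random.Random(seed)
    dim = len(data[0])
    grid = [(r, c) for r in range(rows) for c in range(cols)]
    weights = [[rng.random() for _ in range(dim)] for _ in grid]
    for epoch in range(epochs):
        decay = 1.0 - epoch / epochs
        lr = lr0 * decay                 # learning rate shrinks linearly
        sigma = sigma0 * decay + 1e-3    # neighbourhood radius shrinks too
        for x in rng.sample(data, len(data)):  # visit documents in random order
            bmu = best_matching_unit(weights, x)
            for unit, (r, c) in enumerate(grid):
                grid_d2 = (r - grid[bmu][0]) ** 2 + (c - grid[bmu][1]) ** 2
                influence = math.exp(-grid_d2 / (2.0 * sigma ** 2))
                w = weights[unit]
                for i in range(dim):
                    # Pull the unit towards the document, scaled by grid proximity.
                    w[i] += lr * influence * (x[i] - w[i])
    return weights


def assign_clusters(data, weights):
    """Cluster label of each document = index of its best matching unit."""
    return [best_matching_unit(weights, x) for x in data]


def top_terms(weights, vocab, unit, k=3):
    """Highest-weighted vocabulary terms of a unit, usable as a candidate topic label."""
    ranked = sorted(range(len(vocab)), key=lambda i: weights[unit][i], reverse=True)
    return [vocab[i] for i in ranked[:k]]


if __name__ == "__main__":
    toy_docs = [
        "oil prices rise as crude supply falls",
        "crude oil supply falls and prices rise",
        "wheat harvest grows after heavy rain",
    ]
    vectors, vocab = tfidf_vectors(toy_docs)
    som = train_som(vectors)
    for doc, label in zip(toy_docs, assign_clusters(vectors, som)):
        print(label, top_terms(som, vocab, label), "<-", doc)
```

In this sketch each map unit acts as one cluster, and the strongest terms in a unit's weight vector serve as a rough stand-in for the topic assigned to that cluster. Building the hierarchy and generating an ontology per cluster, as the abstract describes, are outside what the sketch covers.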