Ontology Construction Based on Latent Topic Extraction in a Digital Library

  • Authors:
  • Jian-Hua Yeh; Naomi Yang

  • Affiliations:
  • Department of Computer Science and Information Engineering, Aletheia University; Graduate Institute of Library and Information Studies, National Taiwan Normal University

  • Venue:
  • ICADL '08: Proceedings of the 11th International Conference on Asian Digital Libraries: Universal and Ubiquitous Access to Information
  • Year:
  • 2008

Abstract

This paper discusses automatic ontology construction in a digital library. Traditional automatic ontology construction uses hierarchical clustering to group similar terms, but the resulting hierarchy is usually hard for humans to interpret. A human-built knowledge network has strong semantic features, but building one is labor-intensive and becomes inconsistent at large scale. The method proposed in this paper combines statistical correction with latent topic extraction over the textual data in a digital library, and it produces a semantically oriented, OWL-based ontology. The experimental document collection is the Chinese Recorder, which served as a link between the various missions that were part of the rise and heyday of the Western effort to Christianize the Far East. The ontology construction process is described, and the final ontology is presented in OWL format.
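
To make the idea of an "OWL-based ontology" concrete, the following minimal sketch shows one way extracted topics and their terms could be serialized as OWL classes in RDF/XML. It is not the authors' implementation: the function name `topics_to_owl`, the `BASE` namespace, the sample labels, and the choice to model each term as a subclass of its topic are all assumptions made for illustration, and the topic extraction step itself is not shown.

```python
import xml.etree.ElementTree as ET

OWL = "http://www.w3.org/2002/07/owl#"
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
BASE = "http://example.org/ontology#"  # hypothetical namespace, not from the paper


def topics_to_owl(topics):
    """Serialize {topic label: [term labels]} as OWL classes in RDF/XML.

    Assumption: each term is modeled as a subclass of its topic.
    """
    ET.register_namespace("owl", OWL)
    ET.register_namespace("rdf", RDF)
    ET.register_namespace("rdfs", RDFS)

    root = ET.Element(f"{{{RDF}}}RDF")
    # Ontology header element.
    ET.SubElement(root, f"{{{OWL}}}Ontology",
                  {f"{{{RDF}}}about": BASE.rstrip("#")})

    for topic, terms in topics.items():
        # One class per latent topic.
        ET.SubElement(root, f"{{{OWL}}}Class",
                      {f"{{{RDF}}}about": BASE + topic})
        for term in terms:
            # One class per term, linked to its topic via rdfs:subClassOf.
            cls = ET.SubElement(root, f"{{{OWL}}}Class",
                                {f"{{{RDF}}}about": BASE + term})
            ET.SubElement(cls, f"{{{RDFS}}}subClassOf",
                          {f"{{{RDF}}}resource": BASE + topic})

    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical labels, chosen only to exercise the function.
    print(topics_to_owl({"Mission": ["School", "Hospital"]}))
```

The output can be opened in any OWL-aware editor; a real pipeline would replace the hand-written dictionary with the topic and term groups produced by the extraction step.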