Building automatically a business registration ontology

  • Authors:
  • Melania Degeratu;Vasileios Hatzivassiloglou

  • Affiliations:
  • Columbia University, New York, NY;Columbia University, New York, NY

  • Venue:
  • dg.o '02 Proceedings of the 2002 annual national conference on Digital government research
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We discuss a domain-independent, corpus based method for dictionary-less automatic extraction of ontological knowledge from domain-specific unannotated documents. We present the architecture, algorithms, and results for ONTOSTRUCT---a new system that uses machine learning and statistical techniques to analyze text sources, discover terms, link equivalent terms into concepts, learn both hierarchical and non-hierarchical conceptual relations, and build an extensive, semantically sound hierarchy of concepts. We report on ONTOSTRUCT's results in constructing a domain-specific ontology for the business registration domain, and evaluate the performance of two of its modules.