Constructing Web Corpora through Topical Web Partitioning for Term Recognition

  • Authors:
  • Wilson Wong;Wei Liu;Mohammed Bennamoun

  • Affiliations:
  • School of Computer Science and Software Engineering, University of Western Australia, Crawley, WA 6009;School of Computer Science and Software Engineering, University of Western Australia, Crawley, WA 6009;School of Computer Science and Software Engineering, University of Western Australia, Crawley, WA 6009

  • Venue:
  • AI '08 Proceedings of the 21st Australasian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

The need for on-demand discovery of very large, incremental text corpora for unrestricted range of domains for term recognition in ontology learning is becoming more and more pressing. In this paper, we introduce a new 3-phase web partitioning approach for automatically constructing web corpora to support term recognition. An evaluation of the web corpora constructed using our web partitioning approach demonstrated high precision in the context of term recognition, a result comparable to the use of manually-created local corpora.