Classification based automatic information extraction system from free text

  • Authors:
  • Myat Myo Nwe Wai

  • Affiliations:
  • University of Computer Studies, Banmaw, Myanmar

  • Venue:
  • ACELAE'11 Proceedings of the 10th WSEAS international conference on communications, electrical & computer engineering, and 9th WSEAS international conference on Applied electromagnetics, wireless and optical communications
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The ever increasing on-line text information can be made available to automatic processing by information system. Several machine learning techniques have been applied in order to build automatic information that rivaling knowledge engineering approach. The past reporting information extraction system using machine learning techniques that expend the separate classifiers for each category of entities. In this paper, we introduce the new classification based information extraction system utilizing only one Random Forest classifier for all candidate entities to save the computational costs of algorithms. Our approach extends the original idea of Random Forest to deal with the data sparseness problem in information extraction engine. Experimental results of this system indicate that the proposed method can be a practical solution for building extraction system reaching an F-measure as high as 87.5%.