Automatic domain-ontology structure and example acquisition from semi-structured texts

  • Authors:
  • Cheng Xiao; Dequan Zheng; Yuhang Yang

  • Affiliations:
  • MOE-MS Key Laboratory of Natural Language Processing and Speech, Harbin Institute of Technology, Harbin (all authors)

  • Venue:
  • FSKD'09: Proceedings of the 6th International Conference on Fuzzy Systems and Knowledge Discovery, Volume 7
  • Year:
  • 2009

Abstract

This paper presents a new method for acquiring domain-ontology structure and examples from semi-structured data sources. First, the domain-ontology structure is extracted: candidate attributes are extracted using certain patterns, and a statistical method filters out the incorrect ones. Second, with the domain-ontology structure as a clue, example-extraction patterns are generated automatically. Finally, ontology examples are acquired by exploiting the special structural features of Web pages. Experiments were carried out in the film domain: the precision of ontology structure extraction is 83.7%, and the highest recall of example extraction reaches 90%. The experimental results demonstrate that the method is fairly efficient.
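
The abstract does not specify the patterns or the statistical filter, so the following is only a minimal sketch of the three-step pipeline under stated assumptions, not the authors' implementation. It assumes that attributes appear as "Label: value" lines, that the filter is a simple page-frequency threshold, and that example patterns are regular expressions built from the retained attribute names. The names `LABEL_PATTERN`, `candidate_attributes`, `filter_attributes`, `extract_examples`, and `min_ratio` are all hypothetical.

```python
import re
from collections import Counter

# Assumption: an attribute looks like a short label at the start of a line,
# followed by a colon and a value, e.g. "Director: ...".
LABEL_PATTERN = re.compile(
    r"^[ \t]*([A-Za-z][A-Za-z ]{0,30}?)[ \t]*:[ \t]*\S", re.MULTILINE
)


def candidate_attributes(page):
    """Step 1a: collect candidate attribute names from one page."""
    return {m.group(1).strip().lower() for m in LABEL_PATTERN.finditer(page)}


def filter_attributes(pages, min_ratio=0.5):
    """Step 1b: keep candidates that occur on at least min_ratio of the pages.

    Assumption: page frequency stands in for the paper's statistical filter.
    """
    counts = Counter()
    for page in pages:
        counts.update(candidate_attributes(page))
    threshold = min_ratio * len(pages)
    return sorted(attr for attr, n in counts.items() if n >= threshold)


def extract_examples(page, attributes):
    """Steps 2 and 3: build one pattern per attribute and pull its value."""
    examples = {}
    for attr in attributes:
        pattern = re.compile(
            r"^[ \t]*" + re.escape(attr) + r"[ \t]*:[ \t]*(.+?)[ \t]*$",
            re.IGNORECASE | re.MULTILINE,
        )
        match = pattern.search(page)
        if match:
            examples[attr] = match.group(1)
    return examples
```

Run on a handful of made-up film pages, `filter_attributes` would return the labels shared by most pages as the ontology structure, and `extract_examples` would then fill those attributes in for each page. Labels that appear on only a few pages are dropped as likely noise.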