Constructing domain ontology using structural and semantic characteristics of web-table head

  • Authors:
  • Sung-won Jung;Mi-young Kang;Hyuk-chul Kwon

  • Affiliations:
  • Pusan National University, Korean Language Processing Laboratory, Department of Computer Science Engineering and Pusan National University, Center for U-Port IT Research and Education, Busan, Kore ...;Pusan National University, Korean Language Processing Laboratory, Department of Computer Science Engineering;Pusan National University, Korean Language Processing Laboratory, Department of Computer Science Engineering

  • Venue:
  • IEA/AIE'07 Proceedings of the 20th international conference on Industrial, engineering, and other applications of applied intelligent systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This study concerns the constructing of domain ontology from web tables in a specific domain. Ontology defines the common terms and their meaning (concepts) within a context. Thus only meaningful tables are our concern. The meaningful table is composed of a head and a body, which are formatted in rows and columns. The head abstracts the meaning expressed in the body. Thus, in order to obtain a table-information-extraction framework, this study extracts, as prerequisite work, the structural semantic, that is, the domain ontology that frames web-table information, from the head. We suggest a method for automatically extracting domain ontology using the structural and semantic characteristics of the web-table head. The construction of domain ontology proceeds through two steps: (a) extracting table schema as pseudoontology from each table from the same domain and (b) constructing domain ontology combining those extracted table schemata. The combination of schemata proceeds through splitting and clustering using (a) statistical information and (b) heuristics based on the structural and semantic characteristics of the web-table head.