Learning Rules for Conceptual Structure on the Web

  • Authors:
  • Hyoil Han;Ramez Elmasri

  • Affiliations:
  • Colledge of Information Science and Technology, Drexel Univeristy&semi/ Department of Computer Science and Engineering, The University of Texas at Arlington. hhan@cis.drexel.edu;Colledge of Information Science and Technology, Drexel Univeristy&semi/ Department of Computer Science and Engineering, The University of Texas at Arlington. elmasri@cse.uta.edu

  • Venue:
  • Journal of Intelligent Information Systems
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents an infrastructure and methodology to extract conceptual structure from Web pages, which are mainly constructed by HTML tags and incomplete text. Human beings can easily read Web pages and grasp an idea about the conceptual structure of underlying data, but cannot handle excessive amounts of data due to lack of patience and time. However, it is extremely difficult for machines to accurately determine the content of Web pages due to lack of understanding of context and semantics. Our work provides a methodology and infrastructure to process Web data and extract the underlying conceptual structure, in particular relationships between ontological concepts using Inductive Logic Programming in order to help with automating the processing of the excessive amount of Web data by capturing its conceptual structures.