Learning Rules for Conceptual Structure on the Web

Authors:
Hyoil Han;Ramez Elmasri
Affiliations:
College of Information Science and Technology, Drexel University; Department of Computer Science and Engineering, The University of Texas at Arlington. hhan@cis.drexel.edu
College of Information Science and Technology, Drexel University; Department of Computer Science and Engineering, The University of Texas at Arlington. elmasri@cse.uta.edu
Venue:
Journal of Intelligent Information Systems
Year:
2004

Abstract

This paper presents an infrastructure and methodology for extracting conceptual structure from Web pages, which consist mainly of HTML tags and fragmentary text. Human readers can easily scan a Web page and grasp the conceptual structure of the underlying data, but they lack the time and patience to handle very large amounts of data. For machines, in contrast, it is extremely difficult to determine the content of Web pages accurately, because they lack an understanding of context and semantics. Our work provides a methodology and infrastructure to process Web data and extract its underlying conceptual structure, in particular the relationships between ontological concepts, using Inductive Logic Programming. Capturing these conceptual structures helps automate the processing of the large amount of data on the Web.