Ontology creation: extraction of domain knowledge from web documents

Authors:
Veda C. Storey;Roger Chiang;G. Lily Chen
Affiliations:
Department of Computer Information Systems, J. Mack Robinson College of Business, Georgia State University, Atlanta, GA;Information Systems Department, College of Business, University of Cincinnati, Cincinnati, Ohio;Department of Computer Information Systems, J. Mack Robinson College of Business, Georgia State University, Atlanta, GA
Venue:
ER'05 Proceedings of the 24th international conference on Conceptual Modeling
Year:
2005

Citing 13
Cited 1

A translation approach to portable ontology specifications

Knowledge Acquisition - Special issue: Current issues in knowledge modeling
A linguistic ontology

International Journal of Human-Computer Studies - Special issue: the role of formal ontology in the information technology
The World-Wide Web: quagmire or gold mine?

Communications of the ACM
Learning Information Extraction Rules for Semi-Structured and Free Text

Machine Learning - Special issue on natural language learning
Relational learning of pattern-match rules for information extraction

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Conceptual-model-based data extraction from multiple-record Web pages

Data & Knowledge Engineering
Machine Learning for Information Extraction in Informal Domains

Machine Learning - Special issue on information retrieval
Web mining research: a survey

ACM SIGKDD Explorations Newsletter
A smart web query method for semantic retrieval of web data

Data & Knowledge Engineering
A brief survey of web data extraction tools

ACM SIGMOD Record
Dealing with Semantic Heterogeneity During Data Integration

ER '99 Proceedings of the 18th International Conference on Conceptual Modeling
Toward semantic understanding: an approach based on information extraction ontologies

ADC '04 Proceedings of the 15th Australasian database conference - Volume 27
Comparing Relationships in Conceptual Modeling: Mapping to Semantic Classifications

IEEE Transactions on Knowledge and Data Engineering

The role of domain ontologies in database design: An ontology management and conceptual modeling environment

ACM Transactions on Database Systems (TODS)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Considerable research has gone into developing ontologies and applying them to a variety of applications. The extraction of domain knowledge for developing these ontologies is often performed on a manual basis. The World Wide Web contains a wealth of knowledge about an application domain; however it is embedded within web pages. This research presents a methodology for semi-automatically extracting knowledge from the World Wide Web and organizing it into domain ontologies. Initial semantics of a target domain are provided by a set of keywords. From these, web pages are identified that contain relevant information for the subject domain using search engines. Web data extraction techniques are employed to extract information from these web pages and infer how the information is related. Extracted knowledge is then organized into a domain ontology. Testing of the methodology on various application domains illustrates the feasibility of the approach.