A query language and optimization techniques for unstructured data
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
A query language for a Web-site management system
ACM SIGMOD Record
Database techniques for the World-Wide Web: a survey
ACM SIGMOD Record
Modeling Web sources for information integration
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Hierarchical Wrapper Induction for Semistructured Information Sources
Autonomous Agents and Multi-Agent Systems
Object Exchange Across Heterogeneous Information Sources
ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
Integration of Semistructured Data with Partial and Inconsistent Information
IDEAS '99 Proceedings of the 1999 International Symposium on Database Engineering & Applications
A Conceptual Model and Rule-Based Query Language for HTML
World Wide Web
A Rule-Based Conversion of a DTD to a Conceptual Schema
ER '01 Proceedings of the 20th International Conference on Conceptual Modeling: Conceptual Modeling
Informatica
Hi-index | 0.00 |
Most documents available over the web conform to the HTML specification. Such documents are hierarchically structured in nature. The existing graph-based or tree-based data models for the web only provide a very low level representation of such hierarchical structure. In this paper, we introduce a conceptual model for the web that is able to represent the complex hierarchical structure within the web documents at a high level that is close to human conceptualization/visualization of the documents. We also describe how to convert HTML documents based on this conceptual model. Using the conceptual model and conversion method, we can capture the essence (i.e., semistructure) of HTML documents in a natural and simple way.