Wrapper generation for semi-structured Internet sources
ACM SIGMOD Record
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Generating finite-state transducers for semi-structured data extraction from the Web
Information Systems - Special issue on semistructured data
Journal of Computer and System Sciences
Conceptual-model-based data extraction from multiple-record Web pages
Data & Knowledge Engineering
Building intelligent web applications using lightweight wrappers
Data & Knowledge Engineering - Special issue on heterogeneous information resources need semantic access
Monadic datalog and the expressive power of languages for web information extraction
Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Object Database Standard: ODMG-93, Release 1.2
Object Database Standard: ODMG-93, Release 1.2
A brief survey of web data extraction tools
ACM SIGMOD Record
A visual tool for building logical data models of websites
Proceedings of the 4th international workshop on Web information and data management
DEByE - Date extraction by example
Data & Knowledge Engineering
Hierarchical Wrapper Induction for Semistructured Information Sources
Autonomous Agents and Multi-Agent Systems
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Visual Web Information Extraction with Lixto
Proceedings of the 27th International Conference on Very Large Data Bases
RoadRunner: Towards Automatic Data Extraction from Large Web Sites
Proceedings of the 27th International Conference on Very Large Data Bases
An Information Concierge for the Web
DEXA '01 Proceedings of the 12th International Workshop on Database and Expert Systems Applications
Jedi: Extracting and Synthesizing Information from the Web
COOPIS '98 Proceedings of the 3rd IFCIS International Conference on Cooperative Information Systems
Wiccap Data Model: Mapping Physical Websites to Logical Views
ER '02 Proceedings of the 21st International Conference on Conceptual Modeling
XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Extracting structured data from Web pages
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Personalized Web Views for Multilingual Web Sources
IEEE Internet Computing
Clustering web pages based on their structure
Data & Knowledge Engineering - Special issue: WIDM 2003
A web content manipulation technique based on page Fragmentation
Journal of Network and Computer Applications
Hi-index | 0.00 |
Information presented in a Website is usually organized into certain logical structure that is intuitive to users. It would be useful to model websites with such logical structure so that extraction of Web data from these sites can be performed in a simple and efficient manner. However, the recognition and reconstruction of such logical structure by software agent is not straightforward due to the complex hyper-link structure among webpages and the HTML formatting within each webpage. In this paper, we propose the WICCAP Data Model, a data model that maps websites from their physical structure into commonly perceived logical views. To enable easy and rapid creation of such data models, we have implemented a visual tool, called the Mapping Wizard, to facilitate and automate the process of producing WICCAP Data Models. Using the tool, the time required to construct a logical representation for a given Website is significantly reduced.