Automatically maintaining navigation sequences for querying semi-structured web sources
Data & Knowledge Engineering
Maintaining web navigation flows for wrappers
DEECS'06 Proceedings of the Second international conference on Data Engineering Issues in E-Commerce and Services
Hi-index | 0.00 |
In recent years, many research systems have been proposed to perform data extraction and automation tasks on Web sources. Since most of todayýs Web sources are "human-readable" but not "machine-readable", these systems must address a number of difficult challenges, such as dealing with complex navigation sequences, extracting data from HTML pages and reacting to source changes. Denodo Corporation has developed ITPilot, an industrial-strength solution that allows complex "wrappers" for Web sources to be graphically generated and automatically maintained. This paper presents the architecture and the basic ideas "behind the scenes" in ITPilot.