A hierarchical approach to wrapper induction
Proceedings of the third annual conference on Autonomous Agents
PATRICIA—Practical Algorithm To Retrieve Information Coded in Alphanumeric
Journal of the ACM (JACM)
IEPAD: information extraction based on pattern discovery
Proceedings of the 10th international conference on World Wide Web
IEEE Transactions on Knowledge and Data Engineering
Engineering agent-mediated integration of bioinformatics analysis tools
Multiagent and Grid Systems - Multi-agent systems for medicine, computational biology, and bioinformatics
Adaptable wrapper generation for web page format change
ACOS'06 Proceedings of the 5th WSEAS international conference on Applied computer science
Configurable meta-search for integrating web public access catalogs
ICADL'05 Proceedings of the 8th international conference on Asian Digital Libraries: implementing strategies and sharing experiences
Hi-index | 0.00 |
The DeepSpot Agent Toolbox exploits online Web data sources using reconfigurable Web wrapper agents. These agents are rapidly generated and executed on the basis of the XML-based Web Navigation Description Language and extraction rule generator IEPAD (information extraction based on pattern discovery). A WNDL script describes how to locate, extract, and combine data. By executing different WNDL scripts, users can automate all types of Web browsing sessions. They also describe IEPAD, a data extractor based on pattern discovery techniques. IEPAD lets software agents automatically discover the extraction rules to extract the contents of a structurally formatted Web page without the need to label a Web page to train a wrapper. With this programming-by-example authoring tool, users can generate a complete Web wrapper agent by browsing the target Web sites. Various applications demonstrate this approach's feasibility.