Various web applications in e-business, such as online price comparisons, competition monitoring and personalised newsletters, require the retrieval of distributed information from the Internet. This paper examines the suitability of software toolkits for extracting data from web sites. The term wrapper is defined and an overview of currently available toolkits for generating wrappers is provided. To give better insight into how such toolkits work, a detailed analysis of the non-commercial software program LAPIS is presented. An example application using this toolkit demonstrates how acceptable results can be achieved with relative ease. The functionality of the program is compared with that of the commercial toolkit RoboMaker and the differences are highlighted. With improved ease of use and faster wrapper generation in mind, possible areas for further development of toolkits for automated web data extraction are discussed.