The WebBook and the Web Forager: an information workspace for the World-Wide Web
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Information archiving with bookmarks: personal Web space construction and organization
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Data mountain: using spatial memory for document management
Proceedings of the 11th annual ACM symposium on User interface software and technology
TopicShop: enhanced support for evaluating and organizing collections of Web sites
UIST '00 Proceedings of the 13th annual ACM symposium on User interface software and technology
A technique for computer detection and correction of spelling errors
Communications of the ACM
Hunter gatherer: interaction support for the creation and management of within-web-page collections
Proceedings of the 11th international conference on World Wide Web
Thresher: automating the unwrapping of semantic content from the World Wide Web
WWW '05 Proceedings of the 14th international conference on World Wide Web
Piggy Bank: Experience the Semantic Web inside your web browser
Web Semantics: Science, Services and Agents on the World Wide Web
Relations, cards, and search templates: user-guided web data integration and layout
Proceedings of the 20th annual ACM symposium on User interface software and technology
Data integration for the relational web
Proceedings of the VLDB Endowment
Building data warehouses with semantic data
Proceedings of the 2010 EDBT/ICDT Workshops
Integrating web feed opinions into a corporate data warehouse
Proceedings of the 2nd International Workshop on Business intelligencE and the WEB
Hi-index | 0.00 |
We present an approach to web content aggregation that allows information to be harvested from web pages, independent of specific markup languages. It builds on ideas from data warehousing and we present solutions to the well-known problems of data integration, namely detection of equivalences and data cleaning, adapted to this context. We describe how the content aggregation engine has been realised as an extensible framework in such a way that end-users as well as developers can use the associated tools to create personal libaries of content extracted from the web.