An investigation of documents from the World Wide Web
Proceedings of the fifth international World Wide Web conference on Computer networks and ISDN systems
Proceedings of the fifth international World Wide Web conference on Computer networks and ISDN systems
A query language and optimization techniques for unstructured data
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Database techniques for the World-Wide Web: a survey
ACM SIGMOD Record
DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
WebOQL: Restructuring Documents, Databases, and Webs
ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
W3QS: A Query System for the World-Wide Web
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Information Coupling in Web Databases
ER '98 Proceedings of the 17th International Conference on Conceptual Modeling
Web Warehousing: Design and Issues
ER '98 Proceedings of the Workshops on Data Warehousing and Data Mining: Advances in Database Technologies
Web Warehousing: An Algebra for Web Information
ADL '98 Proceedings of the Advances in Digital Libraries Conference
A Declarative Language for Querying and Restructuring the Web
RIDE '96 Proceedings of the 6th International Workshop on Research Issues in Data Engineering (RIDE '96) Interoperability of Nontraditional Database Systems
Knowledge discovery using Web bags in a Web warehouse
Information organization and databases
Hi-index | 0.00 |
Sets and bags are closely related structures and have been studied in relational databases. A bag is different from a set in that it is sensitive to the number of times an element occurs while a set is not. In this paper, we introduce the concept of web bag in the context of a web warehouse called WHOWEDA (Warehouse Of Weda Data) which we are currently building. Informally, a web bag is a web table which allows multiple occurrences of identical web tuples.Web bag helps to discover useful knowledge from a web table such as visible documents (or web sites), luminous docu-ments and luminous paths. In this paper, we provide a cost-benefit analysis of materializing web bags as compared to web tables with distinct web tuples.