The magic of duplicates and aggregates
Proceedings of the sixteenth international conference on Very large databases
Towards tractable algebras for bags
PODS '93 Proceedings of the twelfth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
New techniques for studying set languages, bag languages and aggregate functions
PODS '94 Proceedings of the thirteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
An investigation of documents from the World Wide Web
Proceedings of the fifth international World Wide Web conference on Computer networks and ISDN systems
Proceedings of the fifth international World Wide Web conference on Computer networks and ISDN systems
A first course in database systems
A first course in database systems
Database techniques for the World-Wide Web: a survey
ACM SIGMOD Record
DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
Algebraic Properties of Bag Data Types
VLDB '91 Proceedings of the 17th International Conference on Very Large Data Bases
Information Coupling in Web Databases
ER '98 Proceedings of the 17th International Conference on Conceptual Modeling
Web Warehousing: Design and Issues
ER '98 Proceedings of the Workshops on Data Warehousing and Data Mining: Advances in Database Technologies
Some Properties of Query Languages for Bags
DBLP-4 Proceedings of the Fourth International Workshop on Database Programming Languages - Object Models and Languages
Calculi for Bags and their Complexity
DBLP-4 Proceedings of the Fourth International Workshop on Database Programming Languages - Object Models and Languages
Web Warehousing: An Algebra for Web Information
ADL '98 Proceedings of the Advances in Digital Libraries Conference
Cost-Benefit Analysis of Web Bag in a Web Warehouse
IDEAS '99 Proceedings of the 1999 International Symposium on Database Engineering & Applications
What can a web bag discover for you?
Data & Knowledge Engineering
Hi-index | 0.00 |
Sets and bags are closely related structures and have been studied in relational databases. A bag is different from a set in that it is sensitive to the number of times an element occurs while a set is not. In this paper, we introduce the concept of web bag in the context of a web warehouse called Whoweda (W areh ouse O f We da Da ta) which we are currently building. Informally, a web bag is a web table which allows multiple occurrences of identical web tuples . Web bag helps to discover useful knowledge from a web table such as visible documents (or web sites), luminous documents and luminous paths . In this paper, we perform a cost-benefit analysis with respect to storage, transmission and operational cost of web bags and discussed issues and implication of materializing web bags as opposed to web tables containing distinct web tuples. We have computed analytically the upper and lower bounds for the parameters which affect the cost of materializing web bags.