Cost-Benefit Analysis of Web Bag in a Web Warehouse

  • Authors:
  • Sourav S. Bhowmick;Wee-Keong Ng;Ee-Peng Lim;Sanjay Madria

  • Affiliations:
  • -;-;-;-

  • Venue:
  • IDEAS '99 Proceedings of the 1999 International Symposium on Database Engineering & Applications
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

Sets and bags are closely related structures and have been studied in relational databases. A bag is different from a set in that it is sensitive to the number of times an element occurs while a set is not. In this paper, we introduce the concept of web bag in the context of a web warehouse called WHOWEDA (Warehouse Of Weda Data) which we are currently building. Informally, a web bag is a web table which allows multiple occurrences of identical web tuples.Web bag helps to discover useful knowledge from a web table such as visible documents (or web sites), luminous docu-ments and luminous paths. In this paper, we provide a cost-benefit analysis of materializing web bags as compared to web tables with distinct web tuples.