Knowledge discovery using Web bags in a Web warehouse

  • Authors:
  • Sourav B. Bhowmick;Sanjay M. Kumar;Wee Keong Ng;Ee Peng Lim

  • Affiliations:
  • Nanyang Technological Univ., Singapore;Purdue Univ., West Lafayette, IN;Nanyang Technological Univ., Singapore;Nanyang Technological Univ., Singapore

  • Venue:
  • Information organization and databases
  • Year:
  • 2000

Quantified Score

Hi-index 0.01

Visualization

Abstract

Sets and bags are closely related structures. A bag is different from a set in that it is sensitive to the number of times an element occurs while a set is not. In this paper, we introduce the concept of Web bag in a Web warehouse as a part of our the WHOWEDA project. Informally, a Web bag is a Web table which allows multiple occurrences of identical Web tuples. Web bag helps to discover useful knowledge from a Web table such as visible documents (or Web sites), luminous documents and luminous paths. We formally discuss the semantics and properties Web bags, and illustrate with examples applications of Web bag in knowledge discovery in a Web warehouse.