The magic of duplicates and aggregates
Proceedings of the sixteenth international conference on Very large databases
Towards tractable algebras for bags
PODS '93 Proceedings of the twelfth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
An investigation of documents from the World Wide Web
Proceedings of the fifth international World Wide Web conference on Computer networks and ISDN systems
Proceedings of the fifth international World Wide Web conference on Computer networks and ISDN systems
A query language and optimization techniques for unstructured data
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
STRUDEL: a Web site management system
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
A first course in database systems
A first course in database systems
Database techniques for the World-Wide Web: a survey
ACM SIGMOD Record
The Asilomar report on database research
ACM SIGMOD Record
DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
The power of languages for the manipulation of complex values
The VLDB Journal — The International Journal on Very Large Data Bases
Queries and Computation on the Web
ICDT '97 Proceedings of the 6th International Conference on Database Theory
WebOQL: Restructuring Documents, Databases, and Webs
ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
Algebraic Properties of Bag Data Types
VLDB '91 Proceedings of the 17th International Conference on Very Large Data Bases
W3QS: A Query System for the World-Wide Web
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Information Coupling in Web Databases
ER '98 Proceedings of the 17th International Conference on Conceptual Modeling
Structure-Based Queries over the World Wide Web
ER '98 Proceedings of the 17th International Conference on Conceptual Modeling
Web Warehousing: Design and Issues
ER '98 Proceedings of the Workshops on Data Warehousing and Data Mining: Advances in Database Technologies
Some Properties of Query Languages for Bags
DBLP-4 Proceedings of the Fourth International Workshop on Database Programming Languages - Object Models and Languages
Join Processing in Web Databases
DEXA '98 Proceedings of the 9th International Conference on Database and Expert Systems Applications
Web Warehousing: An Algebra for Web Information
ADL '98 Proceedings of the Advances in Digital Libraries Conference
WebDB: A Web Query System and Its Modeling, Language, and Implementation
ADL '98 Proceedings of the Advances in Digital Libraries Conference
A Declarative Language for Querying and Restructuring the Web
RIDE '96 Proceedings of the 6th International Workshop on Research Issues in Data Engineering (RIDE '96) Interoperability of Nontraditional Database Systems
Guest editorial: XML schema and data management
Data & Knowledge Engineering - Special issue: XML schema and data management
Hi-index | 0.01 |
Sets and bags are closely related structures. A bag is different from a set in that it is sensitive to the number of times an element occurs while a set is not. In this paper, we introduce the concept of web bag in the context of the web warehouse project called WHOWEDA (Warehouse of Web Data). Informally, a web bag is a web table which allows multiple occurrences of identical web tuples. We have used web bag to discover useful knowledge from a web table such as visible documents (or web sites), luminous documents and luminous paths. In this paper, we formally discuss the semantics and properties of web bags. We design formal algorithms for the construction of a web bag and its schema. In addition, we also provide formal algorithms for various types of knowledge discovery in a web warehouse using web bag and illustrate them with examples.