Description and performance analysis of signature file methods for office filing
ACM Transactions on Information Systems (TOIS)
Lecture notes in computer science on ICDT '88
STRUDEL: a Web site management system
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Foundations of Databases: The Logical Level
Adaptive algorithms for set containment joins
ACM Transactions on Database Systems (TODS)
Efficiently Computing Inclusion Dependencies for Schema Discovery
ICDEW '06 Proceedings of the 22nd International Conference on Data Engineering Workshops
Linkage of compound objects for supporting maintenance of large-scale web sites
Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication
Publishing information on Web sites is now common, and the volume of Web content that needs to be managed keeps growing. Maintaining content integrity on the Web is therefore important. This paper proposes a system that maintains the content integrity of Web sites without backend databases. First, we explain the architecture of the proposed system. Second, we address the problem of finding the integrity constraints that serve as input to the system. We focus on inclusion dependencies among HTML/XML elements and discuss how to find inclusion relationships that can serve as hints for discovering inclusion dependencies. In particular, we introduce weak inclusion relationships, which are inclusion relationships annotated with inclusion ratios. Finally, we propose a filter-based approach to discovering weak inclusion relationships efficiently and discuss some possible implementations.
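
To make the idea of a weak inclusion relationship and a filter step concrete, the following is a minimal sketch and not the implementation described in the paper. It assumes that the inclusion ratio of a value set A in a value set B is |A ∩ B| / |A|, which the abstract does not define, and it uses a simple bit-signature test as one possible filter. All names (inclusion_ratio, signature, ratio_upper_bound, weak_inclusions, SIG_BITS) are hypothetical.

```python
from itertools import permutations
from zlib import crc32

SIG_BITS = 64  # signature width; an arbitrary choice for this sketch


def inclusion_ratio(a, b):
    """Fraction of distinct values of a that also occur in b (assumed definition)."""
    a, b = set(a), set(b)
    return len(a & b) / len(a) if a else 1.0


def signature(values):
    """Bitmask with one bit set per distinct value, chosen by a stable hash."""
    sig = 0
    for v in set(values):
        sig |= 1 << (crc32(str(v).encode("utf-8")) % SIG_BITS)
    return sig


def ratio_upper_bound(a, sig_a, sig_b):
    """Cheap upper bound on inclusion_ratio(a, b) computed from signatures.

    Every bit set in sig_a but not in sig_b comes from at least one value
    of a that is absent from b, and distinct bits come from distinct values.
    """
    n = len(set(a))
    if n == 0:
        return 1.0
    missing = bin(sig_a & ~sig_b).count("1")
    return 1.0 - missing / n


def weak_inclusions(columns, threshold):
    """Return (x, y, ratio) for every ordered pair whose ratio meets the threshold."""
    sigs = {name: signature(vals) for name, vals in columns.items()}
    found = []
    for x, y in permutations(columns, 2):
        # Filter step: skip the exact comparison when the bound is too low.
        if ratio_upper_bound(columns[x], sigs[x], sigs[y]) < threshold:
            continue
        ratio = inclusion_ratio(columns[x], columns[y])
        if ratio >= threshold:
            found.append((x, y, ratio))
    return found


# Hypothetical value sets extracted from two groups of HTML/XML elements.
columns = {
    "nav_links": ["a", "b", "c", "d"],
    "page_ids": ["a", "b", "c", "x", "y"],
}
result = weak_inclusions(columns, threshold=0.7)
```

Because the signature test only ever overestimates the ratio, it can discard candidate pairs but never a pair that truly meets the threshold. In the example, three of the four nav_links values appear in page_ids, so that pair is reported with ratio 0.75, while the reverse direction (ratio 0.6) is not.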