A constraint-based tool for data integrity management on the web

  • Authors:
  • Masami Takahashi;Atsuyuki Morishima;Hiroyuki Kitagawa;Shigeo Sugimoto

  • Affiliations:
  • Univ. of Tsukuba, Kasuga, Tsukuba, Japan;Univ. of Tsukuba, Kasuga, Tsukuba, Japan;Univ. of Tsukuba, Tennohdai, Tsukuba, Japan;Univ. of Tsukuba, Kasuga, Tsukuba, Japan

  • Venue:
  • Proceedings of the 4th International Conference on Uniquitous Information Management and Communication
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Today, publishing information on Web sites is common. And the size of the Web contents that need to be managed is increasing. Therefore it is important to maintain content integrities on the Web. This paper proposes a system to maintain the content integrity of Web sites without backend databases. First, we explain the architecture of the proposed system. Second, we address the problem of finding integrity constraints used as the input to the system. We focus on inclusion dependencies among HTML/XML elements and discuss how to find inclusion relationships that can be used as hints to find inclusion dependencies. In particular, we propose to introduce weak inclusion relationships, which are inclusion relationships associated with inclusion ratios. Finally, we propose a filter-based approach to the efficient discovery of weak inclusion relationships and discuss some of its possible implementations.