Pattern detection from web using AFA set theory

  • Authors:
  • Ikumi Horie;Kazunori Yamaguchi;Kenji Kashiwabara

  • Affiliations:
  • Dokkyo University, Saitama, Japan;University of Tokyo, Tokyo, Japan;University of Tokyo, Tokyo, Japan

  • Venue:
  • Proceedings of the 9th annual ACM international workshop on Web information and data management
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

Recurring patterns of the same link structure can often be observed on Web sites. Patterns are important for site administrators in revising Web sites but are difficult to find empirically. We propose a method for detecting such patterns. The first step in the method is viewing a Web site as a directed graph, and identifying pages that have the same substructure by the Anti-Foundation Axiom (AFA). The AFA is a non-standard set theory that allows for a circular structure. The pages identified by AFA are divided into connected components. Then, meaningful sets of pages are selected as patterns by using the Galois lattice of the binary relation between the pages and the connected components. We apply our method to three actual Web sites and succeed in detecting patterns within the target sites.