Concept Analysis and Web Clustering using Combinatorial Topology

  • Authors:
  • Tsau Young (T. Y.) Lin; Albert Sutojo; Jean-David Hsu

  • Affiliations:
  • San Jose State University; San Jose State University; San Jose State University

  • Venue:
  • ICDMW '06 Proceedings of the Sixth IEEE International Conference on Data Mining - Workshops
  • Year:
  • 2006

Abstract

The collection of concepts discussed in a document set can be represented by a geometric structure from combinatorial topology called a simplicial complex. A simplex is a high-frequency keyword set whose keywords co-occur closely and which, we believe, carries a concept in the document set. The collection of all these simplexes forms the simplicial complex, which represents the structure of the concepts. The documents are clustered based on the topological structure of this complex. Several clustering schemes are presented. Our initial experiments, as expected, support the theory.
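
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify how close co-occurrence is measured or which clustering schemes are used, so the sketch assumes that keywords co-occur when they appear in the same document, that "high-frequency" means a support count of at least `min_support`, and that each document is assigned to the largest maximal simplex it contains. The function names and parameters (`frequent_simplexes`, `maximal_simplexes`, `cluster_documents`, `min_support`, `max_size`) are hypothetical.

```python
from itertools import combinations


def frequent_simplexes(docs, min_support, max_size=3):
    """Map each frequent keyword set (a simplex) to its document count.

    Assumption: keywords co-occur when they appear in the same document.
    Every face of a frequent keyword set is also frequent, so the result
    is closed under taking faces, as a simplicial complex requires.
    """
    doc_sets = [set(d) for d in docs]
    vocab = sorted(set().union(*doc_sets)) if doc_sets else []
    simplexes = {}
    current = []
    # Vertices: single keywords that appear in enough documents.
    for word in vocab:
        support = sum(1 for d in doc_sets if word in d)
        if support >= min_support:
            simplexes[frozenset([word])] = support
            current.append(frozenset([word]))
    size = 1
    # Grow keyword sets one keyword at a time (Apriori-style levels).
    while current and size < max_size:
        size += 1
        candidates = {a | b for a, b in combinations(current, 2)
                      if len(a | b) == size}
        current = []
        for cand in candidates:
            # Skip candidates that have a face which is not frequent.
            if not all(frozenset(face) in simplexes
                       for face in combinations(cand, size - 1)):
                continue
            support = sum(1 for d in doc_sets if cand <= d)
            if support >= min_support:
                simplexes[cand] = support
                current.append(cand)
    return simplexes


def maximal_simplexes(simplexes):
    """Keep only simplexes that are not a proper face of another simplex."""
    return [s for s in simplexes if not any(s < t for t in simplexes)]


def cluster_documents(docs, simplexes):
    """Group document indices by the largest maximal simplex each contains.

    Assumption: this assignment rule is one simple illustrative scheme.
    Documents that contain no maximal simplex are grouped under None.
    """
    tops = sorted(maximal_simplexes(simplexes),
                  key=lambda s: (-len(s), sorted(s)))
    clusters = {}
    for index, doc in enumerate(docs):
        words = set(doc)
        label = next((tuple(sorted(s)) for s in tops if s <= words), None)
        clusters.setdefault(label, []).append(index)
    return clusters
```

Each document is passed as a list of keywords, for example `cluster_documents(docs, frequent_simplexes(docs, min_support=2))`. Using maximal simplexes as cluster labels keeps a label such as "wall, street, stock" from being split into its lower-dimensional faces.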