Clustering of document collection - A weighting approach

  • Authors:
  • Ramiz M. Aliguliyev

  • Affiliations:
  • Institute of Information Technology of National Academy of Sciences of Azerbaijan, 9, F. Agayev Street, AZ1141 Baku, Azerbaijan

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2009

Quantified Score

Hi-index 12.06

Visualization

Abstract

Clustering algorithms are used to assess the interaction among documents by organizing documents into clusters such that document within a cluster are more similar to each other than are documents belonging to different clusters. Document clustering has been traditionally investigated as a means of improving the performance of search engines by pre-clustering the entire corpus, and a post-retrieval document browsing technique as well. It has long been studied as a post-retrieval document visualization technique. The purpose of present paper to show that assignment weight to documents improves clustering solution.