Using Element and Document Profile for Information Clustering

  • Authors:
  • Jun Lai;Ben Soh

  • Affiliations:
  • -;-

  • Venue:
  • EEE '04 Proceedings of the 2004 IEEE International Conference on e-Technology, e-Commerce and e-Service (EEE'04)
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

The tremendous growth in the amount of informationavailable and the number of visitors to web sites in therecent years poses some key challenges for informationfiltering and retrieval. Web visitors not only expect highquality and relevant information, but also wish that theinformation be presented in an as efficient way aspossible. The traditional filtering methods, however,only consider the relevant values of document. Theseconventional methods fail to consider the efficiency ofdocuments retrieval. In this paper, we propose a newalgorithm to calculate an index called documentsimilarity score based on elements of the document.Using the index, document profile will be derived. Anydocuments with the similarity score above a giventhreshold will be clustered. Using these pre-clustereddocuments, information filtering and retrieval can bemade more efficient.