A Novel Document Analysis Method Using Compressibility Vector

  • Authors:
  • Nuo Zhang;Toshinori Watanabe;Daisuke Matsuzaki;Hisashi Koga

  • Venue:
  • ISDPE '07 Proceedings of the First International Symposium on Data, Privacy, and E-Commerce
  • Year:
  • 2007

Abstract

Similarity analysis and keyword extraction are widely used techniques for analyzing relations between documents. These methods are based on dictionary-based morphological analysis. However, they cannot keep up when the Internet grows quickly and new words appear faster than dictionaries can be updated. In this study, we propose a new document relation analysis method based on the document's compressibility. The effectiveness of the proposed method is examined in simulations.
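
The abstract does not spell out how a compressibility vector is computed, so the following is only a minimal sketch of one plausible reading, not the authors' algorithm. It assumes that each vector component is the compression ratio of a document when a reference text is supplied to DEFLATE as a preset dictionary. The function names `compressed_size` and `compressibility_vector` and the sample texts are hypothetical, and Python's standard `zlib` stands in for whatever compressor the paper actually uses.

```python
import zlib


def compressed_size(data: bytes, reference: bytes) -> int:
    """Size in bytes of `data` after DEFLATE, primed with `reference`.

    Assumption: a zlib preset dictionary approximates "compressing the
    document with knowledge of the reference text".
    """
    comp = zlib.compressobj(level=9, zdict=reference)
    return len(comp.compress(data) + comp.flush())


def compressibility_vector(doc: str, references: list[str]) -> list[float]:
    """One compression ratio per reference text (lower means more similar)."""
    data = doc.encode("utf-8")
    if not data:
        raise ValueError("document must not be empty")
    return [
        compressed_size(data, ref.encode("utf-8")) / len(data)
        for ref in references
    ]


# Hypothetical sample texts, chosen only for illustration.
ref_sports = "the team scored a goal in the final match of the season and won the cup"
ref_cooking = "stir the flour and butter until the sauce thickens then add salt"
doc = "the team scored a goal in the final match of the season"

vector = compressibility_vector(doc, [ref_sports, ref_cooking])
print(vector)
```

Because no dictionary or morphological analysis is involved, a sketch like this works on raw bytes and is unaffected by newly coined words. Documents can then be compared by the distance between their vectors.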