A differential LSI method for document classification

  • Authors:
  • Liang Chen;Naoyuki Tokuda;Akira Nagai

  • Affiliations:
  • University of Northern British Columbia, Prince George, BC, Canada;R & D Center, Sunflare Company, Tokyo, Japan;Utsunomiya University, Utsunomiya, Tochigi, Japan

  • Venue:
  • AsianIR '03 Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11
  • Year:
  • 2003

Abstract

We have developed an effective probabilistic classifier for document classification by introducing the concept of differential document vectors and DLSI (differential latent semantics index) spaces. A simple a posteriori calculation using the intra- and extra-document statistics demonstrates the advantage of the DLSI space-based probabilistic classifier over the widely used LSI space-based classifier in classification performance.