A feature selection algorithm based on Poisson estimates

  • Authors:
  • Yingfan Gao;Hui-Iin Wang

  • Affiliations:
  • Institute of Scientific and Technical Information of China, Beijing, China;Institute of Scientific and Technical Information of China, Beijing, China

  • Venue:
  • FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
  • Year:
  • 2009

Abstract

Feature selection is one of the key technologies for text categorization. Current approaches fall into two main groups: statistics-based methods, which derive primarily from information theory, and semantics-based methods, which draw on natural language processing, the Semantic Web, and related areas. Based on the Poisson hypothesis, this article presents a new method that combines both approaches and tries to find the features in documents that carry more semantic information. Comparative experiments on the Reuters-21578 corpus against the IG, Chi2 and WN algorithms show that this method has advantages over these algorithms.
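
The abstract does not give the scoring formula, so the following is only an illustrative sketch of how a Poisson assumption can be turned into a feature score, not the authors' algorithm. It assumes that a term with no topical content is spread over documents like a Poisson process with rate cf/N (total occurrences divided by number of documents), so it should appear in about N(1 - e^(-cf/N)) documents. Terms that appear in noticeably fewer documents than that are clustered, which is often taken as a sign of content-bearing words. The function name `poisson_deviation_scores` and the toy documents are hypothetical.

```python
import math
from collections import Counter


def poisson_deviation_scores(docs):
    """Score terms by how far their document frequency falls below
    the value predicted under a Poisson assumption.

    docs: list of documents, each a list of tokens.
    Returns a dict mapping term -> score (higher = more clustered).
    This is an assumed, simplified scoring rule for illustration only.
    """
    n_docs = len(docs)
    cf = Counter()  # collection frequency: total occurrences of each term
    df = Counter()  # document frequency: number of documents containing it
    for tokens in docs:
        cf.update(tokens)
        df.update(set(tokens))

    scores = {}
    for term, total in cf.items():
        lam = total / n_docs  # Poisson rate per document
        expected_df = n_docs * (1.0 - math.exp(-lam))  # N * P(count >= 1)
        # Positive when the term occurs in fewer documents than predicted.
        scores[term] = (expected_df - df[term]) / n_docs
    return scores


if __name__ == "__main__":
    toy_docs = [
        ["oil", "oil", "oil", "the"],
        ["the", "price"],
        ["the"],
        ["the", "market"],
    ]
    ranked = sorted(poisson_deviation_scores(toy_docs).items(),
                    key=lambda kv: kv[1], reverse=True)
    for term, score in ranked:
        print(f"{term}\t{score:.3f}")
```

In this toy input, "oil" is concentrated in a single document and receives a positive score, while "the" appears in every document and receives a negative one. A real selector would keep the top-ranked terms and would probably combine this score with class information, as IG and Chi2 do.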