Subspace mapping of noisy text documents

  • Authors:
  • Axel J. Soto;Marc Strickert;Gustavo E. Vazquez;Evangelos Milios

  • Affiliations:
  • Faculty of Computer Science, Dalhousie University, Canada;Institute for Vision and Graphics, Siegen University, Germany;Dept. Computer Science, Univ. Nacional del Sur, Argentina;Faculty of Computer Science, Dalhousie University, Canada

  • Venue:
  • Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Subspace mapping methods aim at projecting high-dimensional data into a subspace where a specific objective function is optimized. Such dimension reduction allows the removal of collinear and irrelevant variables for creating informative visualizations and task-related data spaces. These specific and generally de-noised subspaces spaces enable machine learning methods to work more efficiently. We present a new and general subspace mapping method, Correlative Matrix Mapping (CMM), and evaluate its abilities for category-driven text organization by assessing neighborhood preservation, class coherence, and classification. This approach is evaluated for the challenging task of processing short and noisy documents.