Correntropy-Based document clustering via nonnegative matrix factorization

  • Authors:
  • Tolga Ensari;Jan Chorowski;Jacek M. Zurada

  • Affiliations:
  • Computer Engineering Department, Istanbul University, Istanbul, Turkey;Electrical and Computer Engineering, University of Louisville, Louisville;Electrical and Computer Engineering, University of Louisville, Louisville

  • Venue:
  • ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part II
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Nonnegative Matrix Factorization (NMF) is one of the popular techniques to reduce the number of attributes of the data. It has been also widely used for clustering. Several types of the objective functions have been used for NMF in the literature. In this paper, we propose to maximize the correntropy similarity measure to produce the factorization itself. Correntropy is an entropy-based criterion defined as a nonlinear similarity measure. Following the discussion of minimization of the correntropy function, we use it to cluster document data set and compare its clustering performance with the Euclidean Distance (EucD)-based NMF. The comparison is illustrated with 20-Newsgroups data set. The results show that our approach has better clustering compared with other methods which use EucD as an objective function.