An Enhanced Probabilistic Neural Network Approach Applied to Text Classification
CIARP '09 Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Techniques for categorization and clustering range from support vector machines and neural networks to Bayesian inference and algebraic methods. The k-Nearest Neighbor Algorithm (kNN) is a popular example of the last of these classes. Recently, a slight modification of it, the Multi-Label k-Nearest Neighbor Algorithm (ML-kNN), has been proposed to deal better with multi-label classification problems. In this paper we are interested in automatic text categorization, which is becoming more and more important as the amount of text in electronic format grows and access to it becomes more necessary and widespread. We propose a Probabilistic Neural Network Algorithm (PNN) tailored to also deal with multi-label classification problems, and compare it against the ML-kNN algorithm. Our implementation surpasses the ML-kNN algorithm in four metrics typically used in the literature for multi-label categorization problems.
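
The abstract does not describe the network's architecture, so the following is only a minimal sketch of how a generic PNN-style scorer could be adapted to multi-label text categorization. It is not the authors' implementation. It assumes documents are term-frequency vectors, uses a Gaussian kernel on unit-normalized vectors, and averages the kernel activations per label. The function names, the `sigma` smoothing parameter, and the decision threshold are all hypothetical choices made for illustration.

```python
import math


def normalize(vec):
    """Scale a vector to unit length (a zero vector is returned unchanged)."""
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else list(vec)


def pnn_label_scores(x, train_vectors, train_labels, sigma=0.5):
    """Score each label by the mean kernel activation of its training documents.

    Assumption: for unit vectors the Gaussian kernel exp(-||x - w||^2 / (2 sigma^2))
    reduces to exp((x . w - 1) / sigma^2), which is what is computed here.
    """
    x = normalize(x)
    sums, counts = {}, {}
    for w, labels in zip(train_vectors, train_labels):
        w = normalize(w)
        dot = sum(a * b for a, b in zip(x, w))
        activation = math.exp((dot - 1.0) / sigma ** 2)
        # A document with several labels contributes to each of them.
        for label in labels:
            sums[label] = sums.get(label, 0.0) + activation
            counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}


def predict_labels(scores, threshold=0.3):
    """Return every label whose score reaches the (hypothetical) threshold."""
    return sorted(label for label, s in scores.items() if s >= threshold)


# Toy data over the made-up vocabulary ["goal", "match", "vote", "tax"].
train_vectors = [[2, 1, 0, 0], [0, 0, 3, 1], [1, 0, 1, 1]]
train_labels = [{"sports"}, {"politics"}, {"sports", "politics"}]

scores = pnn_label_scores([1, 1, 0, 0], train_vectors, train_labels)
print(scores)
print(predict_labels(scores))
```

Because each label is scored independently and then thresholded, a document can receive zero, one, or several labels, which is the property that distinguishes the multi-label setting from single-label classification.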