Intelligent document classification

Authors:
Rafael A. Calvo;H. A. Ceccatto
Affiliations:
Instituto de F\''{\'i}sica Rosario (CONICET-UNR), 27 de Febrero 210bis, 2000 Rosario, Argentina. E-mail: rafa@ifir.edu.ar;Instituto de F\''{\'i}sica Rosario (CONICET-UNR), 27 de Febrero 210bis, 2000 Rosario, Argentina. E-mail: rafa@ifir.edu.ar
Venue:
Intelligent Data Analysis
Year:
2000

Citing 11
Cited 6

Automatic text processing: the transformation, analysis, and retrieval of information by computer

Automatic text processing: the transformation, analysis, and retrieval of information by computer
Introduction to statistical pattern recognition (2nd ed.)

Introduction to statistical pattern recognition (2nd ed.)
Introduction to the theory of neural computation

Introduction to the theory of neural computation
A comparison of classifiers and document representations for the routing problem

SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Feature selection, perceptron learning, and a usability case study for text categorization

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
A re-examination of text categorization methods

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
An Evaluation of Statistical Approaches to Text Categorization

Information Retrieval
Information Retrieval

Information Retrieval
Pattern Recognition and Neural Networks

Pattern Recognition and Neural Networks
CONSTRUE/TIS: A System for Content-Based Indexing of a Database of News Stories

IAAI '90 Proceedings of the The Second Conference on Innovative Applications of Artificial Intelligence
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning

Web page feature selection and classification using neural networks

Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Informatics and computer science intelligent systems applications
Scalable document classification

Intelligent Data Analysis
A study of local and global thresholding techniques in text categorization

AusDM '06 Proceedings of the fifth Australasian conference on Data mining and analystics - Volume 61
SVM and Collaborative Filtering-Based Prediction of User Preference for Digital Fashion Recommendation Systems

IEICE - Transactions on Information and Systems
A novel framework for web page classification using two-stage neural network

ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
Acquire job opportunities for chinese disabled persons based on improved text classification

ISNN'10 Proceedings of the 7th international conference on Advances in Neural Networks - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this work we investigate some technical questions related tothe application of neural networks in document classification.First, we discuss the effects of different averaging protocols forthe \chi ^{2} statistic used to remove non-informative terms. Thisis an especially relevant issue for the neural network technique,which requires an aggressive dimensionality reduction to befeasible. Second, we estimate the importance of performancefluctuations due to inherent randomness in the training process ofa neural network, a point not properly addressed in previous works.Finally, we compare the neural network results with those obtainedusing the best methods for this application. For this we optimizethe network architecture by evaluating much larger nets thanpreviously considered in similar studies in the literature.