Feature Reduction for Neural Network Based Text Categorization

Authors:
Savio L. Y. Lam; Dik Lun Lee
Venue:
DASFAA '99 Proceedings of the Sixth International Conference on Database Systems for Advanced Applications
Year:
1999

Abstract

In a text categorization model that uses an artificial neural network as the text classifier, scalability is poor if the network is trained on the raw feature space, since textual data has a very high-dimensional feature space. We proposed and compared four dimensionality reduction techniques that reduce the feature space to an input space of much lower dimension for the neural network classifier. To test the effectiveness of the proposed model, experiments were conducted using a subset of the Reuters-22173 test collection for text categorization. The results showed that the proposed model achieved high categorization effectiveness as measured by precision and recall. Among the four dimensionality reduction techniques proposed, Principal Component Analysis was found to be the most effective in reducing the dimensionality of the feature space.
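
As a rough illustration of the kind of reduction the abstract describes, the following minimal sketch projects a document-by-term weight matrix onto its top principal components before it would be fed to a classifier. It is not the authors' implementation: the function name `pca_reduce`, the toy random data, and the choice of `k=3` are assumptions made for illustration only, and the abstract does not specify how the paper computes the components.

```python
import numpy as np

def pca_reduce(X, k):
    """Project the rows of X (documents x terms) onto the top-k principal components."""
    mean = X.mean(axis=0)
    Xc = X - mean  # center each term column
    # SVD of the centered matrix: the rows of Vt are the principal directions,
    # ordered by decreasing singular value.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:k]
    return Xc @ components.T, components, mean

# Hypothetical toy data: 6 documents, 8 term weights each (not from the paper).
rng = np.random.default_rng(0)
X = rng.random((6, 8))

Z, components, mean = pca_reduce(X, k=3)
print(Z.shape)  # each document is now described by 3 features instead of 8
```

The reduced matrix `Z` would take the place of the raw term vectors as classifier input; a new document would be centered with the same `mean` and projected with the same `components`.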