Automatic categorization is the only viable method to deal with the scaling problem of the World Wide Web (WWW). In this paper, we propose a news web page classification method (WPCM). The WPCM uses a neural network whose inputs are obtained from both principal components and class profile-based features. Each news web page is represented using a term-weighting scheme. Because the number of unique words in the collection is large, principal component analysis (PCA) is used to select the most relevant features for classification. The output of the PCA is then combined with feature vectors from the class profile, which contains the most regular words in each class. We manually selected the most regular words in each class and weighted them using an entropy weighting scheme. A fixed number of regular words from each class is used as a feature vector together with the reduced principal components from the PCA. These feature vectors are then used as the input to the neural network for classification. The experimental evaluation demonstrates that the WPCM provides acceptable classification accuracy on the sports news datasets.
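The feature construction described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give formulas, so TF-IDF stands in for the term-weighting scheme, the standard log-entropy formula stands in for the entropy weighting, PCA is approximated by power iteration, and the toy documents, the `profile_terms` list, and all function names are hypothetical. The neural network classifier that would consume these vectors is omitted.

```python
import math
import random
from collections import Counter


def tfidf_matrix(docs, vocab):
    """Term-weighted document vectors: raw term frequency times log(N / df)."""
    counts = [Counter(doc.split()) for doc in docs]
    n_docs = len(docs)
    df = [sum(1 for c in counts if c[t] > 0) for t in vocab]
    return [[c[t] * math.log(n_docs / df[i]) if df[i] else 0.0
             for i, t in enumerate(vocab)] for c in counts]


def _matvec(matrix, vec):
    return [sum(m * x for m, x in zip(row, vec)) for row in matrix]


def pca_project(X, k, iters=200, seed=0):
    """Project centered rows of X onto the top-k principal directions,
    found by power iteration with deflation on the covariance matrix."""
    n, d = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(d)]
    Xc = [[row[j] - means[j] for j in range(d)] for row in X]
    cov = [[sum(r[i] * r[j] for r in Xc) / (n - 1) for j in range(d)]
           for i in range(d)]
    rng = random.Random(seed)
    components = []
    for _ in range(k):
        v = [rng.random() for _ in range(d)]
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]
        for _ in range(iters):
            w = _matvec(cov, v)
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:  # no variance left in the deflated matrix
                break
            v = [x / norm for x in w]
        eigenvalue = sum(a * b for a, b in zip(v, _matvec(cov, v)))
        components.append(v)
        # Deflate: remove the direction just found before the next pass.
        cov = [[cov[i][j] - eigenvalue * v[i] * v[j] for j in range(d)]
               for i in range(d)]
    return [[sum(a * b for a, b in zip(row, comp)) for comp in components]
            for row in Xc]


def entropy_weights(docs, profile_terms):
    """Log-entropy weights for hand-picked class-profile terms.
    Assumes at least two documents so that log(N) is non-zero."""
    counts = [Counter(doc.split()) for doc in docs]
    log_n = math.log(len(docs))
    feats = [[0.0] * len(profile_terms) for _ in docs]
    for k, term in enumerate(profile_terms):
        total = sum(c[term] for c in counts)
        if total == 0:
            continue
        # Global weight: 1 + sum_j p_j * log(p_j) / log(N)
        g = 1.0 + sum((c[term] / total) * math.log(c[term] / total)
                      for c in counts if c[term] > 0) / log_n
        for j, c in enumerate(counts):
            if c[term] > 0:
                feats[j][k] = (1.0 + math.log(c[term])) * g
    return feats


def build_features(docs, vocab, profile_terms, k):
    """Concatenate reduced principal components with class-profile features."""
    reduced = pca_project(tfidf_matrix(docs, vocab), k)
    profile = entropy_weights(docs, profile_terms)
    return [r + p for r, p in zip(reduced, profile)]


# Toy sports-news documents (made up for illustration only).
docs = [
    "football goal match goal",
    "tennis serve match",
    "football league goal",
    "tennis court serve serve",
]
vocab = sorted({token for doc in docs for token in doc.split()})
profile_terms = ["goal", "serve"]  # hypothetical manually selected words
features = build_features(docs, vocab, profile_terms, k=2)
```

Each row of `features` holds two principal-component scores followed by two class-profile weights, which is the kind of combined vector the abstract describes as the neural network input. In practice a library PCA and a much larger vocabulary would replace the pure-Python routines used here.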