Automated learning of decision rules for text categorization
ACM Transactions on Information Systems (TOIS)
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Data mining: practical machine learning tools and techniques with Java implementations
Data mining: practical machine learning tools and techniques with Java implementations
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Applying Cascaded Feature Selection to SVM Text Categorization
DEXA '02 Proceedings of the 13th International Workshop on Database and Expert Systems Applications
Semantic indexing using WordNet senses
RANLPIR '00 Proceedings of the ACL-2000 workshop on Recent advances in natural language processing and information retrieval: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 11
A WordNet-based algorithm for word sense disambiguation
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
An Iterative Hybrid Filter-Wrapper Approach to Feature Selection for Document Clustering
Canadian AI '09 Proceedings of the 22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence
A feature selection algorithm based on poisson estimates
FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 1
IEA/AIE'06 Proceedings of the 19th international conference on Advances in Applied Artificial Intelligence: industrial, Engineering and Other Applications of Applied Intelligent Systems
Feature annotation for text categorization
Proceedings of the CUBE International Information Technology Conference
International Journal of Web Engineering and Technology
Text Document Clustering with Hybrid Feature Selection
Proceedings of International Conference on Information Integration and Web-based Applications & Services
Hi-index | 0.00 |
The web has caused an explosion of documents, requiring the need for an automated text categorization system. This paper explores the notion of semantic feature selection by employing WordNet [Introduction to WordNet: An On-line Lexical Database], a lexical database. The proposed semantic approach employs noun synonyms and word senses for feature selection to select terms that are semantically representative of a category of documents. The categorical sense disambiguation extends the use of WordNet, which has been typically used for text retrieval and word sense disambiguation [A WordNet-based Algorithm for Word Sense Disambiguation]. Our experiments on the Reuters-21578 dataset have shown that automated semantic feature selection is able to perform better than well known statistical feature selection methods, Information Gain and Chi-Square as a feature selection method.