Case-based reasoning
A sequential algorithm for training text classifiers
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Threading electronic mail: a preliminary study
Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
Selection of relevant features and examples in machine learning
Artificial Intelligence - Special issue on relevance
A rough set approach to attribute generalization in data mining
Information Sciences: an International Journal
Learning to extract symbolic knowledge from the World Wide Web
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
An Evaluation of Statistical Approaches to Text Categorization
Information Retrieval
Reduction algorithms based on discernibility matrix: the ordered attributes method
Journal of Computer Science and Technology
Rough set methods in feature selection and recognition
Pattern Recognition Letters - Special issue: Rough sets, pattern recognition and data mining
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Boolean Reasoning for Feature Extraction Problems
ISMIS '97 Proceedings of the 10th International Symposium on Foundations of Intelligent Systems
A Rough Set-Based Approach to Text Classification
RSFDGrC '99 Proceedings of the 7th International Workshop on New Directions in Rough Sets, Data Mining, and Granular-Soft Computing
Integrating feature and instance selection for text classification
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
A Rough Set-Based Hybrid Method to Text Categorization
WISE '01 Proceedings of the Second International Conference on Web Information Systems Engineering (WISE'01) Volume 1 - Volume 1
Foundations of Soft Case-Based Reasoning
Foundations of Soft Case-Based Reasoning
Applying rough sets to market timing decisions
Decision Support Systems - Special issue: Data mining for financial decision making
Rough-DBSCAN: A fast hybrid density based clustering method for large data sets
Pattern Recognition Letters
A case based reasoning approach on supplier selection in petroleum enterprises
Expert Systems with Applications: An International Journal
Case-based classifiers with fuzzy rough sets
RSKT'11 Proceedings of the 6th international conference on Rough sets and knowledge technology
Combining rough set and case based reasoning for process conditions selection in camshaft grinding
Journal of Intelligent Manufacturing
Hi-index | 0.00 |
This paper presents a novel rough set-based case-based reasoner for use in text categorization (TC). The reasoner has four main components: feature term extractor, document representor, case selector, and case retriever. It operates by first reducing the number of feature terms in the documents using the rough set technique. Then, the number of documents is reduced using a new document selection approach based on the case-based reasoning (CBR) concepts of coverage and reachability. As a result, both the number of feature terms and documents are reduced with only minimal loss of information. Finally, this smaller set of documents with fewer feature terms is used in TC. The proposed rough set-based case-based reasoner was tested on the Reuters21578 text datasets. The experimental results demonstrate its effectiveness and efficiency as it significantly reduced feature terms and documents, important for improving the efficiency of TC, while preserving and even improving classification accuracy.