Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems
Automated learning of decision rules for text categorization
ACM Transactions on Information Systems (TOIS)
From rough set theory to evidence theory
Advances in the Dempster-Shafer theory of evidence
A decision-theoretic generalization of on-line learning and an application to boosting
Journal of Computer and System Sciences - Special issue: 26th annual ACM symposium on the theory of computing & STOC'94, May 23–25, 1994, and second annual Europe an conference on computational learning theory (EuroCOLT'95), March 13–15, 1995
IEEE Transactions on Pattern Analysis and Machine Intelligence
Interpretations of belief functions in the theory of rough sets
Information Sciences: an International Journal - From rough sets to soft computing
Distributional clustering of words for text classification
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Rough computational methods for information systems
Artificial Intelligence
A simple, fast, and effective rule learner
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Machine learning and data mining
Communications of the ACM
BoosTexter: A Boosting-based Systemfor Text Categorization
Machine Learning - Special issue on information retrieval
An Evaluation of Statistical Approaches to Text Categorization
Information Retrieval
Rough Sets: Theoretical Aspects of Reasoning about Data
Rough Sets: Theoretical Aspects of Reasoning about Data
Information Retrieval
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Discretizing continuous attributes in AdaBoost for text categorization
ECIR'03 Proceedings of the 25th European conference on IR research
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
A neural network classifier based on Dempster-Shafer theory
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Is a Voting Approach Accurate for Opinion Mining?
DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
Soft fuzzy rough sets and its application in decision making
Artificial Intelligence Review
Hi-index | 0.00 |
An important issue in text mining is how to make use of multiple pieces knowledge discovered to improve future decisions. In this paper, we propose a new approach to combining multiple sets of rules for text categorization using Dempster's rule of combination. We develop a boosting-like technique for generating multiple sets of rules based on rough set theory and model classification decisions from multiple sets of rules as pieces of evidence which can be combined by Dempster's rule of combination. We apply these methods to 10 of the 20-newsgroups--a benchmark data collection (Baker and McCallum 1998), individually and in combination. Our experimental results show that the performance of the best combination of the multiple sets of rules on the 10 groups of the benchmark data is statistically significant and better than that of the best single set of rules. The comparative analysis between the Dempster---Shafer and the majority voting (MV) methods along with an overfitting study confirm the advantage and the robustness of our approach.