Quantifying inductive bias: AI learning algorithms and Valiant's learning framework
Artificial Intelligence
Attribute-oriented induction in data mining
Advances in knowledge discovery and data mining
Machine Learning - Special issue on learning with probabilistic representations
Distributional clustering of words for text classification
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
CACTUS—clustering categorical data using summaries
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Applications of Data Mining to Electronic Commerce
Data Mining and Knowledge Discovery
Using Feature Hierarchies in Bayesian Network Learning
SARA '02 Proceedings of the 4th International Symposium on Abstraction, Reformulation, and Approximation
Clustering categorical data: an approach based on dynamical systems
The VLDB Journal — The International Journal on Very Large Data Bases
Ontologies: A Silver Bullet for Knowledge Management and Electronic Commerce
Ontologies: A Silver Bullet for Knowledge Management and Electronic Commerce
Distributional clustering of English words
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Proceedings of the 15th international conference on World Wide Web
Learning accurate and concise naïve Bayes classifiers from attribute value taxonomies and data
Knowledge and Information Systems
Canonicalization of database records using adaptive similarity measures
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning decision trees with taxonomy of propositionalized attributes
Pattern Recognition
Multinomial event model based abstraction for sequence and text classification
SARA'05 Proceedings of the 6th international conference on Abstraction, Reformulation and Approximation
Multi-level rough set reduction for decision rule mining
Applied Intelligence
Hi-index | 12.05 |
In this paper, we consider the problem of generating concise but accurate naive Bayes classifiers using taxonomy of propositionalized attributes. For the problem, we introduce propositionalized attribute taxonomy guided naive Bayes Learner (PAT-NBL), a machine learning algorithm that effectively utilizes taxonomy to generate compact classifiers. We extend classical naive Bayes learner to the PAT-NBL algorithm that traverses over a propositionalized taxonomy to search for a locally optimal cut. PAT-NBL uses bottom-up search to find the locally optimal cut on a given taxonomy. For the evaluation of candidate cuts, we apply conditional log-likelihood, conditional minimum description length, and conditional Akaike information criterion. The detected cut enables PAT-NBL to construct an instance space which corresponds to the taxonomy and the data. That is, after PAT-NBL determines a cut according to its information-theoretic criteria, the algorithm generates a concise naive Bayes classifier based on the cut. Our experimental results on UCI Machine Learning benchmark data sets indicate that the proposed algorithm can generate naive Bayes classifiers that are more compact and often comparably accurate to those produced by standard naive Bayes learners.