Machine intelligence 12
C4.5: programs for machine learning
C4.5: programs for machine learning
Machine Learning
Overcoming Process Delays with Decision Tree Induction
IEEE Expert: Intelligent Systems and Their Applications
Supporting Start-to-Finish Development of Knowledge Bases
Machine Learning
Data mining tasks and methods: Rule discovery: association rules
Handbook of data mining and knowledge discovery
Industry: using decision tree induction to minimize process delays in the printing industry
Handbook of data mining and knowledge discovery
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Bootstrapping rule induction to achieve rule stability and reduction
Journal of Intelligent Information Systems
Flexibly exploiting prior knowledge in empirical learning
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Searching for meaningful feature interactions with backward-chaining rule induction
IDA'05 Proceedings of the 6th international conference on Advances in Intelligent Data Analysis
Hi-index | 0.00 |
Exploring the vast number of possible feature interactions in domains such as gene expression microarray data is an onerous task. We describe Backward-Chaining Rule Induction (BCRI) as a semi-supervised mechanism for biasing the search for IF-THEN rules that express plausible feature interactions. BCRI adds to a relatively limited tool-chest of hypothesis generation software and is an alternative to purely unsupervised association-rule learning. We illustrate BCRI by using it to search for gene-to-gene causal mechanisms that underlie lung cancer. Mapping hypothesized gene interactions against prior knowledge offers support and explanations for hypothesized interactions, and suggests gaps in current knowledge that induction might help fill. Our assumption is that "good" hypotheses incrementally extend/revise existing knowledge. BCRI is implemented as a wrapper around a base supervised-rule-learning algorithm. We summarize our prior work with an adaptation of C4.5 as the base algorithm (C45-BCRI), extending this in the current study to use Brute as the base algorithm (Brute-BCRI). In contrast to C4.5's greedy strategy, Brute extensively searches the rule space. Moreover, Brute returns many more rules (i.e., hypothesized feature interactions) than does C4.5. To remain an effective hypothesis-generation tool requires that Brute-BCRI more carefully rank and prune hypothesized interactions than does C45-BCRI. Prior knowledge serves to evaluate final Brute-BCRI rules just as it does with C45-BCRI, but prior knowledge also serves to evaluate and prune intermediate search states, thus maintaining a manageable number of rules for evaluation by a domain expert.