Class-Dependent Discretization for Inductive Learning from Continuous and Mixed-Mode Data

Authors:
John Y. Ching;Andrew K. C. Wong;Keith C. C. Chan
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
1995

Quantified Score

Hi-index	0.15

Abstract

Inductive learning systems can be used effectively to acquire classification knowledge from examples. Many existing symbolic learning algorithms can be applied in domains with continuous attributes once they are combined with a discretization algorithm that transforms the continuous attributes into ordered discrete ones. In this paper, a new information-theoretic discretization method optimized for supervised learning is proposed. The approach seeks to maximize the mutual dependence, as measured by the interdependence redundancy, between the discrete intervals and the class labels, and it can automatically determine the most preferred number of intervals for an inductive learning application. The method has been tested on a number of inductive learning examples, which show that the class-dependent discretizer can significantly improve the classification performance of many existing learning algorithms in domains containing numeric attributes.
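
The abstract names the objective but gives no formula. The sketch below is an illustration only, not the authors' algorithm. It assumes the usual definition of interdependence redundancy as mutual information divided by joint entropy, R = I(C; A) / H(C, A), and scores one fixed set of cut points. The function name, the parameter names, and the toy data are hypothetical. The search over cut points and the choice of the number of intervals are not shown.

```python
import bisect
import math
from collections import Counter


def interdependence_redundancy(values, labels, cut_points):
    """Score a discretization by R = I(C; A) / H(C, A).

    Assumption: this ratio is the interdependence redundancy the
    abstract refers to. `cut_points` must be sorted in ascending order.
    """
    # Map each continuous value to the index of the interval it falls in.
    intervals = [bisect.bisect_right(cut_points, v) for v in values]
    n = len(values)

    joint_counts = Counter(zip(labels, intervals))
    class_counts = Counter(labels)
    interval_counts = Counter(intervals)

    mutual_info = 0.0
    joint_entropy = 0.0
    for (c, a), count in joint_counts.items():
        p_ca = count / n
        p_c = class_counts[c] / n
        p_a = interval_counts[a] / n
        mutual_info += p_ca * math.log2(p_ca / (p_c * p_a))
        joint_entropy -= p_ca * math.log2(p_ca)

    # A single class in a single interval has zero joint entropy.
    return mutual_info / joint_entropy if joint_entropy > 0 else 0.0


# Toy data (hypothetical): one attribute, two classes.
values = [1.0, 2.0, 3.0, 4.0]
labels = ["a", "a", "b", "b"]
good = interdependence_redundancy(values, labels, [2.5])
poor = interdependence_redundancy(values, labels, [1.5])
```

On this toy data the cut at 2.5 separates the two classes perfectly and scores 1.0, while the cut at 1.5 scores lower. A discretizer of the kind described would prefer the higher-scoring partition.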