This paper proposes a novel supervised discretisation algorithm based on Correlation Maximisation (CM) using Multiple Correspondence Analysis (MCA), an effective technique for capturing the correlation between multiple variables. For each numeric feature, the algorithm uses MCA to measure the correlations between feature intervals (items) and classes, and chooses the set of cut-points yielding the maximum correlation as the discretisation scheme for that feature. The discretised feature therefore not only summarises the original numeric feature concisely but also carries the maximum correlation information for predicting class labels. The proposed algorithm is compared with seven state-of-the-art supervised discretisation algorithms using six well-known classifiers on 19 UCI data sets. The experimental results show that it automatically generates a set of feature intervals that produces the best classification results on average.
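The cut-point selection idea described above can be illustrated with a minimal sketch. It is not the paper's method: the abstract does not specify how the MCA-based correlation is computed, so the sketch substitutes Cramér's V on the interval-by-class contingency table as a stand-in score, and it searches for a single cut-point only. The function names `cramers_v`, `candidate_cut_points`, and `best_cut_point` are hypothetical and chosen for illustration.

```python
from collections import Counter
from math import sqrt


def cramers_v(bins, labels):
    """Cramér's V between two categorical sequences.

    Stand-in association score (assumption): the paper uses an MCA-based
    correlation, whose details are not given in the abstract.
    """
    n = len(labels)
    row = Counter(bins)
    col = Counter(labels)
    joint = Counter(zip(bins, labels))
    # With a single interval or a single class there is no association to measure.
    if len(row) < 2 or len(col) < 2:
        return 0.0
    chi2 = 0.0
    for r, n_r in row.items():
        for c, n_c in col.items():
            expected = n_r * n_c / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * (min(len(row), len(col)) - 1)))


def candidate_cut_points(values):
    """Midpoints between consecutive distinct values of a numeric feature."""
    distinct = sorted(set(values))
    return [(a + b) / 2 for a, b in zip(distinct, distinct[1:])]


def best_cut_point(values, labels):
    """Return the single cut-point whose two intervals score highest."""
    best_cut, best_score = None, -1.0
    for cut in candidate_cut_points(values):
        bins = [v > cut for v in values]  # two intervals: <= cut and > cut
        score = cramers_v(bins, labels)
        if score > best_score:
            best_cut, best_score = cut, score
    return best_cut, best_score


# Toy feature whose low values belong to class "a" and high values to class "b".
values = [1.0, 1.5, 2.0, 6.0, 6.5, 7.0]
labels = ["a", "a", "a", "b", "b", "b"]
cut, score = best_cut_point(values, labels)
print(cut, score)
```

On this toy feature the search selects the midpoint between 2.0 and 6.0, since that split separates the two classes completely. Extending the sketch to several cut-points would need a score that does not simply reward finer partitions, which is one reason the choice of correlation measure matters.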