Positive-versus-Negative Classification for Model Aggregation in Predictive Data Mining

Authors:
Patricia E. N. Lutu;Andries P. Engelbrecht
Affiliations:
Department of Computer Science, University of Pretoria, Pretoria 0002, South Africa;Department of Computer Science, University of Pretoria, Pretoria 0002, South Africa
Venue:
INFORMS Journal on Computing
Year:
2013

Citing 33
Cited 0

Neural networks and the bias/variance dilemma

Neural Computation
C4.5: programs for machine learning

C4.5: programs for machine learning
Bagging predictors

Machine Learning
Error reduction through learning multiple descriptions

Machine Learning
A decision-theoretic generalization of on-line learning and an application to boosting

Journal of Computer and System Sciences - Special issue: 26th annual ACM symposium on the theory of computing & STOC'94, May 23–25, 1994, and second annual Europe an conference on computational learning theory (EuroCOLT'95), March 13–15, 1995
The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
A note on sampling a tape-file

Communications of the ACM
Robust Classification for Imprecise Environments

Machine Learning
The UCI KDD archive of large data sets for data mining research and experimentation

ACM SIGKDD Explorations Newsletter - Special issue on “Scalable data mining algorithms”
A framework for constructing features and models for intrusion detection systems

ACM Transactions on Information and System Security (TISSEC)
Principles of data mining

Principles of data mining
A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems

Machine Learning
Neural Networks for Pattern Recognition

Neural Networks for Pattern Recognition
On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality

Data Mining and Knowledge Discovery
Neural Network Ensembles

IEEE Transactions on Pattern Analysis and Machine Intelligence
Using Rule Sets to Maximize ROC Performance

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Multiple decision trees

UAI '88 Proceedings of the Fourth Annual Conference on Uncertainty in Artificial Intelligence
Combining Pattern Classifiers: Methods and Algorithms

Combining Pattern Classifiers: Methods and Algorithms
In Defense of One-Vs-All Classification

The Journal of Machine Learning Research
Lessons and Challenges from Mining Retail E-Commerce Data

Machine Learning
An introduction to ROC analysis

Pattern Recognition Letters - Special issue: ROC analysis in pattern recognition
Differential prioritization in feature selection and classifier aggregation for multiclass microarray datasets

Data Mining and Knowledge Discovery
Top 10 algorithms in data mining

Knowledge and Information Systems
Data Analysis in the 21st Century

Statistical Analysis and Data Mining
A decision rule-based method for feature selection in predictive data mining

Expert Systems with Applications: An International Journal
Applied Data Mining for Business and Industry

Applied Data Mining for Business and Industry
Using confusion matrices and confusion graphs to design ensemble classification models from large datasets

DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Empirical comparison of four classifier fusion strategies for positive-versus-negative ensembles

Proceedings of the South African Institute of Computer Scientists and Information Technologists Conference on Knowledge, Innovation and Leadership in a Diverse, Multidisciplinary Environment
Using OVA modeling to improve classification performance for large datasets

Expert Systems with Applications: An International Journal
Learning intrusion detection: supervised or unsupervised?

ICIAP'05 Proceedings of the 13th international conference on Image Analysis and Processing
Using attack-specific feature subsets for network intrusion detection

AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Nearest neighbor pattern classification

IEEE Transactions on Information Theory
Base Model Combination Algorithm for Resolving Tied Predictions for K-Nearest Neighbor OVA Ensemble Models

INFORMS Journal on Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

The process of constructing several base models that are then combined into a single classification model for prediction is called model aggregation or ensemble classification. Positive-versus-negative pVn classification is a new method for the implementation of base models for aggregation. pVn classification involves the decomposition of a k-class prediction task into mm k subproblems. One base model is constructed for each subproblem to predict a subset of the k classes. The base models are then combined into one aggregate model for prediction. This paper reports studies that were conducted to demonstrate the performance of pVn classification when large volumes of data are available for modeling as is commonly the case in data mining. It is demonstrated in this paper that pVn modeling provides the capability to use a large amount of available data in a large data set for base model training. It is also demonstrated that pVn models created from large data sets provide a higher level of predictive performance compared to single k-class models.