Neural networks and the bias/variance dilemma
Neural Computation
C4.5: programs for machine learning
C4.5: programs for machine learning
Machine Learning
Error reduction through learning multiple descriptions
Machine Learning
A decision-theoretic generalization of on-line learning and an application to boosting
Journal of Computer and System Sciences - Special issue: 26th annual ACM symposium on the theory of computing & STOC'94, May 23–25, 1994, and second annual Europe an conference on computational learning theory (EuroCOLT'95), March 13–15, 1995
The Random Subspace Method for Constructing Decision Forests
IEEE Transactions on Pattern Analysis and Machine Intelligence
A note on sampling a tape-file
Communications of the ACM
Robust Classification for Imprecise Environments
Machine Learning
The UCI KDD archive of large data sets for data mining research and experimentation
ACM SIGKDD Explorations Newsletter - Special issue on “Scalable data mining algorithms”
A framework for constructing features and models for intrusion detection systems
ACM Transactions on Information and System Security (TISSEC)
Principles of data mining
Neural Networks for Pattern Recognition
Neural Networks for Pattern Recognition
On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality
Data Mining and Knowledge Discovery
IEEE Transactions on Pattern Analysis and Machine Intelligence
Using Rule Sets to Maximize ROC Performance
ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
UAI '88 Proceedings of the Fourth Annual Conference on Uncertainty in Artificial Intelligence
Combining Pattern Classifiers: Methods and Algorithms
Combining Pattern Classifiers: Methods and Algorithms
In Defense of One-Vs-All Classification
The Journal of Machine Learning Research
Lessons and Challenges from Mining Retail E-Commerce Data
Machine Learning
An introduction to ROC analysis
Pattern Recognition Letters - Special issue: ROC analysis in pattern recognition
Data Mining and Knowledge Discovery
Top 10 algorithms in data mining
Knowledge and Information Systems
Data Analysis in the 21st Century
Statistical Analysis and Data Mining
A decision rule-based method for feature selection in predictive data mining
Expert Systems with Applications: An International Journal
Applied Data Mining for Business and Industry
Applied Data Mining for Business and Industry
DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Empirical comparison of four classifier fusion strategies for positive-versus-negative ensembles
Proceedings of the South African Institute of Computer Scientists and Information Technologists Conference on Knowledge, Innovation and Leadership in a Diverse, Multidisciplinary Environment
Using OVA modeling to improve classification performance for large datasets
Expert Systems with Applications: An International Journal
Learning intrusion detection: supervised or unsupervised?
ICIAP'05 Proceedings of the 13th international conference on Image Analysis and Processing
Using attack-specific feature subsets for network intrusion detection
AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Nearest neighbor pattern classification
IEEE Transactions on Information Theory
INFORMS Journal on Computing
Hi-index | 0.00 |
The process of constructing several base models that are then combined into a single classification model for prediction is called model aggregation or ensemble classification. Positive-versus-negative pVn classification is a new method for the implementation of base models for aggregation. pVn classification involves the decomposition of a k-class prediction task into mm k subproblems. One base model is constructed for each subproblem to predict a subset of the k classes. The base models are then combined into one aggregate model for prediction. This paper reports studies that were conducted to demonstrate the performance of pVn classification when large volumes of data are available for modeling as is commonly the case in data mining. It is demonstrated in this paper that pVn modeling provides the capability to use a large amount of available data in a large data set for base model training. It is also demonstrated that pVn models created from large data sets provide a higher level of predictive performance compared to single k-class models.