A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems

Authors:
David J. Hand; Robert J. Till
Affiliations:
Department of Mathematics, Imperial College, Huxley Building, 180 Queen's Gate, London SW7 2BZ, UK. d.j.hand@ic.ac.uk; Department of Mathematics, Imperial College, Huxley Building, 180 Queen's Gate, London SW7 2BZ, UK. r.till@ic.ac.uk
Venue:
Machine Learning
Year:
2001


Abstract

The area under the ROC curve, or the equivalent Gini index, is a widely used measure of the performance of supervised classification rules. It has the attractive property that it side-steps the need to specify the costs of the different kinds of misclassification. However, the simple form is applicable only to the case of two classes. We extend the definition to the case of more than two classes by averaging pairwise comparisons. This measure reduces to the standard form in the two-class case. We compare its properties with the standard measure of proportion correct, and with an alternative definition of proportion correct based on pairwise comparison of classes, for a simple artificial case, and illustrate its application on eight data sets. On the data sets we examined, the measures produced similar, but not identical, results, reflecting the different aspects of performance that they were measuring. Like the area under the ROC curve, the measure we propose is useful in those many situations where it is impossible to give costs for the different kinds of misclassification.
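
The abstract describes the extension only as "averaging pairwise comparisons", so the following is a minimal sketch of one way such an average could be computed, not the authors' reference implementation. It assumes that, for each pair of classes, the two-class area is estimated twice (once ranking by the estimated probability of each class in the pair), that the two estimates are averaged, and that the result is then averaged over all class pairs with equal weight. The function names `auc_one_vs_one` and `pairwise_average_auc`, the input layout (a list of labels plus a list of per-class probability dictionaries), and the half-credit treatment of ties are illustrative assumptions.

```python
from itertools import combinations


def auc_one_vs_one(scores_pos, scores_neg):
    """Estimate P(random positive scores above random negative); ties count 1/2."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


def pairwise_average_auc(labels, probs):
    """Average the two-class AUC over all unordered pairs of classes.

    labels: list of true class labels.
    probs:  list of dicts, one per point, mapping class -> estimated probability.
    (Hypothetical interface chosen for this sketch.)
    """
    classes = sorted(set(labels))
    total = 0.0
    for i, j in combinations(classes, 2):
        # Separate classes i and j using the estimated probability of class i.
        a_i_given_j = auc_one_vs_one(
            [p[i] for y, p in zip(labels, probs) if y == i],
            [p[i] for y, p in zip(labels, probs) if y == j],
        )
        # Separate the same two classes using the estimated probability of class j.
        a_j_given_i = auc_one_vs_one(
            [p[j] for y, p in zip(labels, probs) if y == j],
            [p[j] for y, p in zip(labels, probs) if y == i],
        )
        total += 0.5 * (a_i_given_j + a_j_given_i)
    n_pairs = len(classes) * (len(classes) - 1) / 2
    return total / n_pairs


# Two-class check: the pairwise average equals the ordinary AUC.
labels = [0, 0, 1, 1]
probs = [
    {0: 0.9, 1: 0.1},
    {0: 0.6, 1: 0.4},
    {0: 0.65, 1: 0.35},
    {0: 0.2, 1: 0.8},
]
print(pairwise_average_auc(labels, probs))  # 0.75
```

With two classes there is a single pair, and when the two class probabilities sum to one both directions give the same ranking, so the sketch returns the ordinary two-class area, consistent with the abstract's statement that the measure reduces to the standard form. The equivalent Gini index can be obtained from a two-class area A as 2A - 1.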