With the continuous expansion of data availability in many large-scale, complex, and networked systems, such as surveillance, security, Internet, and finance, it becomes critical to advance the fundamental understanding of knowledge discovery and analysis from raw data to support decision-making processes. Although existing knowledge discovery and data engineering techniques have shown great success in many real-world applications, the problem of learning from imbalanced data (the imbalanced learning problem) is a relatively new challenge that has attracted growing attention from both academia and industry. The imbalanced learning problem is concerned with the performance of learning algorithms in the presence of underrepresented data and severe class distribution skews. Because imbalanced data sets are inherently complex, learning from such data requires new understandings, principles, algorithms, and tools to transform vast amounts of raw data efficiently into information and knowledge representation. In this paper, we provide a comprehensive review of the development of research in learning from imbalanced data. Our focus is to provide a critical review of the nature of the problem, the state-of-the-art technologies, and the current assessment metrics used to evaluate learning performance under the imbalanced learning scenario. Furthermore, to stimulate future research in this field, we also highlight the major opportunities and challenges, as well as potentially important research directions for learning from imbalanced data.