Credit card transactions continue to grow in number, taking a larger share of the US payment system, and this growth has led to a higher rate of stolen account numbers and subsequent losses by banks. Hence, improved fraud detection has become essential to maintain the viability of the US payment system. Banks have been fielding early fraud warning systems for some years. We seek to improve upon the state of the art in commercial practice via large-scale data mining. Developing scalable techniques that analyze massive amounts of transaction data and compute efficient fraud detectors in a timely manner is an important problem, especially for e-commerce. Besides scalability and efficiency, the fraud detection task exhibits technical problems that include skewed distributions of training data and non-uniform cost per error, neither of which has been widely studied in the knowledge discovery and data mining community. In this article we survey and evaluate a number of techniques that we have proposed and implemented that address these three main issues concurrently. Our proposed methods of combining multiple learned fraud detectors under a "cost model" are general and demonstrably useful; our empirical results demonstrate that we can significantly reduce loss due to fraud through distributed data mining of fraud models.
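To make the idea of non-uniform cost per error concrete, the following is a minimal sketch of how several fraud detectors might be combined and then scored by dollar cost rather than by accuracy. It is an illustration only, not the cost model used in the article: the fixed investigation cost `OVERHEAD`, the threshold detectors, the majority-vote combiner, and the sample transactions are all assumptions introduced here.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class Transaction:
    amount: float   # transaction amount in dollars
    is_fraud: bool  # ground-truth label


# Hypothetical fixed cost of investigating one flagged transaction.
OVERHEAD = 75.0

Detector = Callable[[Transaction], bool]


def total_cost(transactions: Sequence[Transaction],
               flagged: Sequence[bool],
               overhead: float = OVERHEAD) -> float:
    """Sum the cost of each decision under the assumed cost model."""
    cost = 0.0
    for t, f in zip(transactions, flagged):
        if f:
            # Every flagged transaction is investigated, fraud or not.
            cost += overhead
        elif t.is_fraud:
            # A missed fraud loses the full transaction amount.
            cost += t.amount
    return cost


def majority_vote(detectors: Sequence[Detector], t: Transaction) -> bool:
    """Flag a transaction when more than half of the detectors flag it."""
    votes = sum(1 for d in detectors if d(t))
    return votes * 2 > len(detectors)


# Toy data: fraud is the minority class and errors differ widely in cost.
transactions: List[Transaction] = [
    Transaction(500.0, True),
    Transaction(20.0, False),
    Transaction(300.0, False),
    Transaction(1000.0, True),
    Transaction(40.0, True),
]

# Stand-ins for learned detectors: simple amount thresholds (assumed).
detectors: List[Detector] = [
    lambda t: t.amount > 250.0,
    lambda t: t.amount > 800.0,
    lambda t: t.amount > 30.0,
]

flags = [majority_vote(detectors, t) for t in transactions]
flag_none = [False] * len(transactions)
flag_all = [True] * len(transactions)

print("combined detectors:", total_cost(transactions, flags))
print("flag nothing:      ", total_cost(transactions, flag_none))
print("flag everything:   ", total_cost(transactions, flag_all))
```

Under a model of this kind, a detector is judged by the money it saves, so missing a large fraudulent transaction is penalized far more heavily than raising a false alarm on a small legitimate one.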