Distributed Data Mining in Credit Card Fraud Detection

Authors:
Philip K. Chan;Wei Fan;Andreas L. Prodromidis;Salvatore J. Stolfo
Affiliations:
-;-;-;-
Venue:
IEEE Intelligent Systems
Year:
1999

Citing 5
Cited 62

Original Contribution: Stacked generalization

Neural Networks
Improved boosting algorithms using confidence-rated predictions

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Mining in a data-flow environment: experience in network intrusion detection

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
AdaCost: Misclassification Cost-Sensitive Boosting

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Pruning Adaptive Boosting

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning

Using Trusted Email to Prevent Credit Card Frauds in Multimedia Products

World Wide Web
Data mining-based intrusion detectors: an overview of the columbia IDS project

ACM SIGMOD Record
A Neural Classifier with Fraud Density Map for Effective Credit Card Fraud Detection

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
A Synthetic Fraud Data Generation Methodology

ICICS '02 Proceedings of the 4th International Conference on Information and Communications Security
On finding common neighborhoods in massive graphs

Theoretical Computer Science
Minority report in fraud detection: classification of skewed data

ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Generation of synthetic data sets for evaluating the accuracy of knowledge discovery systems

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Wrapper-based computation and evaluation of sampling methods for imbalanced datasets

UBDM '05 Proceedings of the 1st international workshop on Utility-based data mining
Sharing Classifiers among Ensembles from Related Problem Domains

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
The frequent wayfinding-sequence (FWS) methodology: Finding preferred routes in complex virtual environments

International Journal of Human-Computer Studies
Online decoding of Markov models under latency constraints

ICML '06 Proceedings of the 23rd international conference on Machine learning
Fighting cybercrime: a review and the Taiwan experience

Decision Support Systems - Special issue: Intelligence and security informatics
Behavior-based modeling and its application to Email analysis

ACM Transactions on Internet Technology (TOIT)
Ensemble Pruning Via Semi-definite Programming

The Journal of Machine Learning Research
Developing Mining-Grid Centric e-Finance Portal

WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Knowledge modeling -- State of the art

Integrated Computer-Aided Engineering
Distributed feature extraction in a p2p setting: a case study

Future Generation Computer Systems - Special section: Data mining in grid computing environments
Classifying under computational resource constraints: anytime classification using probabilistic estimators

Machine Learning
An incremental cluster-based approach to spam filtering

Expert Systems with Applications: An International Journal
Appraisal of companies with Bayesian networks

International Journal of Business Intelligence and Data Mining
An efficient algorithm for mining frequent closed itemsets in dynamic transaction databases

International Journal of Intelligent Systems Technologies and Applications
Back propagation networks for credit card fraud prediction using stratified personalized data

ISP'06 Proceedings of the 5th WSEAS International Conference on Information Security and Privacy
Location of trusted email for prevention of credit card fraud in soft-products e-commerce

AIC'04 Proceedings of the 4th WSEAS International Conference on Applied Informatics and Communications
Classifying under computational resource constraints: anytime classification using probabilistic estimators

Machine Learning
New results for finding common neighborhoods in massive graphs in the data stream model

Theoretical Computer Science
Local reweight wrapper for the problem of imbalance

International Journal of Artificial Intelligence and Soft Computing
Handling imbalanced data sets with a modification of Decorate algorithm

International Journal of Computer Applications in Technology
Transaction aggregation as a strategy for credit card fraud detection

Data Mining and Knowledge Discovery
Error analysis in artificial neural networks: the imbalanced distribution case

SMO'08 Proceedings of the 8th conference on Simulation, modelling and optimization
Information Market-Based Decision Fusion

Management Science
Credit card fraud detection: A fusion approach using Dempster-Shafer theory and Bayesian learning

Information Fusion
Knowledge-Rich Data Mining in Financial Risk Detection

ICCS 2009 Proceedings of the 9th International Conference on Computational Science
Agent-Based Non-distributed and Distributed Clustering

MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
A comprehensive survey of numeric and symbolic outlier mining techniques

Intelligent Data Analysis
Anytime classification for a pool of instances

Machine Learning
Fighting cybercrime: a review and the Taiwan experience

Decision Support Systems - Special issue: Intelligence and security informatics
Developing mining-grid centric e-finance portals for risk management

JSAI'06 Proceedings of the 20th annual conference on New frontiers in artificial intelligence
An unbalanced data classification model using hybrid sampling technique for fraud detection

PReMI'07 Proceedings of the 2nd international conference on Pattern recognition and machine intelligence
An analytic approach to select data mining for business decision

Expert Systems with Applications: An International Journal
A hybrid fraud scoring and spike detection technique in streaming data

Intelligent Data Analysis
Data clustering by minimizing disconnectivity

Information Sciences: an International Journal
The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature

Decision Support Systems
Data mining for credit card fraud: A comparative study

Decision Support Systems
Novel questionnaire-responded transaction approach with SVM for credit card fraud detection

ISNN'05 Proceedings of the Second international conference on Advances in neural networks - Volume Part II
Learning of neural networks for fraud detection based on a partial area under curve

ISNN'05 Proceedings of the Second international conference on Advances in neural networks - Volume Part II
On the effectiveness of preprocessing methods when dealing with different levels of class imbalance

Knowledge-Based Systems
Usability of display-equipped RFID tags for security purposes

ESORICS'11 Proceedings of the 16th European conference on Research in computer security
Improved competitive learning neural networks for network intrusion and fraud detection

Neurocomputing
Context-Sensitive regression analysis for distributed data

ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
Two-Stage credit card fraud detection using sequence alignment

ICISS'06 Proceedings of the Second international conference on Information Systems Security
A game-theoretic approach to credit card fraud detection

ICISS'05 Proceedings of the First international conference on Information Systems Security
A double-ensemble approach for classifying skewed data streams

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Employing transaction aggregation strategy to detect credit card fraud

Expert Systems with Applications: An International Journal
The role of intelligent agents and data mining in electronic partnership management

Expert Systems with Applications: An International Journal
An effective early fraud detection method for online auctions

Electronic Commerce Research and Applications
A new probabilistic active sample selection algorithm for class imbalance problem

International Journal of Knowledge Engineering and Soft Data Paradigms
The fuzzy Laplacianclassifier

Neurocomputing
Metafraud: a meta-learning framework for detecting financial fraud

MIS Quarterly
Using social network knowledge for detecting spider constructions in social security fraud

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Enhanced and hierarchical structure algorithm for data imbalance problem in semantic extraction under massive video dataset

Multimedia Tools and Applications
Editorial: data mining in electronic commerce - support vs. confidence

Journal of Theoretical and Applied Electronic Commerce Research
Can Jannie verify? Usability of display-equipped RFID tags for security purposes

Journal of Computer Security - Research in Computer Security and Privacy: Emerging Trends

Quantified Score

Hi-index	0.01

Visualization

Abstract

Credit card transactions continue to grow in number, taking a larger share of the US payment system, and have led to a higher rate of stolen account numbers and subsequent losses by banks. Hence, improved fraud detection has become essential to maintain the viability of the US payment system. Banks have been fielding early fraud warning systems for some years. We seek to improve upon the state-of-the-art in commercial practice via large scale data mining. Scalable techniques to analyze massive amounts of transaction data to compute efficient fraud detectors in a timely manner is an important problem, especially for e-commerce. Besides scalability and efficiency, the fraud detection task exhibits technical problems that include skewed distributions of training data and non-uniform cost per error, both of which have not been widely studied in the knowledge discovery and data mining community. In this article we survey and evaluate a number of techniques that we have proposed and implemented that address these three main issues concurrently. Our proposed methods of combining multiple learned fraud detectors under a "cost model" are general and demonstrably useful; our empirical results demonstrate that we can significantly reduce loss due to fraud through distributed data mining of fraud models.