SNARE: a link analytic system for graph labeling and risk detection

Authors:
Mary McGlohon;Stephen Bay;Markus G. Anderle;David M. Steier;Christos Faloutsos
Affiliations:
Carnegie Mellon University, Pittsburgh, PA, USA;PricewaterhouseCoopers, LLC, San Jose, CA, USA;PricewaterhouseCoopers, LLC, San Jose, CA, USA;PricewaterhouseCoopers, LLC, San Jose, CA, USA;Carnegie Mellon University, Pittsburgh, PA, USA
Venue:
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2009

Citing 13
Cited 11

Authoritative sources in a hyperlinked environment

Journal of the ACM (JACM)
Adaptive Fraud Detection

Data Mining and Knowledge Discovery
Understanding belief propagation and its generalizations

Exploring artificial intelligence in the new millennium
The myth of the double-blind review?: author identification using only citations

ACM SIGKDD Explorations Newsletter
Propagation of trust and distrust

Proceedings of the 13th international conference on World Wide Web
Why collective inference improves relational classification

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Using relational knowledge discovery to prevent securities fraud

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Semi-supervised learning with graphs

Semi-supervised learning with graphs
Propagation Models for Trust and Distrust in Social Networks

Information Systems Frontiers
Large Scale Detection of Irregularities in Accounting Data

ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Netprobe: a fast and scalable system for fraud detection in online auction networks

Proceedings of the 16th international conference on World Wide Web
Finding tribes: identifying close-knit individuals from employment patterns

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Relational data pre-processing techniques for improved securities fraud detection

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining

Identifying non-explicit citing sentences for citation-based summarization

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Catching bad guys with graph mining

XRDS: Crossroads, The ACM Magazine for Students - The Fate of Money
Apolo: making sense of large network data by combining rich user interaction and machine learning

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Apolo: interactive large graph sensemaking by combining machine learning and visualization

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Unifying guilt-by-association approaches: theorems and fast algorithms

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Identify Online Store Review Spammers via Social Review Graph

ACM Transactions on Intelligent Systems and Technology (TIST)
An effective early fraud detection method for online auctions

Electronic Commerce Research and Applications
Top-N recommendation through belief propagation

Proceedings of the 21st ACM international conference on Information and knowledge management
Anomaly, event, and fraud detection in large network datasets

Proceedings of the sixth ACM international conference on Web search and data mining
Inside insider trading: patterns & discoveries from a large scale exploratory analysis

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Early security classification of skype users via machine learning

Proceedings of the 2013 ACM workshop on Artificial intelligence and security

Quantified Score

Hi-index	0.00

Visualization

Abstract

Classifying nodes in networks is a task with a wide range of applications. It can be particularly useful in anomaly and fraud detection. Many resources are invested in the task of fraud detection due to the high cost of fraud, and being able to automatically detect potential fraud quickly and precisely allows human investigators to work more efficiently. Many data analytic schemes have been put into use; however, schemes that bolster link analysis prove promising. This work builds upon the belief propagation algorithm for use in detecting collusion and other fraud schemes. We propose an algorithm called SNARE (Social Network Analysis for Risk Evaluation). By allowing one to use domain knowledge as well as link knowledge, the method was very successful for pinpointing misstated accounts in our sample of general ledger data, with a significant improvement over the default heuristic in true positive rates, and a lift factor of up to 6.5 (more than twice that of the default heuristic). We also apply SNARE to the task of graph labeling in general on publicly-available datasets. We show that with only some information about the nodes themselves in a network, we get surprisingly high accuracy of labels. Not only is SNARE applicable in a wide variety of domains, but it is also robust to the choice of parameters and highly scalable-linearly with the number of edges in a graph.