Authoritative sources in a hyperlinked environment
Journal of the ACM (JACM)
Data Mining and Knowledge Discovery
Understanding belief propagation and its generalizations
Exploring artificial intelligence in the new millennium
The myth of the double-blind review?: author identification using only citations
ACM SIGKDD Explorations Newsletter
Propagation of trust and distrust
Proceedings of the 13th international conference on World Wide Web
Why collective inference improves relational classification
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Using relational knowledge discovery to prevent securities fraud
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Semi-supervised learning with graphs
Semi-supervised learning with graphs
Propagation Models for Trust and Distrust in Social Networks
Information Systems Frontiers
Large Scale Detection of Irregularities in Accounting Data
ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Netprobe: a fast and scalable system for fraud detection in online auction networks
Proceedings of the 16th international conference on World Wide Web
Finding tribes: identifying close-knit individuals from employment patterns
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Relational data pre-processing techniques for improved securities fraud detection
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Identifying non-explicit citing sentences for citation-based summarization
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Catching bad guys with graph mining
XRDS: Crossroads, The ACM Magazine for Students - The Fate of Money
Apolo: making sense of large network data by combining rich user interaction and machine learning
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Apolo: interactive large graph sensemaking by combining machine learning and visualization
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Unifying guilt-by-association approaches: theorems and fast algorithms
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Identify Online Store Review Spammers via Social Review Graph
ACM Transactions on Intelligent Systems and Technology (TIST)
An effective early fraud detection method for online auctions
Electronic Commerce Research and Applications
Top-N recommendation through belief propagation
Proceedings of the 21st ACM international conference on Information and knowledge management
Anomaly, event, and fraud detection in large network datasets
Proceedings of the sixth ACM international conference on Web search and data mining
Inside insider trading: patterns & discoveries from a large scale exploratory analysis
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Early security classification of skype users via machine learning
Proceedings of the 2013 ACM workshop on Artificial intelligence and security
Hi-index | 0.00 |
Classifying nodes in networks is a task with a wide range of applications. It can be particularly useful in anomaly and fraud detection. Many resources are invested in the task of fraud detection due to the high cost of fraud, and being able to automatically detect potential fraud quickly and precisely allows human investigators to work more efficiently. Many data analytic schemes have been put into use; however, schemes that bolster link analysis prove promising. This work builds upon the belief propagation algorithm for use in detecting collusion and other fraud schemes. We propose an algorithm called SNARE (Social Network Analysis for Risk Evaluation). By allowing one to use domain knowledge as well as link knowledge, the method was very successful for pinpointing misstated accounts in our sample of general ledger data, with a significant improvement over the default heuristic in true positive rates, and a lift factor of up to 6.5 (more than twice that of the default heuristic). We also apply SNARE to the task of graph labeling in general on publicly-available datasets. We show that with only some information about the nodes themselves in a network, we get surprisingly high accuracy of labels. Not only is SNARE applicable in a wide variety of domains, but it is also robust to the choice of parameters and highly scalable-linearly with the number of edges in a graph.