Populated IP addresses: classification and applications

Authors:
Chi-Yao Hong;Fang Yu;Yinglian Xie
Affiliations:
UIUC, Urbana, IL, USA;MSR Silicon Valley, Mountain View, CA, USA;MSR Silicon Valley, Mountain View, CA, USA
Venue:
Proceedings of the 2012 ACM conference on Computer and communications security
Year:
2012

Citing 25
Cited 0

C4.5: programs for machine learning

C4.5: programs for machine learning
An empirical study of spam traffic and the use of DNS black lists

Proceedings of the 4th ACM SIGCOMM conference on Internet measurement
Aberrant Behavior Detection in Time Series for Network Monitoring

LISA '00 Proceedings of the 14th USENIX conference on System administration
Worm Origin Identification Using Random Moonwalks

SP '05 Proceedings of the 2005 IEEE Symposium on Security and Privacy
Towards IP geolocation using delay and topology measurements

Proceedings of the 6th ACM SIGCOMM conference on Internet measurement
Reliability and security in the CoDeeN content distribution network

ATEC '04 Proceedings of the annual conference on USENIX Annual Technical Conference
Revealing botnet membership using DNSBL counter-intelligence

SRUTI'06 Proceedings of the 2nd conference on Steps to Reducing Unwanted Traffic on the Internet - Volume 2
Tor: the second-generation onion router

SSYM'04 Proceedings of the 13th conference on USENIX Security Symposium - Volume 13
Dryad: distributed data-parallel programs from sequential building blocks

Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
How dynamic are IP addresses?

Proceedings of the 2007 conference on Applications, technologies, architectures, and protocols for computer communications
A note on Platt's probabilistic outputs for support vector machines

Machine Learning
Filtering spam with behavioral blacklisting

Proceedings of the 14th ACM conference on Computer and communications security
Spamming botnets: signatures and characteristics

Proceedings of the ACM SIGCOMM 2008 conference on Data communication
LIBLINEAR: A Library for Large Linear Classification

The Journal of Machine Learning Research
BotGraph: large scale spamming botnet detection

NSDI'09 Proceedings of the 6th USENIX symposium on Networked systems design and implementation
De-anonymizing the internet using unreliable IDs

Proceedings of the ACM SIGCOMM 2009 conference on Data communication
Understanding block-level address usage in the visible internet

Proceedings of the ACM SIGCOMM 2010 conference
DryadLINQ: a system for general-purpose distributed data-parallel computing using a high-level language

OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation
Detecting spammers with SNARE: spatio-temporal network-level automatic reputation engine

SSYM'09 Proceedings of the 18th conference on USENIX security symposium
BotGrep: finding P2P bots with structured graph analysis

USENIX Security'10 Proceedings of the 19th USENIX conference on Security
Searching the searchers with searchaudit

USENIX Security'10 Proceedings of the 19th USENIX conference on Security
LIBSVM: A library for support vector machines

ACM Transactions on Intelligent Systems and Technology (TIST)
Peering through the shroud: the effect of edge opacity on ip-based client identification

NSDI'07 Proceedings of the 4th USENIX conference on Networked systems design & implementation
Estimating the number of users behind ip addresses for combating abusive traffic

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
BOTMAGNIFIER: locating spambots on the internet

SEC'11 Proceedings of the 20th USENIX conference on Security

Quantified Score

Hi-index	0.00

Visualization

Abstract

Populated IP addresses (PIP) -- IP addresses that are associated with a large number of user requests are important for online service providers to efficiently allocate resources and to detect attacks. While some PIPs serve legitimate users, many others are heavily abused by attackers to conduct malicious activities such as scams, phishing, and malware distribution. Unfortunately, commercial proxy lists like Quova have a low coverage of PIP addresses and offer little support for distinguishing good PIPs from abused ones. In this study, we propose PIPMiner, a fully automated method to extract and classify PIPs through analyzing service logs. Our methods combine machine learning and time series analysis to distinguish good PIPs from abused ones with over 99.6% accuracy. When applying the derived PIP list to several applications, we can identify millions of malicious Windows Live accounts right on the day of their sign-ups, and detect millions of malicious Hotmail accounts well before the current detection system captures them.