Anagram: a content anomaly detector resistant to mimicry attack

Authors:
Ke Wang;Janak J. Parekh;Salvatore J. Stolfo
Affiliations:
Computer Science Department, Columbia University, New York, NY;Computer Science Department, Columbia University, New York, NY;Computer Science Department, Columbia University, New York, NY
Venue:
RAID'06 Proceedings of the 9th international conference on Recent Advances in Intrusion Detection
Year:
2006

Citing 27
Cited 58

Universal one-way hash functions and their cryptographic applications

STOC '89 Proceedings of the twenty-first annual ACM symposium on Theory of computing
Space/time trade-offs in hash coding with allowable errors

Communications of the ACM
Characterizing the behavior of a program using multiple-length N-grams

Proceedings of the 2000 workshop on New security paradigms
Service specific anomaly detection for network intrusion detection

Proceedings of the 2002 ACM symposium on Applied computing
Mimicry attacks on host-based intrusion detection systems

Proceedings of the 9th ACM conference on Computer and communications security
Specification-based anomaly detection: a new approach for detecting network intrusions

Proceedings of the 9th ACM conference on Computer and communications security
"Why 6?" Defining the Operational Limits of Stide, an Anomaly-Based Intrusion Detector

SP '02 Proceedings of the 2002 IEEE Symposium on Security and Privacy
Intrusion Detection via Static Analysis

SP '01 Proceedings of the 2001 IEEE Symposium on Security and Privacy
Shield: vulnerability-driven network filters for preventing known vulnerability exploits

Proceedings of the 2004 conference on Applications, technologies, architectures, and protocols for computer communications
Polygraph: Automatically Generating Signatures for Polymorphic Worms

SP '05 Proceedings of the 2005 IEEE Symposium on Security and Privacy
Fast and automated generation of attack signatures: a basis for building self-protecting servers

Proceedings of the 12th ACM conference on Computer and communications security
On deriving unknown vulnerabilities from zero-day polymorphic and metamorphic worm exploits

Proceedings of the 12th ACM conference on Computer and communications security
Can machine learning be secure?

ASIACCS '06 Proceedings of the 2006 ACM Symposium on Information, computer and communications security
Privacy-preserving payload-based correlation for accurate malicious traffic detection

Proceedings of the 2006 SIGCOMM workshop on Large-scale attack defense
Building a reactive immune system for software services

ATEC '05 Proceedings of the annual conference on USENIX Annual Technical Conference
Automated worm fingerprinting

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Static analysis of executables to detect malicious patterns

SSYM'03 Proceedings of the 12th conference on USENIX Security Symposium - Volume 12
Autograph: toward automated, distributed worm signature detection

SSYM'04 Proceedings of the 13th conference on USENIX Security Symposium - Volume 13
SigFree: a signature-free buffer overflow attack blocker

USENIX-SS'06 Proceedings of the 15th conference on USENIX Security Symposium - Volume 15
StackGuard: automatic adaptive detection and prevention of buffer-overflow attacks

SSYM'98 Proceedings of the 7th conference on USENIX Security Symposium - Volume 7
Undermining an anomaly-based intrusion detection system using common exploits

RAID'02 Proceedings of the 5th international conference on Recent advances in intrusion detection
A sense of self for Unix processes

SP'96 Proceedings of the 1996 IEEE conference on Security and privacy
Application communities: using monoculture for dependability

HotDep'05 Proceedings of the First conference on Hot topics in system dependability
FLIPS: hybrid adaptive intrusion prevention

RAID'05 Proceedings of the 8th international conference on Recent Advances in Intrusion Detection
Polymorphic worm detection using structural information of executables

RAID'05 Proceedings of the 8th international conference on Recent Advances in Intrusion Detection
Anomalous payload-based worm detection and signature generation

RAID'05 Proceedings of the 8th international conference on Recent Advances in Intrusion Detection
A dynamic mechanism for recovering from buffer overflow attacks

ISC'05 Proceedings of the 8th international conference on Information Security

Privacy-preserving payload-based correlation for accurate malicious traffic detection

Proceedings of the 2006 SIGCOMM workshop on Large-scale attack defense
On the infeasibility of modeling polymorphic shellcode

Proceedings of the 14th ACM conference on Computer and communications security
Data sanitization: improving the forensic utility of anomaly detection systems

HotDep'07 Proceedings of the 3rd workshop on on Hot Topics in System Dependability
High-speed detection of unsolicited bulk emails

Proceedings of the 3rd ACM/IEEE Symposium on Architecture for networking and communications systems
BotHunter: detecting malware infection through IDS-driven dialog correlation

SS'07 Proceedings of 16th USENIX Security Symposium on USENIX Security Symposium
Anomaly-based fault detection in pervasive computing system

Proceedings of the 5th international conference on Pervasive services
A Study of Malcode-Bearing Documents

DIMVA '07 Proceedings of the 4th international conference on Detection of Intrusions and Malware, and Vulnerability Assessment
Traffic Aggregation for Malware Detection

DIMVA '08 Proceedings of the 5th international conference on Detection of Intrusions and Malware, and Vulnerability Assessment
Behavior-Based Network Access Control: A Proof-of-Concept

ISC '08 Proceedings of the 11th international conference on Information Security
Automatic feature selection for anomaly detection

Proceedings of the 1st ACM workshop on Workshop on AISec
Eureka: A Framework for Enabling Static Malware Analysis

ESORICS '08 Proceedings of the 13th European Symposium on Research in Computer Security: Computer Security
A Self-learning System for Detection of Anomalous SIP Messages

Principles, Systems and Applications of IP Telecommunications. Services and Security for Next Generation Networks
Boosting Web Intrusion Detection Systems by Inferring Positive Signatures

OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part II on On the Move to Meaningful Internet Systems
Online Network Forensics for Automatic Repair Validation

IWSEC '08 Proceedings of the 3rd International Workshop on Security: Advances in Information and Computer Security
Incorporation of Application Layer Protocol Syntax into Anomaly Detection

ICISS '08 Proceedings of the 4th International Conference on Information Systems Security
McPAD: A multiple classifier system for accurate payload-based anomaly detection

Computer Networks: The International Journal of Computer and Telecommunications Networking
Using Artificial Intelligence for Intrusion Detection

Proceedings of the 2007 conference on Emerging Artificial Intelligence Applications in Computer Engineering: Real Word AI Systems with Applications in eHealth, HCI, Information Retrieval and Pervasive Technologies
On Improving the Accuracy and Performance of Content-Based File Type Identification

ACISP '09 Proceedings of the 14th Australasian Conference on Information Security and Privacy
Active and Semi-supervised Data Domain Description

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Feature Selection for Density Level-Sets

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Keep your friends close: the necessity for updating an anomaly sensor with legitimate environment changes

Proceedings of the 2nd ACM workshop on Security and artificial intelligence
Active learning for network intrusion detection

Proceedings of the 2nd ACM workshop on Security and artificial intelligence
Panacea: Automating Attack Classification for Anomaly-Based Network Intrusion Detection Systems

RAID '09 Proceedings of the 12th International Symposium on Recent Advances in Intrusion Detection
Protecting a Moving Target: Addressing Web Application Concept Drift

RAID '09 Proceedings of the 12th International Symposium on Recent Advances in Intrusion Detection
Adaptive Anomaly Detection via Self-calibration and Dynamic Updating

RAID '09 Proceedings of the 12th International Symposium on Recent Advances in Intrusion Detection
TokDoc: a self-healing web application firewall

Proceedings of the 2010 ACM Symposium on Applied Computing
Emulation-based detection of non-self-contained polymorphic shellcode

RAID'07 Proceedings of the 10th international conference on Recent advances in intrusion detection
Client-side detection of XSS worms by monitoring payload propagation

ESORICS'09 Proceedings of the 14th European conference on Research in computer security
Lightweight static analysis to detect polymorphic exploit code with static analysis resistant technique

ICC'09 Proceedings of the 2009 IEEE international conference on Communications
A behaviour study of network-aware stealthy worms

ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Payload modeling for network intrusion detection systems

MILCOM'09 Proceedings of the 28th IEEE conference on Military communications
Machine learning in adversarial environments

Machine Learning
Mimimorphism: a new approach to binary code obfuscation

Proceedings of the 17th ACM conference on Computer and communications security
Cyber-critical infrastructure protection using real-time payload-based anomaly detection

CRITIS'09 Proceedings of the 4th international conference on Critical information infrastructures security
KIDS: keyed intrusion detection system

DIMVA'10 Proceedings of the 7th international conference on Detection of intrusions and malware, and vulnerability assessment
Behavior-based worm detectors compared

RAID'10 Proceedings of the 13th international conference on Recent advances in intrusion detection
Cujo: efficient detection and prevention of drive-by-download attacks

Proceedings of the 26th Annual Computer Security Applications Conference
Enhancing Intrusion Detection System with proximity information

International Journal of Security and Networks
A two-tier system for web attack detection using linear discriminant method

ICICS'10 Proceedings of the 12th international conference on Information and communications security
Summary-invisible networking: techniques and defenses

ISC'10 Proceedings of the 13th international conference on Information security
Classification of packet contents for malware detection

Journal in Computer Virology
A multilayer overlay network architecture for enhancing IP services availability against dos

ICISS'11 Proceedings of the 7th international conference on Information Systems Security
Cross-Domain collaborative anomaly detection: so far yet so close

RAID'11 Proceedings of the 14th international conference on Recent Advances in Intrusion Detection
Adversarial support vector machine learning

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Data flow analysis for anomaly detection and identification toward resiliency in extreme scale systems

The Journal of Supercomputing
Query strategies for evading convex-inducing classifiers

The Journal of Machine Learning Research
Frankenstein: stitching malware from benign binaries

WOOT'12 Proceedings of the 6th USENIX conference on Offensive Technologies
N-Gram against the machine: on the feasibility of the n-gram network analysis for binary protocols

RAID'12 Proceedings of the 15th international conference on Research in Attacks, Intrusions, and Defenses
Improvement of an anagram based NIDS by reducing the storage space of bloom filters (poster abstract)

RAID'12 Proceedings of the 15th international conference on Research in Attacks, Intrusions, and Defenses
Concurrency optimization for NIDS (poster abstract)

RAID'12 Proceedings of the 15th international conference on Research in Attacks, Intrusions, and Defenses
Wild-Inspired Intrusion Detection System Framework for High Speed Networks f|p IDS Framework

International Journal of Information Security and Privacy
Scalable fine-grained behavioral clustering of HTTP-based malware

Computer Networks: The International Journal of Computer and Telecommunications Networking
Review Article: RePIDS: A multi tier Real-time Payload-based Intrusion Detection System

Computer Networks: The International Journal of Computer and Telecommunications Networking
Machine-oriented biometrics and cocooning for dynamic network defense

Proceedings of the Eighth Annual Cyber Security and Information Intelligence Research Workshop
Security analysis of online centroid anomaly detection

The Journal of Machine Learning Research
Administrative evaluation of intrusion detection system

Proceedings of the 2nd annual conference on Research in information technology
Toward supervised anomaly detection

Journal of Artificial Intelligence Research
A close look on n-grams in intrusion detection: anomaly detection vs. classification

Proceedings of the 2013 ACM workshop on Artificial intelligence and security

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present Anagram, a content anomaly detector that models a mixture ofhigh-order n-grams (n 1) designed to detect anomalous and “suspicious” network packet payloads. By using higher-order n-grams, Anagram can detect significant anomalous byte sequences and generate robust signatures of validated malicious packet content. The Anagram content models are implemented using highly efficient Bloom filters, reducing space requirements and enabling privacy-preserving cross-site correlation. The sensor models the distinct content flow of a network or host using a semi-supervised training regimen. Previously known exploits, extracted from the signatures of an IDS, are likewise modeled in a Bloom filter and are used during training as well as detection time. We demonstrate that Anagram can identify anomalous traffic with high accuracy and low false positive rates. Anagram's high-order n-gram analysis technique is also resilient against simple mimicry attacks that blend exploits with “normal” appearing byte padding, such as the blended polymorphic attack recently demonstrated in [1]. We discuss randomized n-gram models, which further raises the bar and makes it more difficult for attackers to build precise packet structures to evade Anagram even if they know the distribution of the local site content flow. Finally, Anagram's speed and high detection rate makes it valuable not only as a standalone sensor, but also as a network anomaly flow classifier in an instrumented fault-tolerant host-based environment; this enables significant cost amortization and the possibility of a “symbiotic” feedback loop that can improve accuracy and reduce false positive rates over time.