Survey							
                            
		                
		                * Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Distributed Network Intrusion Detection
An Immunological Approach
Steven Hofmeyr
Stephanie Forrest
Patrik D’haeseleer
Dept. of Computer Science
University of New Mexico
Albuquerque, NM
{steveah, forrest, patrik}@cs.unm.edu
http://cs.unm.edu/~steveah/research.html
Introduction
• Intrusion detection:
– Assume that systems are not secure.
– Attempt to detect violations of security policy (intrusions) by monitoring and
analyzing system behavior.
– Construct a model of normal behavior and look for deviations from the model
(anomaly detection).
• Building the model (defining self):
– TCP/IP traffic over a broadcast LAN.
– Based on Network Security Monitor (NSM).
• Every computer on the network should participate in IDS:
– Distributed detection
– Use negative-selection algorithm
• Diversity of protection:
– Permutation masks
Background: Defining Self
•
NSM: Network Security Monitor (UCDavis)
{Mukherjee et al. Network Intrusion Detection. IEEE Network, pp26-41, 1994}
External host
Internal host
10.10.10.2
Datapath triple
(10.10.10.2, 20.20.20.5, ftp)
Broadcast
LAN
•
The right approach:
–
–
–
–
•
Anomaly detection
Sparsely connected graph
Normal patterns reasonably stable
Attackers highly likely to perturb graph
Disadvantages:
– Heavyweight
– Single point of failure
– Not scalable
20.20.20.5
The Biological Viewpoint
•
•
•
Self (proteins) = normal datapath triples
Nonself (proteins) = triples generated during an attack
Universe = Self  Nonself
•
Anomaly detection:
– Detection system trained on self
– Detection system classifies new triples as self (normal) or nonself
(anomalous)
•
NSM: a single monolithic detector matching self (positive detection)
How the Immune System
Distributes Detection
•
Immune system: Many small detectors matching nonself (negative
detection).
•
Advantages of distributed negative detection:
–
–
–
–
–
Localized (no communication costs)
Scalable
Tunable
Robust (no single point of failure)
Negative selection algorithm minimizes false positives
The Negative Selection Algorithm
1. Randomly generate a
detector string.
2. Does the detector string
match self?
3. If no, accept
If yes, go to 1.
(regenerate).
Results in a set of valid detectors
NO
YES
ACCEPT
REJECT
Applying Negative Detection to Network Traffic
•
Representation:
– SYN packet triples mapped to 49-bit strings
•
Generalized detection:
– Partial matching with r-contiguous bits rule
Triple
0110100101
0100110100
Detector
1110111101
1110111101
Match
No Match
r=4
•
Consequences of Partial Matching:
– Advantage: Lightweight (few detectors per host)
– Disadvantage: Holes limit detection
Holes
Overcoming Holes
•
•
Problem: Holes limit detection for any partial match rule.
Solution: A different permutation mask for each host.
•
Result: In the broadcast network, detection is limited by the intersection of all
hole sets.
Experimental Setup
• UNM CS subnet of 50 machines on a switched segment.
– 100 49-bit string detectors per machine
• Training set (self):
– Collected over 43 days
– 1 266 000 TCP SYN packets
– 3763 unique binary self strings
• Normal test set (supposedly self):
– Collected over 7 days
– 182 629 TCP SYN packets
– 626 unique binary self strings
• Abnormal test set (nonself):
– 8 different incidents, 7 real occurrences, 1 synthetic
– Real abnormal behavior includes: massive portscanning, limited
probing, address-space probing, local host compromise
– Synthetic: 200 random connections between internal (LAN) hosts
Experimental Results
•
Normal is reasonably stable.
Title:
Cre at or:
Prev ie w :
This EPS pict ure w a s no t s av ed
w ith a pre view inc lu de d in it.
Co mmen t:
This EPS pict ure w ill p rin t to a
Pos tSc rip t p rin ter, bu t n ot to
o the r ty pe s of p rin te rs.
•
Low false positives:
– P(false positive per self string) = 0.000304
– 55 strings, but only 10 unique
– Effectively: under 2 false alarms per day
•
High detection rates with few detectors
– 100% successful detection: 8 out of 8 abnormal incidents detected
– Only 100 detectors per host
•
Permutation masks improve detection
– Up to an order of magnitude improvement
– Overcomes hole limitation
The Problem of Incomplete Self Sets
(Suppose the training set is incomplete)
• Activation threshold:
–
–
–
–
–
Detector is not activated on every match.
Must have exceeded x matches before activation.
No time horizon.
Helps with stealth attacks (distributed in time).
Reduced false positives by an order of magnitude.
• Adaptive activation:
– Tune local activation thresholds dynamically.
– Whenever a detector matches its first pattern, the activation
threshold for that computer is reduced by 1.
– Has a time horizon (threshold gradually returns to default value).
– Hypothesized to help with distributed coordinated attacks.
Experimental Results
Intrusions with and without permutation masks
Fraction
nonself
Phear
1.00
Cartan
0.44
Dt03ln93 0.17
Anomaly Signal
Threshold 1
Threshold 10
permutation No permutation
1.00
0.50
0.09
0.44
0.43
0.34
0.17
0.16
0.16
Xtream
Cougar
Sauron
Pc35nl
0.62
0.54
0.10
1.00
0.62
0.58
0.10
1.00
0.59
0.53
0.09
0.84
0.61
0.49
0.09
0.43
Synthetic 1.00
0.94
0.33
0.01
Incident
Experimental and Theoretical Results:
Permutation Masks Overcome the Hole Limit
Title:
Creator:
Prev iew :
This EPS picture w as not s av ed
w ith a preview inc luded in it.
Comment:
This EPS picture w ill print to a
Pos tSc ript printer, but not to
other ty pes of printers.
Pushing the Immune Metaphor
• The analogy thus far:
– Distributed networks and immunology
– Combining negative detection and network
intrusion detection
– Diversity via permutation masks
• For the future:
– Distributed generation of detectors
– Dynamic detector sets
– Adaptation and memory (misuse detection)