Computer systems that learn: classification and prediction methods from statistics, neural nets, machine learning, and expert systems
Intrusion detection
Data mining: concepts and techniques
Data mining: concepts and techniques
Locally adaptive dimensionality reduction for indexing large time series databases
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Machine Learning
Efficient Similarity Search In Sequence Databases
FODO '93 Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms
k-nearest Neighbor Classification on Spatial Data Streams Using P-trees
PAKDD '02 Proceedings of the 6th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Using Text Categorization Techniques for Intrusion Detection
Proceedings of the 11th USENIX Security Symposium
Similarity between Euclidean and cosine angle distance for nearest neighbor queries
Proceedings of the 2004 ACM symposium on Applied computing
Intrusion detection using sequences of system calls
Journal of Computer Security
Customer-adapted coupon targeting using feature selection
Expert Systems with Applications: An International Journal
A sense of self for Unix processes
SP'96 Proceedings of the 1996 IEEE conference on Security and privacy
A cryptography based privacy preserving solution to mine cloud data
Proceedings of the Third Annual ACM Bangalore Conference
A New Similarity Metric for Sequential Data
International Journal of Data Warehousing and Mining
Hi-index | 0.00 |
With the enormous growth of data, which exhibit sequentiality, it has become important to investigate the impact of embedded sequential information within the data. Sequential data are growing enormously, hence an efficient classification of sequential data is needed. k-Nearest Neighbor (kNN) has been used and proved to be an efficient classification technique for two-class problems. This paper uses sliding window approach to extract sub-sequences of various lengths and classification using kNN. We conducted experiments on DARPA 98 IDS dataset using various distance/similarity measures such as Jaccard similarity, Cosine similarity, Euclidian distance and Binary Weighted Cosine (BWC) measure. Our results demonstrate that sub-sequence information enhances kNN classification accuracy for sequential data, irrespective of the distance/similarity metric used.