Learning Classifiers for Misuse and Anomaly Detection Using a Bag of System Calls Representation

Kang, Dae-Ki; Fuller, Doug; Honavar, Vasant

Learning Classifiers for Misuse and Anomaly Detection Using a Bag of System Calls Representation

File

isi05.pdf (183.48 KB)

Date

2005-01-01

Authors

Kang, Dae-Ki

Fuller, Doug

Honavar, Vasant

Organizational Units

Organizational Unit

Computer Science

Department

Computer Science

Abstract

In this paper, we propose a ``bag of system calls'' representation for intrusion detection in system call sequences and describe misuse and anomaly detection results with standard machine learning techniques on University of New Mexico (UNM) and MIT Lincoln Lab (MIT LL) system call sequences with the proposed representation. With the feature representation as input, we compare the performance of several machine learning techniques for misuse detection and show experimental results on anomaly detection. The results show that standard machine learning and clustering techniques on simple ``bag of system calls'' representation of system call sequences is effective and often performs better than those approaches that use foreign contiguous subsequences in detecting intrusive behaviors of compromised processes.