Publication:
Detecting Rare and Collective Anomalies in Network Traffic Data using Summarization

dc.contributor.advisor Mahmood, Abdun en_US
dc.contributor.advisor Maher, Michael en_US
dc.contributor.author Ahmed, Mohiuddin en_US
dc.date.accessioned 2022-03-15T11:25:00Z
dc.date.available 2022-03-15T11:25:00Z
dc.date.issued 2016 en_US
dc.description.abstract Network anomaly detection is becoming increasingly challenging due to the amount of global Internet traffic being produced at an unprecedented rate. This thesis identifies summarization as a key component for improving the scalability and accuracy of anomaly detection techniques. Instead of analysing a large amount of data to find anomalies, a summary of the data can be used for detection of anomalies. The goal of this thesis is to investigate three key research issues related to summarization based anomaly detection. The first research problem is identifying anomalies from large amount of data. When data size increases, the anomaly detection techniques perform poorly, due to increasing false alarms and computational cost. Detecting anomalies from a summary could address these issues, but existing summarization techniques cannot accurately represent the rare anomalies present in the dataset. This thesis proposes several summarization techniques based on sampling and partitional clustering that achieve significant improvement in anomaly detection accuracy and execution time over a wide range of benchmark datasets. In certain cyber-attack scenarios, such as flooding Denial of Service attacks, the data distribution changes significantly. This forms a collective anomaly, where some similar kinds of normal data instances appear in abnormally large numbers. Since they are not rare anomalies, existing anomaly detection techniques cannot properly identify them. The second research problem of the thesis investigates detecting this behaviour using a number of clustering and co-clustering based techniques. Experimental evaluation demonstrates that a Hurst parameter-based technique outperforms existing collective and rare anomaly detection techniques in terms of detection accuracy and false positive rate. Solutions of the two research problems are integrated into a general summarization based anomaly detection framework. Many online applications need to process and analyse continuously arriving streaming data, where data cannot be stored indefinitely or revisited. To address this problem, this thesis investigates new sampling based summarization techniques which demonstrate that the summaries produced from stream using pair-wise distance and template matching techniques can retain more anomalies than existing stream summarization techniques. en_US
dc.identifier.uri http://hdl.handle.net/1959.4/56990
dc.language English
dc.language.iso EN en_US
dc.publisher UNSW, Sydney en_US
dc.rights CC BY-NC-ND 3.0 en_US
dc.rights.uri https://creativecommons.org/licenses/by-nc-nd/3.0/au/ en_US
dc.subject.other Data Summarization en_US
dc.subject.other Cyber Attack en_US
dc.subject.other Anomoly Detection en_US
dc.subject.other Data Sstreams en_US
dc.subject.other Reservoir en_US
dc.subject.other Clustering en_US
dc.subject.other Rare Anomaly en_US
dc.subject.other Collective Anomaly en_US
dc.subject.other Network Traffic Analysis en_US
dc.subject.other Denial of Service en_US
dc.subject.other Flooding Attack en_US
dc.subject.other Unsupervised Learning en_US
dc.subject.other Sampling en_US
dc.title Detecting Rare and Collective Anomalies in Network Traffic Data using Summarization en_US
dc.type Thesis en_US
dcterms.accessRights open access
dcterms.rightsHolder Ahmed, Mohiuddin
dspace.entity.type Publication en_US
unsw.accessRights.uri https://purl.org/coar/access_right/c_abf2
unsw.date.embargo 2018-11-30 en_US
unsw.description.embargoNote Embargoed until 2018-11-30
unsw.identifier.doi https://doi.org/10.26190/unsworks/3066
unsw.relation.faculty UNSW Canberra
unsw.relation.originalPublicationAffiliation Ahmed, Mohiuddin, Engineering & Information Technology, UNSW Canberra, UNSW en_US
unsw.relation.originalPublicationAffiliation Mahmood , Abdun, Engineering & Information Technology, UNSW Canberra, UNSW en_US
unsw.relation.originalPublicationAffiliation Maher , Michael , Engineering & Information Technology, UNSW Canberra, UNSW en_US
unsw.relation.school School of Engineering and Information Technology *
unsw.thesis.degreetype PhD Doctorate en_US
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
public version.pdf
Size:
5.32 MB
Format:
application/pdf
Description:
Resource type