|COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring.|
Extracting the Semantic Signature of Malware, Metamorphic Viruses and Worms
If you have a question about this talk, please contact Alan Mycroft.
[Shyam is visiting the CL until 14 October 2010.]
Malware is increasingly becoming a serious threat and a nuisance in the information and network age. Human experts extract (involves complex analysis of encrypted and/or packed binaries) a signature (usually a text pattern) of the malware and deploy it, to protect against a malware.
However, this approach does not work for polymorphic and metamorphic malware, which have the ability to change shape from attack to attack; also, metamorphic virus detection (even assuming fixed length) is NP-complete. To counter these advanced forms of malware we need semantic signatures which capture the essential behaviour of the malware (which remains unchanged across variants). In this talk, we present an algorithmic approach for extracting the semantic signature of a malware—as a regular expression over API calls—and demonstrate via experiments its efficacy in detecting and predicting malware variants. Our approach involves two steps. In the first step, we collect and abstract the behaviour (as a sequence of security relevant API /system calls) of the malware in different runs. In the second step, we inductively learn (under the supervision of a human expert) a regular expression that tightly fits these behaviours (generalizing where necessary). This regular expression then acts as the semantic signature of the malware. We performed experiments with the metamorphic virus Etap/Simile, and the email worms Beagle, Netsky and MyDoom.
Experimental results give us a good confidence that our approach can be effectively used for malware detection.
This talk is part of the Computer Laboratory Programming Research Group Seminar series.
This talk is included in these lists:
Note that ex-directory lists are not shown.
Other listsCambridge Past, Present & Future Asian Archaeology Group Organic Chemistry
Other talksThinking religion today with Gandhi Healthy neurocognitive aging with big data: A multivariate dive into Biobank (N=500,000) Genetic prediction of cardiovascular disease Local testability in group theory II Cultural History of Transparency Interplay between cellular senescence and reprogramming during tissue repair