Fuzzy-import hashing: A static analysis technique for malware detection

Naik, Nitin, Jenkins, Paul, Savage, Nick, Yang, Longzhi, Boongoen, Tossapon and Iam-On, Natthakan (2021) Fuzzy-import hashing: A static analysis technique for malware detection. Forensic Science International: Digital Investigation, 37. p. 301139. ISSN 2666-2817

FSIDI-301139-Fuzzy-Import-Hashing-DrNaik.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives 4.0.

Download (706kB) | Preview
Official URL: https://doi.org/10.1016/j.fsidi.2021.301139


The advent of new malware types and their attack vectors poses serious challenges for security experts in discovering effective malware detection and analysis techniques. The preliminary step in malware analysis is filtering out samples of counterfeit malware from the suspicious samples by classifying them into most likely and unlikely malware categories. This will enable effective utilisation of resources and expertise for the most likely category of samples in subsequent stages and avoid nugatory effort. This process requires a very fast and resource-optimised method as it is applied on a large sample size. Fuzzy hashing and import hashing methods satisfy these requirements of malware analysis, though, with some limitations. Therefore, the proper integration of these methods, may overcome some of the limitations and improve the detection accuracy without affecting the overall performance of analysis. Hence, this paper proposes a fuzzy-import hashing technique, which is the integration of two methods, namely, fuzzy hashing and import hashing. This integration can offer several benefits such as an improved detection rate by complementing each other when one method cannot detect malware, then the other method can; and the generation of fuzzfied results for subsequent clustering or classification, as the import hashing result can be easily merged with the fuzzy hashing result. The success of this proposed fuzzy-import hashing method is demonstrated through several experiments namely: on the collected malware and goodware corpus; a comparative evaluation against the established YARA rules and application in fuzzy c-means clustering.

Item Type: Article
Uncontrolled Keywords: Malware analysis, Fuzzy-import hashing, Fuzzy hashing, Import hashing, YARA Rules, Fuzzy C-Means clustering, Ransomware
Subjects: G400 Computer Science
G500 Information Systems
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Depositing User: Elena Carlaw
Date Deposited: 06 Apr 2021 14:52
Last Modified: 01 Apr 2022 03:32
URI: http://nrl.northumbria.ac.uk/id/eprint/45869

Actions (login required)

View Item View Item


Downloads per month over past year

View more statistics