Phishing Web Page Detection using Optimised Machine Learning

Stobbs, Jordan, Issac, Biju and Seibu, Mary Jacob (2020) Phishing Web Page Detection using Optimised Machine Learning. In: 2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom): 29 December 2020 – 1 January 2021 Guangzhou, China. IEEE, Piscataway, NJ, pp. 483-490. ISBN 9781665403931, 9781665403924

[img]
Preview
Text
Accepted Manuscript_Stobbs et al.pdf - Accepted Version

Download (545kB) | Preview
Official URL: https://doi.org/10.1109/trustcom50675.2020.00072

Abstract

Phishing is a type of social engineering attack that can affect any company or anyone. This paper explores the effect that different features and optimisation techniques have on the accuracy of intelligent phishing detection using machine learning algorithms. This paper explores both hyperparameter optimisation as well as feature selection optimisation. For hyperparameter tuning, both TPE (Tree-structured Parzen Estimator) and GA (Genetic Algorithm) were tested, with the best option being model dependent. For feature selection, GA, MFO (Moth Flame Optimisation) and PSO (Particle Swarm Optimisation) were used with PSO working best with a Random Forest model. This work used URL (Uniform Resource Locator), DOM (Document Object Model) structure, page rank and page information related features. This research found that the best combination was Random Forest using PSO for feature selection and TPE for hyperparameter optimisation, giving an accuracy of 99.33%.

Item Type: Book Section
Uncontrolled Keywords: phishing detection, bio-inspired optimisation anti-phishing optimisation
Subjects: G400 Computer Science
Department: Faculties > Engineering and Environment > Computer and Information Sciences
Related URLs:
Depositing User: John Coen
Date Deposited: 11 Nov 2020 10:01
Last Modified: 31 Jul 2021 15:51
URI: http://nrl.northumbria.ac.uk/id/eprint/44731

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics