Gibson, Simran, Issac, Biju, Zhang, Li and Jacob, Seibu Mary (2020) Detecting Spam Email With Machine Learning Optimized With Bio-Inspired Metaheuristic Algorithms. IEEE Access, 8. pp. 187914-187932. ISSN 2169-3536
|
Text
09222163.pdf - Published Version Available under License Creative Commons Attribution Non-commercial No Derivatives 4.0. Download (2MB) | Preview |
|
|
Text
Pre-print version.pdf - Accepted Version Download (2MB) | Preview |
Abstract
Electronic mail has eased communication methods for many organisations as well as individuals. This method is exploited for fraudulent gain by spammers through sending unsolicited emails. This article aims to present a method for detection of spam emails with machine learning algorithms that are optimized with bio-inspired methods. A literature review is carried to explore the efficient methods applied on different datasets to achieve good results. An extensive research was done to implement machine learning models using Naïve Bayes, Support Vector Machine, Random Forest, Decision Tree and Multi-Layer Perceptron on seven different email datasets, along with feature extraction and pre-processing. The bio-inspired algorithms like Particle Swarm Optimization and Genetic Algorithm were implemented to optimize the performance of classifiers. Multinomial Naïve Bayes with Genetic Algorithm performed the best overall. The comparison of our results with other machine learning and bio-inspired models to show the best suitable model is also discussed.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Machine Learning, Bio-inspired Algorithms, Cross-validation, Particle Swarm Optimiza-tion, Genetic Algorithm |
Subjects: | G400 Computer Science G900 Others in Mathematical and Computing Sciences |
Department: | Faculties > Engineering and Environment > Computer and Information Sciences |
Depositing User: | John Coen |
Date Deposited: | 12 Oct 2020 14:46 |
Last Modified: | 16 Dec 2022 16:00 |
URI: | https://nrl.northumbria.ac.uk/id/eprint/44488 |
Downloads
Downloads per month over past year