IJCATR Volume 11 Issue 3

An Email Spam Filtering Model Using Ensemble of Machine Learning Techniques

Aju Omojokun Gabriel, Adedeji Ayomiposi Joy
10.7753/IJCATR1103.1003
keywords : Spam Email, Email Filtering, Ensemble Machine Learning, Forward Propagation Training, Performance Metrics.

PDF
The growth of spam emails is on the increase responsible for larger portions of the global email traffics. Aside the annoyance and the time wasted sifting through the unwanted messages; spam emails can also cause immeasurable harms through malicious software capable of damaging systems and compromising confidential information. The risks of filtering spam emails is that sometimes, legitimate mails are marked as spam, yet the results of not filtering spam are the constant flood of spam clogs on networks that adversely impacts users inboxes while draining valuable resources on the networks such as bandwidth and storage capacity, productivity loss and interfere with the expedient delivery of legitimate emails. Several researchers had worked on the design of models for spam email filtering using different techniques, however the detection accuracy of these models have also become subject of discussions. This study developed spam email filtering model using Ensemble of Decision Tree, Support Vector Machine and Multilayer Perceptron (DT-SVM-MLP) technique as a solution approach to solving issues of low spam emails detection accuracy. The ensemble model was trained using forward propagation training technique and the performance was evaluated using five performance metrics of Accuracy, False Positive (FP) Rate, Precision, Recall and F-Measure.
@artical{a1132022ijcatr11031003,
Title = "An Email Spam Filtering Model Using Ensemble of Machine Learning Techniques ",
Journal ="International Journal of Computer Applications Technology and Research(IJCATR)",
Volume = "11",
Issue ="3",
Pages ="66 - 71",
Year = "2022",
Authors ="Aju Omojokun Gabriel, Adedeji Ayomiposi Joy"}
  • The paper develops an ensemble machine learning email spam filtering model.
  • An ensemble ( DT- MLP -SVM) algorithm is developed for the model.
  • The model is developed and trained in the Jupyter notebook environment using python.
  • 4) The performance evaluation of the model is carried out using five performance metrics.