TY - JOUR
T1 - Improving malicious email detection through novel designated deep-learning architectures utilizing entire email
AU - Muralidharan, Trivikram
AU - Nissim, Nir
N1 - Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2023/1/1
Y1 - 2023/1/1
N2 - In today's email dependent world, cyber criminals often target organizations using a variety of social engineering techniques and specially crafted malicious emails. When successful, such attacks can result in significant harm to physical and digital systems and assets, the leakage of sensitive information, reputation damage, and financial loss. Despite the plethora of studies on the detection of phishing attacks and malicious links in emails, there are no solutions capable of effectively, quickly, and accurately coping with more complex email-based attacks, such as malicious email attachments. This paper presents the first fully automated malicious email detection framework using deep ensemble learning to analyze all email segments (body, header, and attachments); this eliminates the need for human expert intervention for feature engineering. In this paper, we also demonstrate how an ensemble framework of deep learning classifiers each of which are trained on specific portions of an email (thereby independently utilizing the entire email) can generalize better than popular email analysis methods that analyze just a specific portion of the email for analysis. The proposed framework is evaluated comprehensively and with an AUC of 0.993, the proposed framework's results surpass state-of-the-art malicious email detection methods, including human expert feature-based machine learning models by a TPR of 5%.
AB - In today's email dependent world, cyber criminals often target organizations using a variety of social engineering techniques and specially crafted malicious emails. When successful, such attacks can result in significant harm to physical and digital systems and assets, the leakage of sensitive information, reputation damage, and financial loss. Despite the plethora of studies on the detection of phishing attacks and malicious links in emails, there are no solutions capable of effectively, quickly, and accurately coping with more complex email-based attacks, such as malicious email attachments. This paper presents the first fully automated malicious email detection framework using deep ensemble learning to analyze all email segments (body, header, and attachments); this eliminates the need for human expert intervention for feature engineering. In this paper, we also demonstrate how an ensemble framework of deep learning classifiers each of which are trained on specific portions of an email (thereby independently utilizing the entire email) can generalize better than popular email analysis methods that analyze just a specific portion of the email for analysis. The proposed framework is evaluated comprehensively and with an AUC of 0.993, the proposed framework's results surpass state-of-the-art malicious email detection methods, including human expert feature-based machine learning models by a TPR of 5%.
KW - Analysis
KW - Deep learning
KW - Detection
KW - Email
KW - Malware
KW - Phishing
UR - http://www.scopus.com/inward/record.url?scp=85141781464&partnerID=8YFLogxK
U2 - 10.1016/j.neunet.2022.09.002
DO - 10.1016/j.neunet.2022.09.002
M3 - Article
C2 - 36371967
AN - SCOPUS:85141781464
SN - 0893-6080
VL - 157
SP - 257
EP - 279
JO - Neural Networks
JF - Neural Networks
ER -