Deep Learning Techniques for Spam URL Detection in Emails

Authors

  • Priyanshi, Indian Institute of Information Technology Guwahati (IIITG), Assam, India

DOI:

https://doi.org/10.63345/ijarcse.v1.i2.305

Keywords:

deep learning; spam URL detection; email security; sequence modeling; transformer encoder

Abstract

With the exponential growth of email communication, malicious actors increasingly embed harmful URLs in spam messages to phish, distribute malware, or facilitate fraud. Traditional rule‐based and shallow machine‐learning approaches struggle to generalize to novel URL patterns and obfuscation techniques. Deep learning, with its capacity for hierarchical feature extraction and sequence modeling, offers a promising solution for robust spam URL detection. This manuscript presents a comprehensive study of multiple deep neural architectures—including Convolutional Neural Networks (CNNs), Long Short‐Term Memory networks (LSTMs), and transformer‐based models—applied to the task of identifying spam URLs in email corpora. We detail a pipeline encompassing data collection and labeling, URL tokenization, character‐level and word‐level embeddings, and model training via stratified k-fold cross-validation. Statistical comparisons are conducted using one-way ANOVA and post-hoc testing to assess performance differentials among models.

A simulation environment is developed to mimic real-world email traffic with configurable spam injection rates, enabling assessment of detection latency and throughput under varying load conditions. Results demonstrate that transformer‐based encoders achieve peak detection accuracy (95.8 % ± 0.9 %) and F1-score (0.956 ± 0.008), significantly outperforming CNN (92.3 % ± 1.2 %) and LSTM (93.1 % ± 1.0 %) baselines. The conclusions underscore the trade-offs between detection performance, computational cost, and real-time applicability, offering guidelines for deployment in enterprise email security gateways.
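The preprocessing steps named in the abstract (character-level URL encoding and stratified k-fold splitting) can be illustrated with a minimal sketch. It is not the author's implementation: the helper names `build_vocab`, `encode_url`, and `stratified_kfold`, the `max_len` value, and the reserved padding and unknown ids are all assumptions made for illustration, and only the Python standard library is used.

```python
import random
from collections import defaultdict

PAD, UNK = 0, 1  # assumed reserved ids for padding and unseen characters


def build_vocab(urls):
    """Map every character seen in the training URLs to an integer id >= 2."""
    chars = sorted({ch for url in urls for ch in url})
    return {ch: i + 2 for i, ch in enumerate(chars)}


def encode_url(url, vocab, max_len=64):
    """Truncate or right-pad a URL to max_len character ids."""
    ids = [vocab.get(ch, UNK) for ch in url[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


def stratified_kfold(labels, k=5, seed=0):
    """Yield (train_indices, test_indices) pairs that keep the class ratio
    roughly equal across folds by dealing each class out round-robin."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)

    for i in range(k):
        test = sorted(folds[i])
        train = sorted(
            idx for j, fold in enumerate(folds) if j != i for idx in fold
        )
        yield train, test
```

The fixed-length id sequences produced by `encode_url` are the kind of input an embedding layer in a CNN, LSTM, or transformer encoder would consume. In practice a library routine such as scikit-learn's `StratifiedKFold` would normally replace the hand-written splitter.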

Published

2025-06-07

How to Cite

Priyanshi. “Deep Learning Techniques for Spam URL Detection in Emails”. International Journal of Advanced Research in Computer Science and Engineering (IJARCSE) 1, no. 2 (June 7, 2025): 27–33. Accessed October 19, 2025. https://ijarcse.org/index.php/ijarcse/article/view/59.