ISSN :2582-9793

A Systematic Review of Artificial Intelligence Techniques for Phishing Detection

Review Article (Published On: 07-Aug-2025 )
DOI : https://doi.org/10.54364/AAIML.2025.53231

Luis Salazar and Anghelo Aguirre

Adv. Artif. Intell. Mach. Learn., 5 (3):4094-4114

1. Anghelo Aguirre: San Ignacio de Loyola University - Faculty of Engineering

2. Luis Salazar: National University of San Marcos

Download PDF Here Citation Info via Semantic Scholar

DOI: 10.54364/AAIML.2025.53231

Article History: Received on: 08-May-25, Accepted on: 01-Aug-25, Published on: 07-Aug-25

Corresponding Author: Luis Salazar

Email: lsalazarma@unmsm.edu.pe

Citation: Anghelo Aguirre, Luis Salazar. A Systematic Review of Artificial Intelligence Techniques for Phishing Detection. Advances in Artificial Intelligence and Machine Learning. 2025;5(3):231.


Abstract

    

This systematic review investigates the application and effectiveness of Artificial Intelligence techniques such as Machine Learning (ML), Deep Learning (DL) and Generative Artificial Intelligence (Gen AI) in the detection and/or prevention of Phishing attacks. The analysed studies were categorised into four groups: ML-based models, DL-based models, hybrid models (ML, DL, others), and Large Language Models (LLMs). The results reveal that none of the models consistently outperforms the others (regardless of context); however, performance depends on factors such as the quality of the dataset used, algorithm architecture, hyperparameter tuning and deployment environment. However, a clear trend emerged: DL and Gen AI-based models tend to achieve higher accuracy and stability, particularly in complex scenarios and when working with large or unstructured datasets. Techniques such as convolutional neural networks (CNN), short and long-term memory (LSTM) and temporal convolutional networks (TCN) demonstrated exceptional performance, achieving accuracies above 99%. Similarly, tuned LLMs such as GPT-2-medium and Llama-3-8b-instruct  showed strong classification capabilities in phishing detection tasks. In contrast, traditional ML algorithms such as Random Forest and XGBoost performed well in structured and balanced data environments, but exhibited limitations in adaptivity and semantic representation. The findings emphasise the importance of a holistic approach that considers not only the algorithm itself, but also the quality of the input data, computational resources and the practical context of the implementation. This review aims to provide relevant information on modern AI techniques to strengthen cybersecurity measures against evolving phishing threats.

Statistics

   Article View: 2532
   PDF Downloaded: 57