Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Automatic phishing website detection and prevention model using transformer deep belief network

Published: 01 December 2024 Publication History

Abstract

In the digitally connected world cybersecurity is paramount, phishing where attackers pose as trusted entities to steal sensitive data, looms large. The proliferation of phishing attacks on the internet poses a substantial threat to individuals and organizations, compromising sensitive information and causing financial and reputational damage. This study's goal is to establish an automated system for the early detection and prevention of phishing websites, thereby enhancing online security and protecting users from cyber threats. This research initially employs One Hot Encoding (OHE) mechanism-based pre-processing mechanism that converts every URL string into a numerical vector with a particular dimension. This study utilizes two feature selection techniques which are transfer learning-based feature extraction using DarkNet19 and Variational Autoencoder (VAE) to select the value of the most important feature. The robust security mechanisms are presented to prevent phishing attacks and safeguard personal information on websites. List-based deep learning-based systems to prevent and detect phishing URLs more efficiently. The study proposes a transformer-based Deep Belief Network (TB-DBN), a veritable pre-trained deep transformer network model for phishing behaviour detection. A cross-validation technique with grid search hyper-parameter tuning based on the Intelligence Binary Bat Algorithm (IBBA) was designed using the proposed hybrid model. Predictions were made to classify the phishing URLs using a probabilistic estimation guided boosting classifier model and evaluate their performance in terms of accuracy, precision, recall, specificity, and F1- score. The risk level associated with the URL will be assessed based on various factors, such as the source's reputation, content analysis results, and behavioural anomalies. The computational complexity of DL model training is influenced by various factors, such as the model's complexity, the training data's size, and the optimization algorithm exploited, for training. The outcome demonstrates that tweaking variables increases the effectiveness of Python-based deep learning systems. The findings of the proposed method excel, achieving an accuracy of 99.4 %, precision of 99.2 %, recall of 99.3 %, and an F1-score of 99.2 %. This innovative automatic phishing website detection and prevention model, based on a Transformer-based Deep Belief Network, offers advanced accuracy and adaptability, strengthening cybersecurity measures to safeguard sensitive user information and mitigate the substantial threat of phishing attacks in the digitally connected world.

References

[1]
Raghad Alshalan, Hend Al-Khalifa, A deep learning approach for automatic hate speech detection in the saudi twittersphere, Appl. Sci. 10 (23) (2020) 8614.
[2]
K. Gaurav, B.K. Singh, V. Kumar, Intelligent fault monitoring and reliability analysis in safety–critical systems of nuclear power plants using SIAO-CNN-ORNN, Multimed Tools Appl 83 (2024) 61287–61311,.
[3]
Dr. Anand Gudnavar, N. Manjanaik, Novel framework for enhancing data quality using data correlation factor in wireless sensor network, Int. J. Comput. Dig. Syst. (Scopus-Q3) 12 (1) (2022) 724–730,.
[4]
S. Jain, C. Gupta, A support vector machine learning technique for detection of phishing websites, in: 2023 6th International Conference on Information Systems and Computer Networks (ISCON), IEEE, 2023, pp. 1–6.
[5]
A.K. Jha, R. Muthalagu, P.M. Pawar, Intelligent phishing website detection using machine learning, Multimed. Tools Appl. (2023) 1–26.
[6]
R. Jha, G. Kunwar, Machine Learning based URL Analysis for Phishing Detection, in: 2023 6th International Conference on Information Systems and Computer Networks (ISCON), IEEE, 2023, pp. 1–5.
[7]
T. Kavitha, S. Hemalatha, R. Mounica, V. Niveda, Y. Kumar, A visionary approach to detect spoofing website using machine learning algorithms, in: 2023 International Conference on Computer Communication and Informatics (ICCCI), IEEE, 2023, pp. 1–6.
[8]
A.S. Mahajan, P.K. Navale, V.V. Patil, V.M. Khadse, P.N. Mahalle, The hybrid framework of ensemble technique in machine learning for phishing detection, Int. J. Inform. Comput. Secu. 21 (1–2) (2023) 162–184.
[9]
R. Mahajan, I. Siddavatam, Phishing website detection using machine learning algorithms, Int. J. Comput. Appl. 181 (23) (2018) 45–47.
[10]
Pranav Maneriker, et al., URLTran: improving phishing URL detection using transformers, in: MILCOM 2021-2021 IEEE Military Communications Conference (MILCOM), IEEE, 2021.
[11]
Viera Maslej-Krešňáková, et al., Comparison of deep learning models and various text pre-processing techniques for the toxic comments classification, Appl. Sci. 10 (23) (2020) 8631.
[12]
P. Mehdi Gholampour, R.M. Verma, Adversarial Robustness of Phishing Email Detection Models, in: Proceedings of the 9th ACM International Workshop on Security and Privacy Analytics, 2023, pp. 67–76.
[13]
S. Menaka, J. Harshika, S. Philip, R. John, N. Bharathiraja, S. Murugesan, Analysing the Accuracy of Detecting Phishing Websites using Ensemble Methods in Machine Learning, in: 2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS), IEEE, 2023, pp. 1251–1256.
[14]
R.O. Ogundokun, M.O. Arowolo, R. Damaševičius, S. Misra, Phishing Detection in Blockchain Transaction Networks Using Ensemble Learning, Telecom, 4, MDPI, 2023, pp. 279–297.
[15]
M.K. Prabakaran, P. Meenakshi Sundaram, A.D. Chandrasekar, An enhanced deep learning-based phishing detection mechanism to effectively identify malicious URLs using variational autoencoders, IET Inform. Secu. 17 (3) (2023) 423–440.
[16]
Rodríguez, Álvaro Ibrain, and Lara Lloret Iglesias. "Fake news detection using deep learning." arXiv preprint arXiv:1910.03496 (2019).
[17]
Sanjiban Sekhar Roy, et al., Multimodel phishing url detection using lstm, bidirectional lstm, and gru models, Future Internet 14 (11) (2022) 340.
[18]
K. Sadaf, Phishing Website detection using xgboost and catboost classifiers, in: 2023 International Conference on Smart Computing and Application (ICSCA), IEEE, 2023, pp. 1–6.
[19]
Ş. Şentürk, E. Yerli, İ. Soğukpınar, Email phishing detection and prevention by using data mining techniques, in: 2017 International Conference on Computer Science and Engineering (UBMK), IEEE, 2017, pp. 707–712.
[20]
H. Shirazi, K. Haefner, I. Ray, Fresh-phish: a framework for auto-detection of phishing websites, in: 2017 IEEE international conference on information reuse and integration (IRI), IEEE, 2017, pp. 137–143.
[21]
J. Srivastava, A. Sharan, Phishing Website Detection Based on Hybrid Resampling KMeansSMOTENCR and Cost-Sensitive Classification, in: Advances in Cognitive Science and Communications: Selected Articles from the 5th International Conference on Communications and Cyber-Physical Engineering (ICCCE 2022), Hyderabad, India, Singapore, Springer Nature Singapore, 2023, pp. 725–733.
[22]
G. Sujatha, M. Ayyannan, S.G. Priya, V. Arun, A.N. Arularasan, M.J. Kumar, Hybrid Optimization Algorithm to Mitigate Phishing URL Attacks in Smart Cities, in: 2023 3rd International Conference on Innovative Practices in Technology and Management (ICIPTM), IEEE, 2023, pp. 1–5.
[23]
J. Tzschoppe, H. Löhr, Browser-in-the-Middle-Evaluation of a modern approach to phishing, in: Proceedings of the 16th European Workshop on System Security, 2023, pp. 15–20.
[24]
Rubaiath E. Ulfath, et al., Hybrid CNN-GRU framework with integrated pre-trained language transformer for SMS phishing detection, in: The 5th International Conference on Future Networks & Distributed Systems, 2021.
[25]
n. Venkatesh, v. tejaswini, g. soumya, t.s. priya, Malicious URL detection using machine learning, Turkish J. Comput. Math. Educ. (turcomat) 14 (2) (2023) 537–552.
[26]
Vijjali, Rutvik, et al. "Two stage transformer model for COVID-19 fake news detection and fact checking." arXiv preprint arXiv:2011.13253 (2020).
[27]
Y. Wang, W. Zhu, H. Xu, Z. Qin, K. Ren, W. Ma, A Large-Scale Pretrained Deep Model for Phishing URL Detection, in: ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2023, pp. 1–5.
[28]
Xu, Pingfan. "A transformer-based model to detect phishing URLs." arXiv preprint arXiv:2109.02138 (2021).

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Computers and Security
Computers and Security  Volume 147, Issue C
Dec 2024
218 pages

Publisher

Elsevier Advanced Technology Publications

United Kingdom

Publication History

Published: 01 December 2024

Author Tags

  1. Phishing website
  2. Transformer-based deep belief networks
  3. One hot encoding
  4. Variation auto encoder
  5. DarkNet19
  6. Intelligence binary bat algorithm detection

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media