Abstract
Phishing is the process of portraying malignant web pages in the place of genuine web pages to obtain important and delicate information from the end-user. Nowadays phishing is considered as one of the most serious threats to web security. Most of the existing techniques for phishing detection use Bayesian classification for differentiating malignant web pages from genuine web pages. These methods work well if a dataset contains less no of web pages and they provide accuracy up to 90 percent. In recent years the size of the web is increasing tremendously and the existing methods have not provideda good enough accuracy for large datasets. So this paper proffers an innovative approach to identify phishing websites using hyperlinks available in the source code of the HTML page in the corresponding website. The proposed method uses a feature vector with 30 parameters to detect malignant web pages. These features are used in training the supervised Deep Neural Network model with Adam optimizer for differentiating fraudulent websites from genuine websites. The proposed deep learning model with Adam Optimizer uses a Listwise approach to classify phishing websites and genuine websites. The performance of the proposed approach is decent when compared to other traditional machine learning approaches like SVM, Adaboost, AdaRank. The results show that the proposed approach provides more accurate results in detecting phishing websites.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Buzuraiq, S., Alkasassbeh M., and Almseidin, M. (2020) “Intelligent Methods for Accurately Detecting Phishing Websites,” in 2020 11th International Conference on Information and Communication Systems (ICICS), 085–090 https://doi.org/10.1109/ICICS49469.2020.239509.
Adebowale, M. A., Lwin, K. T., Sánchez, E., & Hossain, M. A. (2019). Intelligent web-phishing detection and protection scheme using integrated features of Images, frames and text. Expert Systems with Applications, 115, 300–313. https://doi.org/10.1016/j.eswa.2018.07.067.
MohithGowda, H. R., Adithya, M. V., & Vinay S. (2020). Development of anti-phishing browser based on random forest and rule of extraction framework. Cyber Security, 3(1), 1–14. https://doi.org/10.1186/s42400-020-00059-1.
Prakash, P., Kumar, M., Kompella, R. R., & Gupta, M., (2010). Phishnet: predictive blacklisting to detect phishing attacks. In 2010 proceedings IEEE INFOCOM (pp. 1–5). https://doi.org/10.1109/INFCOM.2010.5462216.
Basit, A., Zafar, M., Liu, X., Javed, A. R., Jalil, Z., & Kifayat, K. (2020). A comprehensive survey of AI-enabled phishing attacks detection techniques. Telecommunication System. https://doi.org/10.1007/s11235-020-00733-2.
Khan, S. A., Khan, W., & Hussain, A. (2020). Phishing attacks and websites classification using machine learning and multiple datasets (A comparative analysis). In International Conference on Intelligent Computing (pp. 301–313). Cham: Springer. https://doi.org/10.1007/978-3-030-60796-8_26.
Sonowal, G. (2020). Phishing email detection based on binary search feature selection. SN Computer Science, 1(4), 191. https://doi.org/10.1007/s42979-020-00194-z.
Saravanan, P., & Subramanian, S. (2020). A Framework for detecting phishing websites using GA based feature selection and ARTMAP based website classification. ProcediaComputer Science, 171, 1083–1092. https://doi.org/10.1016/j.procs.2020.04.116.
“How to detect search engine spam - News - Elsevier.” https://www.journals.elsevier.com/ knowledge-based-systems/news/how-to-detect-search-engine-spam (Accessed Dec. 11, 2020).
Naresh Kumar, D., Hemanth, N. S. R., Premnath, S., Nishanth Kumar, V., Uma, S. (2020) Detection of phishing websites using an efficient machine learning framework. International Journal of Engineering Research and Technology, 9(5). https://doi.org/10.17577/IJERTV9IS050888.
Moorthy, R. S., & Pabitha, P. (2020). Optimal detection of phising attack using SCA based K-NN. ProcediaComput. Sci., 171, 1716–1725. https://doi.org/10.1016/j.procs.2020.04.184.
Lakshmi, V. S., & Vijaya, M. S. (2012). Efficient prediction of phishing websites using supervised learning algorithms. Procedia Engineering, 30, 798–805. https://doi.org/10.1016/j.proeng.2012.01.930.
Subasi, A., & Kremic, E. (2020). Comparison of Adaboost with MultiBoosting for Phishing Website Detection. ProcediaComputer Science, 168, 272–278. https://doi.org/10.1016/j.procs.2020.02.251.
Lakshmi, L., Reddy, P. B., & Bindu, C. S. (2018) SLOLAR: Scalable listwise online learning algorithm for ranking. Journal of Advanced Research in Dynamical and Control Systems, 10(3), 755–764.
MotiurRahman, S. S. M., Islam, T., & Jabiullah, Md. I. (2020). PhishStack: Evaluation of stacked generalization in phishing URLs detection. ProcediaComputer Science, 167, 2410–2418. https://doi.org/10.1016/j.procs.2020.03.294.
Baykara, M. and Gürel Z. Z. (2018) “Detection of phishing attacks,” in 2018 6th International Symposium on Digital Forensic and Security (ISDFS), 1–5 https://doi.org/10.1109/ISDFS.2018.8355389.
Li, X., Geng, G., Yan, Z., Chen, Y., & Lee, X. (2016) “Phishing detection based on newly registered domains,” in 2016 IEEE International Conference on Big Data (Big Data), 3685–3692 https://doi.org/10.1109/BigData.2016.7841036.
Basnet, R., Mukkamala, S., & Sung, A. H. (2008). Detection of Phishing Attacks: A Machine Learning Approach. In B. Prasad (Ed.), Soft Computing Applications in Industry (pp. 373–383). Berlin, Heidelberg: Springer.
Pham, T. A., Nguyen, Q. U., & Nguyen, X. H. (2014). Phishing Attacks Detection Using Genetic Programming. In Knowledge and Systems Engineering (pp. 185–195). Cham: Springer. https://doi.org/10.1007/978-3-319-02821-7_18.
“[PDF] Phishing detection: A recent intelligent machine learning comparison based on models content and features | Semantic Scholar.” https://www.semanticscholar.org/paper/Phishing-detection%3A-A-recent-intelligent-machine-on-Abdelhamid-Thabtah/403a3f5a3eb7c94e2a33c6e60161cb4c568467dc (Accessed Dec. 11, 2020).
Li, T., Kou, G., & Peng, Y. (2020). Improving malicious URLs detection via feature engineering: Linear and nonlinear space transformation methods. Information Systems., 91, 101494. https://doi.org/10.1016/j.is.2020.101494.
“Intelligent phishing url detection using association rule mining | Human-centric Computing and Information Sciences | Full Text.” https://hcis-journal.springeropen.com/articles/https://doi.org/10.1186/s13673-016-0064-3 (Accessed Dec. 11 2020).
Jain, A. K., & Gupta, B. B. (2019). A machine learning based approach for phishing detection using hyperlinks information. Jornal of Ambient Intelligence and Humanized Computing, 10(5), 2015–2028. https://doi.org/10.1007/s12652-018-0798-z.
“UCI Machine Learning Repository: Phishing Websites Data Set.” https://archive.ics.uci.edu/ml/datasets/phishing+websites (Accessed Dec. 11, 2020).
Sahingoz, O. K., Buber, E., Demir, O., & Diri, B. (2019). Machine learning based phishing detection from URLs. Expert Systems with Applications, 117, 345–357. https://doi.org/10.1016/j.eswa.2018.09.029.
Acknowledgements
This research is supported by the department of science and technology under WOS-A (SR/WOS-A/ET-1071/2014). I would like to thank my Scientist Mentor Dr. P Bhaskara Reddy for his support and help through the year.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Lakshmi, L., Reddy, M.P., Santhaiah, C. et al. Smart Phishing Detection in Web Pages using Supervised Deep Learning Classification and Optimization Technique ADAM. Wireless Pers Commun 118, 3549–3564 (2021). https://doi.org/10.1007/s11277-021-08196-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11277-021-08196-7