Phishing Websites Classification Placed on URL Features and Extreme Machine Learning

Keywords

Neural Network
Bernoulli Naive Bayes
Phishing Attacks
Website Security

How to Cite

Ranjitha R G, & Meenakshi Sundaram A. (2023). Phishing Websites Classification Placed on URL Features and Extreme Machine Learning. Transactions on Federated Engineering and Systems, 1(1), 37–43. https://doi.org/10.5281/zenodo.10279674

Abstract

Phishing attacks have become an increasingly common threat to individuals and organizations alike. Traditional detection methods, such as blacklisting known phishing URLs or using heuristics to identify suspicious websites, have proven to be of limited effectiveness. Phishing attackers continuously evolve their tactics, making it difficult for these methods to keep up. To address this challenge, this study explores the use of machine learning classifiers to detect illegitimate websites. Specifically, it uses the Multilayer Perceptron (a feed-forward neural network) and Bernoulli Naive Bayes (NB) classifiers. Feature selection is performed with a decision tree classifier, which helps identify the most relevant features for the classification task. To train and test the classifiers, the study collected a dataset of blacklisted and whitelisted websites. Accuracy, precision, recall, and the ROC curve were among the measures used to assess the classifiers' effectiveness. The results demonstrate the effectiveness of the Multilayer Perceptron and Bernoulli NB classifiers in detecting phishing websites: the feed-forward neural network classifier achieved an accuracy of over 82% on the dataset. These results showcase the potential of machine learning techniques to improve the detection of phishing attacks and reduce the risks they pose.
https://doi.org/10.5281/zenodo.10279674
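
The Bernoulli NB part of the approach described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not list the URL features, so the six binary features in `url_features` are hypothetical, the handful of URLs (reserved example domains and documentation IP ranges) are made up purely for demonstration, and the decision tree feature selection and Multilayer Perceptron steps are left out.

```python
import math
from urllib.parse import urlparse


def url_features(url):
    """Map a URL to binary features (hypothetical; not the paper's feature set)."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    return [
        int(len(url) > 54),                    # unusually long URL
        int("@" in url),                       # '@' symbol present
        int(host.replace(".", "").isdigit()),  # host is a raw IPv4 address
        int(host.count(".") > 2),              # many dot-separated host levels
        int("-" in host),                      # hyphen in the host name
        int(parsed.scheme != "https"),         # not served over HTTPS
    ]


class BernoulliNB:
    """Bernoulli Naive Bayes over 0/1 features with Laplace smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, X, y):
        self.classes = sorted(set(y))
        n_features = len(X[0])
        self.log_prior = {}
        self.theta = {}  # theta[c][j] = P(feature j is 1 | class c)
        for c in self.classes:
            rows = [x for x, label in zip(X, y) if label == c]
            self.log_prior[c] = math.log(len(rows) / len(X))
            self.theta[c] = [
                (sum(r[j] for r in rows) + self.alpha) / (len(rows) + 2 * self.alpha)
                for j in range(n_features)
            ]
        return self

    def predict_one(self, x):
        scores = {}
        for c in self.classes:
            score = self.log_prior[c]
            for xj, p in zip(x, self.theta[c]):
                score += math.log(p) if xj else math.log(1.0 - p)
            scores[c] = score
        return max(scores, key=scores.get)

    def predict(self, X):
        return [self.predict_one(x) for x in X]


def evaluate(y_true, y_pred, positive=1):
    """Accuracy, precision, and recall for the positive (phishing) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return {
        "accuracy": sum(t == p for t, p in pairs) / len(pairs),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Made-up illustrative data: label 1 = phishing, 0 = legitimate.
train = [
    ("https://example.com/", 0),
    ("https://www.example.org/about", 0),
    ("https://docs.example.net/guide", 0),
    ("http://192.0.2.10/secure/login", 1),
    ("http://login-example.com.account.verify.example.test/update", 1),
    ("http://user@198.51.100.7/bank", 1),
]
test = [("https://example.com/contact", 0), ("http://203.0.113.5/verify", 1)]

model = BernoulliNB().fit([url_features(u) for u, _ in train], [y for _, y in train])
predictions = model.predict([url_features(u) for u, _ in test])
metrics = evaluate([y for _, y in test], predictions)
print(predictions, metrics)
```

Bernoulli NB expects each feature to be a yes/no indicator, which is why the sketch thresholds quantities such as URL length rather than using raw counts. A real evaluation would need a much larger labeled dataset and a held-out test split, as the study describes.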

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Copyright (c) 2023 Ranjitha R G, Meenakshi Sundaram A