Detection of Malicious URLs using Artificial Intelligence
Main Article Content
Abstract
Background/Objectives: The main objective of the project is to avoid various security threats and network attacks by detecting malicious Uniform Resource Locator(URL) based on the keyword text classification.
Methods/Statistical analysis: A semi-supervised technique, naive Bayes classification is proposed to locate malicious URL by text classification phenomena. The probabilities of the predicted and the exact values are calculated and it results with high probability. With more accuracy the malignant URL is predicted. A page rank algorithm is used to detect the blacklist which contains the URLs that are already noted as spam, malware or phishing URL.
Findings: With the persistent improvement of Web assaults, many web applications have been languishing from different types of security dangers and system assaults. The security identification of URLs has consistently been the focal point of Web security. One of the main sources of attacks is via malicious URLs; the attackers may send embedding executable codes or injects malicious codes through these URLs. Thus, it is important to improve the unwavering quality and security of web applications by precisely identifying malignant URLs. The utilization of profound figuring out how to group URLs to recognize Web guests' aims has significant hypothetical and scientific values for Web security investigate, giving new plans to canny security discovery.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
IJCERT Policy:
The published work presented in this paper is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. This means that the content of this paper can be shared, copied, and redistributed in any medium or format, as long as the original author is properly attributed. Additionally, any derivative works based on this paper must also be licensed under the same terms. This licensing agreement allows for broad dissemination and use of the work while maintaining the author's rights and recognition.
By submitting this paper to IJCERT, the author(s) agree to these licensing terms and confirm that the work is original and does not infringe on any third-party copyright or intellectual property rights.
References
Surendra Sedhai; AixinSun" Semi-Supervised Spam Detection in Twitter Stream" IEEE Transactions on Computational Social Systems (Volume: 5, Issue: 1, March 2018)
Bo Feng; Qiang Fu; Mianxiong Dong; Dong Guo; Qiang Li "Multistage and Elastic Spam Detection in Mobile Social Networks through Deep Learning"IEEE Network ( Volume: 32 , Issue: 4 , July/August 2018 )
Jonghyuk Song; Sangho Lee; Jong Kim " Inference Attack on Browsing History of Twitter Users Using Public Click Analytics and Twitter Metadata" IEEE Transactions on Dependable and Secure Computing (Volume: 13, Issue: 3, May-June 1 2016)
Longfei Wu; Xiaojiang Du; JieWu" Effective Defense Schemes for Phishing Attacks on Mobile Computing Platforms" IEEE Transactions on Vehicular Technology (Volume: 65, Issue: 8, Aug. 2016)
Eric Lancaster ; TanmoyChakraborty ; V. S. Subrahmanian"MALTP : Parallel Prediction of Malicious Tweets"IEEE Transactions on Computational Social Systems( Volume: 5 , Issue: 4 , Dec. 2018 )
Hong Zhao; Zhaobin Chang; Weijie Wang; XiangyanZeng" Malicious Domain Names Detection Algorithm Based on Lexical Analysis and Feature Quantification" IEEE Access (Volume: 7)
Xuanzhe Liu ; Yun Ma ; Xinyang Wang ; Yunxin Liu ; Tao Xie ; Gang Huang"SWAROVsky: Optimizing Resource Loading for Mobile Web Browsing"IEEETransactions on Mobile Computing( Volume: 16 , Issue: 10 , Oct. 1 2017 )
DohoonKim" Potential Risk Analysis Method for Malware Distribution Networks" IEEE Access (Volume: 7)
JoostBerkhout"Google's PageRank algorithm for ranking nodes in general networks"IEEE 2016 13th International Workshop on Discrete Event Systems (WODES)
Zhou Hao; PuQiumei; Zhang Hong; ShaZhihao"An Improved PageRank Algorithm Based on Web Content" IEEE 2015 14th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES)
Yuguang Huang ; Lei Li"Naive Bayes classification algorithm based on small sample set" IEEE 2011 IEEE International Conference on Cloud Computing and Intelligence Systems
HaiyiZhang; Di Li"Naïve Bayes Text Classifier" IEEE 2007 IEEE International Conference on Granular Computing (GRC 2007)
Mohammed Al-Janabi: Ed de Quincey: Peter Andras: “Using supervised machine learning algorithms to detect suspicious URLs in online social networks” 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.
Justin Ma: Lawrence K. Saul: Stefan Savage: Geoffrey M. Voelker:“Identifying Suspicious URLs: An Application of Large-Scale Online Learning” 26th International Conference on Machine Learning, Montreal, Canada, 2009.
Training Datasets: https://www.kaggle.com/teseract/urldataset