Semantic & Behavioral Feature analysis for Detecting Fake Reviews using Machine Learning
Main Article Content
Abstract
Background: In this age of technology, online business is playing a vital role in the growth of the economy of the business. Hence, people need feedback on various products, technologies, businesses, etc. Since their opinions are the input for an individual to evaluate and adapt them. Therefore, the Review system is playing a vital role in decision making. So there arises a necessity to evaluate the reviews as well since the business units are trying to generate fake reviews to identify more clients for their products.
Methods/ Statistical Analysis: In this paper, we implement two machine learning algorithms SVM and Naïve Bayes algorithm and analyse the data and predict for the new set of data. We also compare the performance of both algorithms.
Findings: In this paper, we are trying to develop a Machine learning model which analyses the reviews on various factors and obtain the necessary features and classify the reviews as a fake or non-fake review. This helps in identifying fraudulent reviews and predicts the trustworthiness of the reviews in the future.
Improvements: The system can introduce and make available Machine learning techniques and identifying fake reviews at the earliest stage.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
IJCERT Policy:
The published work presented in this paper is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. This means that the content of this paper can be shared, copied, and redistributed in any medium or format, as long as the original author is properly attributed. Additionally, any derivative works based on this paper must also be licensed under the same terms. This licensing agreement allows for broad dissemination and use of the work while maintaining the author's rights and recognition.
By submitting this paper to IJCERT, the author(s) agree to these licensing terms and confirm that the work is original and does not infringe on any third-party copyright or intellectual property rights.
References
Esposito, Gennaro. LP-type methods for Optimal Transductive SVMs. Vol. 3. Gennaro Esposito, PhD, 2014.
N. Jindal and B. Liu, "Review spam detection", Proceedings of the 16th international conference on World Wide Web - WWW 07 (2007), ACM, pp. 1189–1190, 2007
Myle Ott, Yejin Choi, Claire Cardie, and Jeffrey T. Hancock. Finding deceptive opinion spam by any stretch of the imagination. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, HLT, 2011.
S. Xie, G. Wang, S. Lin, and P. S. Yu, "Review spam detection via temporal pattern discovery," p. 823, 2012.
W. Etaiwi,G.Naymat, "The impact of applying preprocessing steps on review spam detection", The 8 th international conference on emerging ubiquitous system and pervasion networks, Elsevier, pp. 273-279, 2017.
P.Rosso, D.Cabrera, M. Gomez, "Detecting positive and negative deceptive opinions using PU-learning", Elsevier, pp.1-11, 2014.
S. Feng, L. Xing, A Gogar and Y. Choi, "Distributional footprints of deceptive product reviews". In Proceedings of the 6th International AAAI Conference on Weblogs and Social Media (ICWSM), pp. 98- 105, 2012.
M. I. Ahsan, T. Nahian, A. A. Kafi, M. I. Hossain, and F. M. Shah, "Review spam detection using active learning," 2016 IEEE 7th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), 2016.
M. Singh, L. Kumar, and S. Sinha, "Model for Detecting Fake or Spam Reviews," Advances in Intelligent Systems and Computing ICT Based Innovations, pp. 213–217, Jan. 2017