A Survey on Data Mining Techniques in Distributed Databases
Main Article Content
Abstract
Data Mining is the process of discovering interesting patterns & Knowledge from large amounts of data. The
data can be gathered from different sources like Databases, Data Warehouses, the web, other information Repositories. In
client/server environment Distributed Databases play an important role for information processing and it is easy to foresee
that their importance will rapidly grow. Because of increased growth of data sharing, distributed databases are developed. In
Distributed Databases it is difficult to maintain large amounts of data in centralized databases without duplication & to
maintain secured data. In distributed databases, copy of data is stored in different locations, where memory wastage is
heavy. In many situations data is gathered from many places for analysis, where a privacy problem occurs. So, to avoid all
this Data Mining give Distributed Data Mining(DDM) to identify correct & perfect information. In this paper I am going to
concentrate on how to give security to Distributed Databases by using Data Mining Techniques like Privacy Preserving Data
Mining(PPDM),Association Rule for frequent patterns, Horizontal & Vertical partitions etc.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
IJCERT Policy:
The published work presented in this paper is licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. This means that the content of this paper can be shared, copied, and redistributed in any medium or format, as long as the original author is properly attributed. Additionally, any derivative works based on this paper must also be licensed under the same terms. This licensing agreement allows for broad dissemination and use of the work while maintaining the author's rights and recognition.
By submitting this paper to IJCERT, the author(s) agree to these licensing terms and confirm that the work is original and does not infringe on any third-party copyright or intellectual property rights.
References
Jefferey A. Hoffer. Mary B.Prescott Fred R. Mcfadden, Modern Database Management, 6th Ed, pearson education, PP:493-494.
Dr. Mohmed Kashif Oureshi et al.IJAIR, Security Aspects of Distributed Database, ISSN:2278-7844,PP:201-203.
Jiawei Han | Micheline Kamber | Jian Pei, Data Mining Concepts and Techniques, 3rd Ed, MK,PP: 5-8.
Ahmed HajYasien. Thesis on “PRESERVING PRIVACY IN ASSOCIATION RULE MINING” in the Faculty of Engineering and Information Technology Griffith University June 2007.
S.V. Vassilios , B. Elisa, N.F. Igor, P.P. Loredana, S. Yucel and T. Yannis, 2004, “State of the Art in Privacy Preserving Data Mining” Published in SIGMOD Record, 33, 2004,PP:50-57.
Nidhi saxena, Priya Gupta,Onkar Singh, 2016,”A Survey on Security Techniques in Data Mining”, IJASRM, Vol. 1, Issue 5, May 2016,ISSN:2455-6378,PP:159-162.
Hina Vaghashia,Amit Ganatra,2015,”A Survey: Privacy Preservation Techniques in Data Mining”, International Journal on Computer Applications(0975-8887),Vo.119-No.4,June 2015,PP:20-26.
Jaideep vaidya, Chris Clifton,2004,”Privacy-Preserving Data Mining: Why, How, and When”, IEEE Computer Society, Security & Privacy(1540-7993/04),November/December,2004,PP:19-27.
da Silva, J. C., Giannella, C., Bhargava, R., Kargupta, H., & Klusch, M. (2005). Distributed data mining and agents. Engineering applications of artificial intelligence, 18(7), 791-807.
Kantarioglu,M. and Clifton,C (2004) “Privacy Preserving Mining of association rules on horizontally partition data,” IEEE Transactions on Knowledge & Data Engineering, Vol.16,No.9,PP:1026-1037.
Chin-Chen Chang, Jich-Shan yeh & Yu-Chiang Li,2006,”Privacy-Preserving Mining of Association Rules on Distributed Databases”, IJCSNS, International Journal of Computer Science & Network Security,Vol:6 No.11,PP:259-266.
Aggarwal C, Yu P, “An Introduction to Privacy Preserving Data Mining”, Chapter 2 in Privacy Preserving Data Mining: Models and Algorithms, Springer, NY, USA, Pg-11 to Pg-27, (2012)
Abitha N, Sarada G, Manikandan G., Sairam, “A Cryptographic Approach for Achieving Privacy in Data Mining”. International Conference on Circuit, Power and Computing Technologies, IEEE, (2015)
Pinkas B., “Cryptographic techniques for privacy preserving data mining”, SIGKDD Exploration, Vol.4 (Issue - 2), Pg-12 to Pg-19.
Kantarcioglu, “A Survey of Privacy-Preserving Methods Across Horizontally Partitioned Data”, Chapter 13, Privacy Preserving Data Mining: Models and Algorithms, Springer, NY, USA, Pg313 to Pg-332, (2012).
Vadiya, “A Survey of Privacy-Preserving Methods Across Vertically Partitioned Data”, Chapter 14 in Privacy Preserving Data Mining: Models and Algorithms, Springer, NY, USA, Pg-337 to Pg356, (2012).
Merani M., Barcellonay C, Tinnirelloy I, “Multi Cloud Privacy Preserving Schemes for Linear Data Mining”, IEEE, Communication and Information Systems Security, (2015).
Samanthula, Elmehdwi, Jiang, “k-Nearest Neighbour Classification over Semantically Secure Encrypted Relational Data”, IEEE Transactions on Knowledge and Data Engineering, (2013).
A.V.Sriharsha,Dr.C.Parthasarathy,2015,”A Survey on Privacy Preserving Data Mining”, Interrnational Journal of Advanced Research in Computer Science & Software Engineering, ISSN:2277 128X,Vol 5,Issue 10,October-2015,PP:631-636.
Nivetha.P.R,Thamrai Selvi.K,2013,”A Surevey on Privacy Preserving Data Minging Techniques”,IJCSMC,ISSN:2320-088X,Vol 2,Issue 10,October 2013,PP:166-170.
Gayatri Nayak, Swagatika Devi,”A survey on Privacy Preserving Data Mining: Approaches & Techniques”,IJEST, ISSN:0975-5462,Vol 3,No 3, March,2011,PP:2127-2133.
Anil Vasoya, Dr.Nitin Koli,ELSEVIER,”Mining of association rules on large databases using distributed and parallel computing”,1877-0509,2016,WWW.Sciencedirect.com, PP:221-230.
L.Wang et al.,” Efficient Mining of frequent Items sets on large uncertain databases”,IEEE Transactions on Knowledge and Data Engineering, Vol.24,no.12,PP:2170-2183.Dec.2012.
V.S.Tseng,”Efficent Algorithm for Mining High utility itemsets from transactional databases”, IEEE Transactions on Knowledge and Data Engineering, Vol.25,no.8,PP:1772-1786.Aug.2013.
Kaufmann, 1994,pp. 407-419. IEEE Tran. Knowledge and Data Eng. , vol. 8, no. 6, 1996,pp. 962-969;. Distributed Algorithm for Mining Association Rules," 31-42; (VLDB 94),
J. Han , J. Pei, and Y. Yin , "Mining Frequent Patterns without Candidate Generation," Int'l. Conf. Management of Data , ACM Press, 2000,pp. 1
Albert Y. Zomaya, Tarek El-Ghazawi, Ophir Frieder, “Parallel and Distributed Computing for Data Mining”, IEEE Concurrency, 1999. International Journal of Computer Science and Information Technology, Volume 2, Number 2, April
D.W.L Cheung, V.T.Y. Ng, A.W.C. Fu, and Y. Fu. “Efficient mining of association rules in distributed databases”. IEEE Trans. Knowl. Data Eng., 8(6):911–922, 1996Tavel, P. 2007 Modeling and Simulation Design. AK Peters Ltd.
Tamir Tassa, “Secure Mining of Association rules in Horizontally Distributed Databases” IEEE trans. Knowledge and Data Engg. Vol. 26, no. 2, April 2014.
Mohamed A.Ouda,Samesh A.Salem,Ihab A.Ali, and El-Sayed M.Saad,2012,”Privacy-Preserving Data Mining(PPDM) Method for Horizontally Partitioned Data”,IJCSI, Vil.9,Issue 5,No 1, September 2012,ISSN:1694-0814,PP:339-347.