A Survey on Dynamic Resource Allocation for Efficient Parallel Data Processing

Main Article Content

B. Praveen Kumar
Santhosh Kumar

Abstract

As recently ad-hoc parallel information handling has developed to be one of the executioner applications for Infrastructure as a service (IaaS) cloud. Most of the Distributed computing organizations have begun to coordinate systems for parallel information making ready in their item portfolio, making it straightforward for clients to get to these Infrastructure and to convey their projects. The preparing constitution which are right now utilized have been intended for static, homogeneous group setups and slight the specific way of a cloud. Subsequently, the dispensed figure assets may be lacking for huge parts of the submitted work and superfluously increment handling time and cost. In this paper we discuss the challenges for proficient parallel information handling in cloud and introduce our exploration venture Nephele. Nephele is the first information preparing structure to expressly neglect the dynamic quality portion offered by today's IaaS mists for each, trip booking and execution. In this paper we talk about the open doors and challenges for effective parallel information preparing Specific errands of a handling occupation can be doled out to distinctive sorts of virtual machines which are consequently instantiated and ended amid the employment execution. In view of this new structure, we perform amplified assessments of Map Reduce-roused preparing occupations on an IaaS cloud framework and contrast the outcomes with the mainstream information handling system Hadoop.

Article Details

How to Cite
[1]
B. Praveen Kumar and Santhosh Kumar, “A Survey on Dynamic Resource Allocation for Efficient Parallel Data Processing”, Int. J. Comput. Eng. Res. Trends, vol. 2, no. 12, pp. 1106–1112, Dec. 2015.
Section
Survey

References

Amazon Web Services LLC. Amazon Elastic Compute Cloud (Amazon EC2). http://aws.amazon.com/ec2/, 2009.

Amazon Web Services LLC. Amazon Elastic MapReduce. http://aws.amazon.com/elasticmapreduce/ , 2009.

AmazonWeb Services LLC. Amazon Simple Storage Service. http://aws.amazon.com/s3/ , 2009.

M. Coates, R. Castro, R. Nowak, M. Gadhiok, R. King, and Y. Tsang. Maximum Likelihood Network Topology Identification from Edge-Based Unicast Measurements. SIGMETRICS Perform. Eval. Rev., 30(1):11–20, 2002.

R. Davoli. VDE: Virtual Distributed Ethernet. Testbeds and Research Infrastructures for the Development of Networks & Communities, International Conference on, 0:213–220, 2005.

J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI’04: Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation, pages 10–10, Berkeley, CA, USA, 2004. USENIX Association.

I. Raicu, I. Foster, and Y. Zhao. Many-Task Computing for Grids and Super computers. In ManyTask Computing on Grids and Supercomputers, 2008. MTAGS 2008. Workshop on, pages 1–11, Nov.2008.

M. Stillger, G. M. Lohman, V. Markl, and M.Kandil. LEO-DB2’s LEarning Optimizer. In VLDB ’01: Proceedings of the 27th International Conference on Very Large Data Bases, pages 19–28, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc.

D. Warneke and O. Kao. Nephele: Efficient Parallel Data Processing in the Cloud. In MTAGS ’09: Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers, pages 1– 10, New York, NY, USA, 2009. ACM.

T. White. Hadoop: The Definitive Guide. O’Reilly Media, 2009.