International Journal of Advances in Computer Science and Its Applications

Efficient Utilization Of Profiles To Reduce Time In Very Large Data Set



Hadoop is a software framework for analyzing large data sets. The Hadoop Distributed File System (HDFS) and the MapReduce paradigm provide an efficient way to deal with the terabytes of data being produced every second. MapReduce is a popular way to handle data in cloud environments because of its excellent scalability and good fault tolerance. However, creating a profile for the same job again and again makes it less efficient. This paper proposes an interface that reduces the time taken to match sampled MapReduce jobs (Js) with already created profiles. It acts as a mediator between the profile store and the worker nodes.
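The mediator idea can be illustrated with a minimal sketch. It is not the paper's implementation: the class name `ProfileInterface`, the `JobProfile` fields, and the choice of matching jobs by mapper and reducer name are all assumptions made for illustration, since the abstract does not specify how a job is matched to a stored profile.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass(frozen=True)
class JobProfile:
    """Hypothetical profile; the fields are illustrative, not from the paper."""
    map_selectivity: float
    reduce_selectivity: float


class ProfileInterface:
    """Hypothetical mediator between a profile store and worker nodes.

    A profile is built only on the first request for a given job
    signature; later requests are served from the store.
    """

    def __init__(self, profiler: Callable[[str, str], JobProfile]) -> None:
        # In-memory stand-in for the profile store (assumption).
        self._store: Dict[Tuple[str, str], JobProfile] = {}
        self._profiler = profiler
        self.profiling_runs = 0  # counts how often a profile had to be built

    def get_profile(self, mapper_name: str, reducer_name: str) -> JobProfile:
        # Assumed matching rule: same mapper and reducer means same profile.
        key = (mapper_name, reducer_name)
        profile = self._store.get(key)
        if profile is None:
            # No stored profile matched, so build one and keep it for reuse.
            profile = self._profiler(mapper_name, reducer_name)
            self.profiling_runs += 1
            self._store[key] = profile
        return profile
```

In this sketch a worker calls `get_profile` before running a sampled job, so the profiling cost is paid only once per distinct job signature. A real deployment would need a persistent store and a more robust matching rule than name equality.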

No. of Author(s) : 2
Page(s) : 48 - 52
Electronic ISSN : 2250 - 3765
Volume 4 : Issue 3