A Survey on Estimation of Time on Hadoop Cluster for Data Computation

Mohan Kumar M K, Akram Pasha

Abstract

Data is generated from many sources, such as social media, the Internet, IT management, business, and scientific applications. It reaches petabytes in size and is largely unstructured, which makes it difficult to manage within a given interval of time. Hadoop is used to address Big Data: data is stored in the Hadoop Distributed File System (HDFS), and information is retrieved using the MapReduce model. A Hadoop cluster is considered using a single node throughout, and three kinds of time are analyzed: user, real, and system time. The cloud supports the management and examination of several types of data through Big Data-as-a-Service.
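
As a rough illustration of the three kinds of time named above, the following Python sketch measures real (wall-clock) time, user time (CPU time spent in user mode), and system time (CPU time spent in the kernel) for a single local function call. It does not run on Hadoop. The helper names `measure` and `word_count` are hypothetical, and the word count only stands in for a MapReduce-style job.

```python
import os
import time


def measure(func, *args):
    """Run func once and return (result, real, user, system) in seconds."""
    cpu_start = os.times()            # process CPU times before the call
    wall_start = time.perf_counter()  # wall-clock reference point
    result = func(*args)
    wall_end = time.perf_counter()
    cpu_end = os.times()

    real = wall_end - wall_start                 # elapsed wall-clock time
    user = cpu_end.user - cpu_start.user         # CPU time in user mode
    system = cpu_end.system - cpu_start.system   # CPU time in kernel mode
    return result, real, user, system


def word_count(lines):
    """Hypothetical stand-in for a MapReduce-style word count."""
    counts = {}
    for line in lines:
        for word in line.split():
            counts[word] = counts.get(word, 0) + 1
    return counts


lines = ["big data on hadoop", "hadoop cluster"] * 1000
result, real, user, system = measure(word_count, lines)
print(f"real={real:.4f}s user={user:.4f}s sys={system:.4f}s")
```

On a single core, real time is usually at least the sum of user and system time, since it also includes time spent waiting on I/O or on other processes.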

Keywords

-