Posts

Showing posts from May 2013

Introduction to Map Reduce with Hadoop (I)

I thought of compiling some of the basic concepts related to MapReduce and Hadoop, along with a trivial sample to get started with Hadoop. I would also like to discuss the new MapReduce API of Hadoop; the samples will be based on the new API.

Why Map Reduce?

Nowadays we are surrounded by huge amounts of data, and each of us keeps consuming and generating data every second. Facebook, YouTube, Twitter, LinkedIn, Google searches, and almost everything else we do on the internet involves a huge amount of data. The main challenge is to analyze such huge volumes of data and make decisions based on the analysis. Google came up with an abstraction called MapReduce to address the challenges of processing high volumes of data in parallel.

Fundamentals of Map Reduce

MapReduce is a programming model and an associated implementation for processing and generating large data sets. It was originally developed by Google ( MapReduce: Simplified Dat...
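
To make the programming model a little more concrete, here is a minimal in-memory sketch of a word count written in plain Java. It is not the Hadoop API: the class name `WordCountSketch` and its `map`, `shuffle`, `reduce`, and `run` methods are hypothetical names chosen for illustration, and everything runs in a single process. It only shows the general shape of the idea, assuming the usual three steps: a map step emits (word, 1) pairs, a grouping step collects the values for each key, and a reduce step sums them.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// In-memory illustration of the MapReduce model. This is NOT the Hadoop API;
// all names here are hypothetical and exist only for this sketch.
class WordCountSketch {

    // Map step: turn one input line into a list of (word, 1) pairs.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : line.toLowerCase().split("\\s+")) {
            if (!word.isEmpty()) {
                pairs.add(Map.entry(word, 1));
            }
        }
        return pairs;
    }

    // Grouping step: collect every emitted value under its key.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                   .add(pair.getValue());
        }
        return grouped;
    }

    // Reduce step: combine all values for one key into a single count.
    static int reduce(String key, List<Integer> values) {
        int sum = 0;
        for (int value : values) {
            sum += value;
        }
        return sum;
    }

    // Runs the three steps in sequence over the given input lines.
    static Map<String, Integer> run(List<String> lines) {
        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        for (String line : lines) {
            emitted.addAll(map(line));
        }
        Map<String, Integer> counts = new TreeMap<>();
        shuffle(emitted).forEach((word, ones) -> counts.put(word, reduce(word, ones)));
        return counts;
    }

    public static void main(String[] args) {
        // Prints {hadoop=2, map=2, reduce=2, with=1}
        System.out.println(run(List.of("map reduce with hadoop", "hadoop map reduce")));
    }
}
```

In a real cluster the map and reduce calls would run on many machines and the grouping would happen over the network; the sketch keeps them as ordinary method calls so the data flow is easy to follow.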