Lately, I have become extremely interested in MapReduce, specifically the open source implementation of this in Hadoop.
From Wikipedia (MapReduce):
MapReduce is a software framework implemented by Google to support parallel computations over large (greater than 100 terabyte) data sets on unreliable clusters of computers. This framework is largely taken from map and reduce functions commonly used in functional programming.
That description, though quite accurate, does not do justice to MapReduce.
2 Comments
Nice summary, Sid. Looking forward to read more on this subject from you.
Thank you Naseer. I will try to report on my experiences as I go about using this.