Category Archives: Tools

MapReduce and Scale

Lately, I have become extremely interested in MapReduce, specifically the open source implementation of this in Hadoop.
From Wikipedia (MapReduce):
MapReduce is a software framework implemented by Google to support parallel computations over large (greater than 100 terabyte) data sets on unreliable clusters of computers. This framework is largely taken from map and reduce functions commonly used [...]

Also posted in Scalability | Tagged , | 2 Comments

Tools/Libraries for IR

We’ve created a page to gather together some of the tools that a researcher or engineer working on IR problems might find useful. Hopefully this will be useful to many.
Tools/Libraries for Information Retrieval
We will be updating that page as we come across more tools.
We are enabling comments on the page, please leave comments about any [...]

Posted in Tools | Tagged , | 2 Comments
  • About grok.in

    This is a blog primarily focussed on the subjects of Information Engineering—Retrieval, Extraction & Management, Machine Learning, Scalability and Cloud Computing.