Notes

Software/Libraries for Full-text Search

Notes on some of the software and libraries that come in handy when you want to add full-text search to your applications.

Web Crawling with Perl

Notes on some of the Perl CPAN modules that are most useful when the task at hand is web crawling.

Cloud Computing

I think everyone has a moral (:P) responsibility to add to the confusion. Only through such attempts can we achieve clarity. These notes, an attempt to put into words my understanding of what Cloud Computing is and is not, are a contribution to that end.

MapReduce and Scale

Some preliminary notes explaining the MapReduce computing paradigm with a particular focus on one implementation: Hadoop.

Machine Learning: Classification

A quick introduction to Machine Classification.

  • About grok.in

    This is a blog primarily focussed on the subjects of Information Engineering: Retrieval, Extraction & Management, Machine Learning, Scalability and Cloud Computing.