Category Archives: Information Extraction

Fun with Google Sets

Google Sets is a real fun experiment from the Google Labs. It basically allows you to “automatically create sets of items from a few examples.” So you can enter “Sachin Tendulkar”, “Rahul Dravid” and “Sourav Ganguly,” and, be presented with a much larger set of the players of the Indian Cricket Team. Or enter “Athens”, [...]

Also posted in Information Retrieval, Machine Learning | Tagged , | 1 Comment

Tag Mirror

LibraryThing (an online service to help people catalogue their books easily) recently launched a very useful feature that they call “Tag Mirror“. This is one of the more interesting things that has been done with tags. In fact, I would wager that this is one of the best thing to happen to tagging since tag [...]

Posted in Information Extraction | Tagged | Leave a comment

Interesting Papers on Web Spam at AIRWeb 2007

AIRWeb (Adversarial Information Retrieval on the Web) is workshop on IR in the world of Web Spam. From the call for papers page:
Adversarial Information Retrieval addresses tasks such as gathering, indexing, filtering, retrieving and ranking information from collections wherein a subset has been manipulated maliciously. On the Web, the predominant form of such manipulation is [...]

Posted in Information Extraction | Tagged , | 1 Comment
  • About grok.in

    This is a blog primarily focussed on the subjects of Information Engineering—Retrieval, Extraction & Management, Machine Learning, Scalability and Cloud Computing.