If you are looking to write a web crawler, Perl, with all its great CPAN modules, is one of the best platforms you can pick. There are CPAN modules for most of the common components of a web crawler. Here, I’ll point to some of the modules that you would want to start out with.
About grok.in
This is a blog primarily focussed on the subjects of Information Engineering—Retrieval, Extraction & Management, Machine Learning, Scalability and Cloud Computing.
3 Comments
check this article about web crawling,
http://crawltheweb.blogspot.com/
the blog is nice..but it is be nice if it will be included with examples..like for WWW::Mechanize..
tnx..
Thanks for the useful information.
Regards,
kiran Kumar. Bodi