Installing Nutch

Events happening in the community are now at Drupal community events on www.drupal.org.
maxmmize's picture

Hi,

I'm new to Drupal, new to Solr, new to Nutch. Thanks to Robert for his dedication in answering my questions.

I installed Solr with Jetty and it is working. I am using Robert's module. (Thanks again)

I have installed Nutch v1.4.0 inside of my /home/lib folder. I have read a lot of the documentation for Nutch. I installed the nutch crawler from Drupal. I run 6.x. I run a Centos box. I'm using Apache 2.0 and PHP 5 v2.9. I have Tomcat 5+ running.

Obviously, since Solr is working, everything is fine for a Nutch install (with Tomcat).

I have placed mu nutch files inside lib/nutch. I can bin/nutch and I get the commands needed. At this point, I ASSUME I have a valid install of nutch.

I installed the latest nutch module into Drupal. I get no error messages after I enabled exec() and restarted. (I had to MANUALLY create a logs/hadoop file) Other than that, I have no errors and both my users nutch and server are running in the same group.

Questions:

How do I verify a good nutch install?
Why does my nutch crawler just sit at 0 inside the Drupal GUI?
How can I sql inject world peace concepts into carbon forms?

Any help would be great.

Comments

http://drupal.org/node/811062

maxmmize's picture

http://drupal.org/node/811062#comment-3618672

Found out most on my own. Solved a few errors I created and have a few still unresolved. Have crawled over 1k URLS and Solr is showing 4500+ documents in index.

Now, I have to find a way to get the to display in the search.

Lucene, Nutch and Solr

Group organizers

Group categories

Projects

Group notifications

This group offers an RSS feed. Or subscribe to these personalized, sitewide feeds: