Need Solr Whiz for Early-Stage Guidance

Todd Young's picture

I need a suggestion on how to proceed with a new project from someone who understands the architecture and capabilities of Solr within Drupal.

I'm hoping to put together a basic "search project" system in which people create new searches as nodes and collect "items" from an external, non-Drupal Solr index holding tens of millions of records. The result set needs to be browsable and faceted just like an integrated result set would be, with drill-down to smaller row sets. A user would then need a link, much like the one the Drupal Flag module provides, to rapidly "bookmark" many of the results and associate them with the currently open project. Those items would then need to be imported from the external Solr index or MySQL as nodes within that project, properly mapped with Flags, Collection, Node Associations or some other such method or module. Finally, I must have the full advanced search syntax available, not just the DisMax handler.
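To make the requirement concrete, here is a minimal sketch of the kind of request the external index would need to answer: a faceted query using the standard Lucene query parser rather than DisMax, with filter queries for drill-down. The core URL, field names and query string are made up for illustration; only the Solr request parameters (`q`, `defType`, `fq`, `facet`, `facet.field`, `facet.mincount`, `rows`, `start`, `wt`) are standard.

```python
from urllib.parse import urlencode


def build_solr_query(base_url, query, facet_fields, filters=(), rows=20, start=0):
    """Build a Solr /select URL for a faceted search.

    base_url and all field names are hypothetical; adjust to the real index.
    """
    params = [
        ("q", query),
        ("defType", "lucene"),   # standard query parser, so full syntax is available
        ("wt", "json"),
        ("rows", rows),
        ("start", start),
        ("facet", "true"),
        ("facet.mincount", 1),   # hide facet values with no matching documents
    ]
    # One facet.field parameter per field to facet on.
    params += [("facet.field", field) for field in facet_fields]
    # Each fq narrows the result set; this is how drill-down is expressed.
    params += [("fq", fq) for fq in filters]
    return base_url.rstrip("/") + "/select?" + urlencode(params)


url = build_solr_query(
    "http://localhost:8983/solr/records",            # assumed core location
    'title:"steam engine" AND year:[1900 TO 1950]',  # assumed fields
    facet_fields=["category", "year"],
    filters=["category:patents"],
)
print(url)
```

Clicking a facet value would simply add another `fq` entry and re-run the query, so the Drupal side only has to keep track of the active filters for the open project.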

I need suggestions on how to proceed with the Solr solution. I've thought of the following options so far:

1) Import millions of nodes and use ApacheSolr Search Integration out of the box. It actually kind of worked, but I suspect I could get into heaps of trouble if I continue.

2) Write my own everything and frame it into Drupal, wrestling with how to call methods from outside over services or through the backend, or possibly MySQL triggers, etc.

3) Use ApacheSolr SI but don't use the default XML configs, possibly adding the ApacheSolr Ajax module, or views3 and Solr Views, or Nutch, or something else? This seems awfully hacky at this early stage, and I could encounter severe roadblocks.

4) Turn to the community for something brilliant I would have never realized and continue to be amazed at how amazing Drupal can be if you're willing to pay your dues...

Comments

Need nutch links visualization

jpk's picture

Hello,

I am interested in building a visualization for my site using Nutch.

I used Nutch to crawl the site starting from the home page and now have a few segments in the segments folder.

Now I need to create a UI that shows the traversal path Nutch followed, with the inbound and outbound links for each page.

Is there any such tool already available that I can reuse?

If not, any pointers on how I should query the linkdb?

Thanks
JPK
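One possible starting point for the linkdb question, offered as a sketch rather than a tested recipe: Nutch ships a `readlinkdb` command that can dump the linkdb to plain text (`bin/nutch readlinkdb <linkdb> -dump <out_dir>`), and that dump can be parsed into a link graph. As far as I know the linkdb records inbound links only, so outbound links are derived here by inverting them. The dump layout assumed below (a `<url>` followed by `Inlinks:`, then indented `fromUrl: ... anchor: ...` lines) and the sample URLs are assumptions; check them against your own dump before relying on this.

```python
from collections import defaultdict


def parse_linkdb_dump(text):
    """Parse a text dump of the linkdb into inbound and outbound link maps.

    Assumes each record looks like:
        <target url>\tInlinks:
         fromUrl: <source url> anchor: <anchor text>
    """
    inbound = defaultdict(set)    # page -> pages that link to it
    outbound = defaultdict(set)   # page -> pages it links to
    current = None
    for line in text.splitlines():
        stripped = line.strip()
        if stripped.endswith("Inlinks:") and not line.startswith(" "):
            # Record header: everything before "Inlinks:" is the target URL.
            current = stripped[: -len("Inlinks:")].strip()
        elif stripped.startswith("fromUrl:") and current:
            # Inlink line: keep the source URL, drop the anchor text.
            source = stripped[len("fromUrl:"):].split(" anchor:", 1)[0].strip()
            inbound[current].add(source)
            outbound[source].add(current)
    return inbound, outbound


# Made-up sample in the assumed dump format.
SAMPLE = (
    "http://example.com/about\tInlinks:\n"
    " fromUrl: http://example.com/ anchor: About\n"
    "\n"
    "http://example.com/contact\tInlinks:\n"
    " fromUrl: http://example.com/ anchor: Contact\n"
    " fromUrl: http://example.com/about anchor: Contact us\n"
)

inbound, outbound = parse_linkdb_dump(SAMPLE)
for page in sorted(outbound):
    print(page, "->", sorted(outbound[page]))
```

The two maps are enough to feed most graph-drawing libraries, with pages as nodes and each (source, target) pair as a directed edge.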

Lucene, Nutch and Solr
