Big Data Drupal with Cloudera, Hadoop, MapReduce, Nutch and Solr
Thanks to the recent work on the Solr Nutch sandbox project, I've managed to get Nutch 1.6 jobs running on a four-node Cloudera CDH3 cluster, sending results to Solr 3.6.2 (hosted within Tomcat on Aegir BOA). The results are then integrated, via the Apache Solr 7.1.1 module (not the dev release), into search results and Apache Solr Views.
I must say, I am pretty excited about Hadoop / Cloudera running Nutch and Solr and integrating with Drupal.
For anyone interested in setting up a Cloudera cluster, I recommend masterschema (CentOS) and Gregory Grubbs on YouTube (Debian).
I'll post some notes etc. ASAP.
Best approach to indexing stemmed and unstemmed fulltext in Drupal?
A common desire with Apache Solr search servers is to get the "best of both" stemming and not stemming terms, indexing both the original term and the stem with something like SnowballPorterFilterFactory. Stemming matches grammatical variations, while indexing the original boosts exact matches to rank higher than near matches, and protects against awkward cases where after stemming, the original term no longer matches.
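As a rough illustration of the "best of both" idea, the sketch below builds query parameters for Solr's edismax parser that search a stemmed field and an unstemmed field together, boosting the unstemmed one so exact matches rank higher. The field names `content` and `content_unstemmed`, the boost value, and the helper name are assumptions made for the example; it also assumes the schema already copies the text into an unstemmed field (for instance with a copyField rule).

```python
from urllib.parse import urlencode


def build_boosted_query(terms, stemmed_field="content",
                        exact_field="content_unstemmed", exact_boost=2.0):
    """Return edismax parameters that search both fields.

    Field names and the boost are hypothetical; adjust to the real schema.
    """
    return {
        "q": terms,
        "defType": "edismax",
        # qf lists the fields to search; "^" attaches a per-field boost,
        # so a hit on the unstemmed field counts for more than a stemmed hit.
        "qf": f"{stemmed_field} {exact_field}^{exact_boost}",
    }


params = build_boosted_query("running shoes")
# The encoded string can be appended to a Solr select URL.
print(urlencode(params))
```

With this setup a document containing the exact word still matches through the stemmed field as well, so the boost adds to its score rather than replacing the stemmed match.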
Solr Nutch Search Sandbox Project Updated to Integrate with Common Schema
Hello all:
Based on our discussion last month on IRC, I reconfigured this sandbox project as a few Nutch settings that create an index compatible with the common schema for the apachesolr module.
http://drupal.org/sandbox/cilefen/1858412
The purpose is ad-hoc crawling and indexing, while searching happens within Drupal, where the results are integrated with the Drupal node results.
This is for Nutch 1.x only at this stage.
Solr Nutch Search Sandbox Project Added
Hi All,
I just added "Solr Nutch Search", a sandbox project.
http://drupal.org/sandbox/cilefen/1858412
I welcome your feedback. Let me know if it is good enough for a full project, in which case I could use a co-maintainer.
-Chris McCafferty
Drupal Search and Solr office hours
Drupal Search has a great ecosystem of modules that integrate with technologies such as Solr. However, it needs more vision and direction to grow into a great platform where other developers feel comfortable and are able to make the right decisions. We are also convinced that if we all come together, talk, make some decisions and actually work on this on a regular basis, we can come up with a solution for Drupal that kicks a**!
Searchapi integration with searchapi solr
I am using the searchapi module and the searchapi solr module, and I have set up Solr successfully.
I have set up two different instances on one server for two different content types (A, B),
and indexes are created using them.
Newer version of View3 + Solr article
Hi.
I've installed Solr 3.6.1 and apachesolr (7.x-1.0-rc3) and apachesolr_attachments (7.x-1.2) modules.
It's all running on a Tomcat 7 server.
I can run searches and get the results in a standard page.
I'm working on getting my site's Search results into a Drupal View.
I've installed apachesolr_views module (7.x-1.0-beta1), and found this:
http://www.acquia.com/blog/views-3-apache-solr-acquia-drupal-future-search
It's pretty old, and what I'm seeing in that walkthrough is different from what I see on my system.
"Updating" facets after search results have been modified in a hook?
I am using the hook_apachesolr_process_results to modify the result set returned by Solr search. However, the enabled filters/facets do not reflect the modifications to the result set. How can I ensure that the facets reflect the changes to the results?
Thanks!
Drupal multisite installation, multicore Apache Solr and Apache Solr multisite search
Hi all,
I am new to Apache Solr technology and its Drupal integration.
I want to ask whether it is possible to create the following configuration:
Base site (multisite search including results from the separate site1 and site2 indexes)
www.example.com
Subdomain site1 (multisite installation with a different database + Apache Solr core1 index)
site1.example.com
Subdomain site2 (multisite installation with a different database + Apache Solr core2 index)
site2.example.com
As far as I know, the main concept for multisite search is the following configuration
SOLR facet alter
I have a site that is using SOLR search with just a couple of facets. The filter by type facet works for my needs out of the box without much adjustment. However, I need to change the markup of the 'sort by' facet to be a 'select' list rather than an unordered list to match the design.
I have been searching for a solution to this, but have yet to find one. The site is on D7 with apachesolr 7.x-1.0-beta16.
Any help would be greatly appreciated.
-Jeff

