Lucene is a fabulous indexer, Nutch is a superb web crawler, and Solr can tie them together and offer world-class searching. This group discusses the various projects and efforts being made to integrate these technologies with Drupal.
The ApacheSolr module integrates Drupal with the Apache Solr search platform. Solr search can be used as a replacement for core content search and boasts both extra features and better performance. Among the extra features is the ability to have faceted search on facets ranging from content author to taxonomy to arbitrary CCK fields.
Drupal projects that already provide some level of integration with Lucene and/or Nutch:
Altering facets to use a different solr field
In order to build a cross site search system, we have created a centralised taxonomy that will be shared by a number of sites. When content is indexed, we index not only the tid, but also a shared global id for each term. Now, we are looking to integrate with the facetapi.
Slides from Drupalcon Amsterdam 2014: TURNING DRUPAL INTO A MACHINE FOR AUTOMATED DEPLOYMENT
Attached are the slides from:
https://amsterdam2014.drupal.org/session/turning-drupal-machine-automate...
Slides from NYC Apache Lucene/Solr Meetup
Last night I presented a meetup talk, "Ramp Up Your Web Experiences Using Drupal and Apache Solr":
http://www.meetup.com/NYC-Apache-Lucene-Solr-Meetup/events/155091552/
Attached are the slides for anyone interested in looking at them or discussing.
Turbocharge your Search API Solr Search setup - free webinar 12/6
In this webinar Thomas Seidl (aka Drunkenmonkey) will show how you can turbocharge your Search API Solr Search setup with Visitors Voice Free, a new SaaS that lets you add, remove and reorder the hits exactly the way you want them. Thomas will also answer any questions you might have about Search API in general.
Announcing next Search API D8 Sprint - 13-15th of June in Belgium
Wow, what a couple of intense weeks for the Drupal 8 initiative for porting Search API. Many improvements have been made lately, and we hope to explain them all properly, but for now I want to shout out a couple of things. First and foremost I want to thank everyone in the community!
It's time for another sprint on Search API, and this time it will be held at the Intracto offices from the 13th to the 15th of June. Intracto has been kind enough not only to sponsor the project with money but also to offer their offices as a sprint location for 3 full days.
Location
Intracto Group nv
Help us fund the port of Search API Drupal 8
Feedback wanted here:
https://www.drupalfund.us/project/help-us-fund-port-search-api-drupal-8
Pledges would also be really appreciated!
Introduction
Search API in Drupal 8 attracted massive interest during Drupal Dev Days in Szeged, Hungary. Now that this week has ended (or almost ended at the time of this writing), we are looking to organise at least 2 more sprints to get Search API into a stable state for Drupal 8.
With the news that the Apache Solr module & Search API Solr module will merge, this sprint is beneficial for everyone that is using search on their website.
Epoch time (DEC 31 1969)
Whenever I run a standard search from a search box, the majority (if not all) of the returned results appear with the epoch date (DEC 31 1969). Needless to say, we don't have this date anywhere in our content.
Our environment is using: php-5.3.3 (RHEL6), apache-solr-1.4.1 and apachesolr-6.x-2.0-beta2.
drush -r /var/www/html/current/ -l XXX.XXX.XXX pm-list | grep -iE '(?)solr(?)'
Apache Solr Apache Solr comment search Module Not installed 6.x-2.0-beta2
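A date of Dec 31 1969 is typically what a Unix timestamp of 0 looks like when it is rendered in a timezone west of UTC, so one thing worth checking is whether the date value reaching the result display is empty or zero. The sketch below only illustrates that arithmetic. The UTC-5 offset and the `render_timestamp` helper are assumptions made for the example and are not taken from the environment described above.

```python
from datetime import datetime, timedelta, timezone

# Assumed offset for illustration only (UTC-5); any zone west of UTC behaves the same way.
ASSUMED_TZ = timezone(timedelta(hours=-5))


def render_timestamp(ts, tz=ASSUMED_TZ):
    """Format a Unix timestamp as 'Mon DD YYYY', treating a missing value as 0."""
    return datetime.fromtimestamp(ts or 0, tz=tz).strftime("%b %d %Y")


if __name__ == "__main__":
    # A zero or missing timestamp lands a few hours before midnight UTC on Jan 1 1970.
    print(render_timestamp(0))
    print(render_timestamp(None))
```

If the displayed dates match this pattern, the indexed or returned date field is a likely place to look next.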
Big Data Drupal: Cloudera Hadoop, MapReduce, Nutch, Solr, Aegir BOA, Drupal 7 ApacheSolr Views
I am giving a talk at Badcamp on Big Data Drupal: Cloudera Hadoop, MapReduce, Nutch, Solr, Aegir BOA, Drupal 7 ApacheSolr Views
http://2013.badcamp.net/sessions/big-data-drupal-cloudera-hadoop-mapredu...
I am trying to gather some other experts (e.g. Cloudera / Hadoop / MapReduce + HyperDrupal + Twig) to come and handle the bigger and deeper questions:
https://drupal.org/node/2104503
https://groups.google.com/a/cloudera.org/forum/#!topic/cdh-user/Uwuj1q7bWBY
(Solr) Search in Drupal 8 - Part 2
Drupalcon : Further Discussion on architecture (27 Sept 2013)
This discussion is purely about what we could do with Search API to solve these issues. The previous discussion (see https://groups.drupal.org/node/327723 ) was more about what both projects could share.
(Solr) Search in Drupal 8
On Tuesday, 24th of September, we had a birds-of-a-feather session about Solr in Drupal 8, attended by around 8-10 people. All the module maintainers were present, which made for a productive meeting.
We discussed the following topics:
Changes to the core search module in Drupal 8.
Possibilities to have a generic connector for Solr.
Query Class
Document Class
Indexing logic
Schema files
Views query plugin
Indexing logic and why a queue is not a good option for indexing items to your search
Configure sort-fields?
Hello SOLR Drupal experts,
Is there a way to configure the fields used in sorting?
I am happy to write the schema and config on the Solr side to do so (such as "sInt"), but I am unsure how to do so on the Drupal side.
Thanks in advance.
Paul
PS: What is the most active time for #drupal-apachesolr?
Solr Nutch Sandbox Modified to a simple search interface
Dear Lucene, Nutch, Solr, and NJ colleagues:
The Solr Nutch project has been changed to a simple search interface that does one thing--search Solr indexes that were crawled by Nutch, using Nutch's preferred schema. Please give it a try if you use Nutch.
http://drupal.org/sandbox/cilefen/1858412
Drupal Search and Solr office hours #6
Drupal Search has a great ecosystem of modules to integrate with technologies such as Solr. However, it needs more vision and direction to grow and be a great platform where other developers feel comfortable and are able to make the right decisions. We are also convinced that if we all come together and talk, reach some decisions, and actually get to work on a regular basis, we can come up with a solution for Drupal that kicks a**!
Drupal Search and Solr office hours #5
Drupal Search has a great ecosystem of modules to integrate with technologies such as Solr. However, it needs more vision and direction to grow and be a great platform where other developers feel comfortable and are able to make the right decisions. We are also convinced that if we all come together and talk, reach some decisions, and actually get to work on a regular basis, we can come up with a solution for Drupal that kicks a**!
Solr project
I have a project built on Drupal 7 and Ubercart 3 that has already launched. After launch, the client informed me of their specialized search needs for products.
The use case: a search should return every result whose SKU or title field contains the query as a string of 3 or more consecutive characters, in any mix of upper case, lower case, numbers and punctuation ('tet' should return anything with tetanus, rgTet9-66, etc).
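One common way to get this kind of substring matching is to index character n-grams of each field (Solr ships an `NGramFilterFactory` for that purpose). The sketch below shows the idea in plain Python rather than Solr configuration. The product data, the function names, and the final substring check are all hypothetical and only illustrate the technique.

```python
def ngrams(text, n=3):
    """Return the set of lowercase character n-grams of length n found in text."""
    lowered = text.lower()
    return {lowered[i:i + n] for i in range(len(lowered) - n + 1)}


def build_index(products, n=3):
    """Map each n-gram to the SKUs whose sku or title contains it."""
    index = {}
    for sku, title in products.items():
        for field in (sku, title):
            for gram in ngrams(field, n):
                index.setdefault(gram, set()).add(sku)
    return index


def search(index, products, query, n=3):
    """Return SKUs whose sku or title contains query, ignoring case."""
    if len(query) < n:
        return set()  # queries shorter than the gram size are not supported
    # Every n-gram of the query must be present somewhere in the product.
    candidates = set.intersection(*(index.get(g, set()) for g in ngrams(query, n)))
    # Grams can match in different places, so confirm the full substring.
    needle = query.lower()
    return {
        sku for sku in candidates
        if needle in sku.lower() or needle in products[sku].lower()
    }


if __name__ == "__main__":
    # Hypothetical catalogue: sku -> title
    catalogue = {
        "rgTet9-66": "Vaccine vial",
        "A100": "Tetanus booster",
        "B200": "Bandage roll",
    }
    idx = build_index(catalogue)
    print(sorted(search(idx, catalogue, "tet")))
```

Lowercasing at both index and query time gives the case-insensitive behaviour, and because no tokenizing happens before the n-grams are built, digits and punctuation such as the hyphen in a SKU are matched like any other character.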
Easy Integration on SOLR-Nutch-Drupal
Hello,
Can anybody point me to a step-by-step guide on combining SOLR-NUTCH-DRUPAL together?
What I am trying to achieve is to crawl some data from various websites and create a comparison platform for them. Any help will be much appreciated.
Many thanks in advance.
Shashank
Drupal Search and Solr office hours
Drupal Search has a great ecosystem of modules to integrate with technologies such as Solr. However, it needs more vision and direction to grow and be a great platform where other developers feel comfortable and are able to make the right decisions. We are also convinced that if we all come together and talk, reach some decisions, and actually get to work on a regular basis, we can come up with a solution for Drupal that kicks a**!
Big Data Drupal with Cloudera, Hadoop, MapReduce, Nutch and Solr
Thanks to the recent work of the Solr Nutch sandbox project, I've managed to get Nutch 1.6 jobs to run on a 4-node Cloudera CDH3 cluster, sending results to Solr 3.6.2 (hosted within Tomcat on Aegir BOA). The results are then integrated into search results and Apache Solr Views through the Apache Solr 7.1.1 module (not the dev release).
I must say, I am pretty excited about Hadoop / Cloudera running Nutch and Solr and integrating with Drupal.
For anyone interested in setting up a Cloudera cluster, I recommend masterschema (CentOS) and Gregory Grubbs on YouTube (Debian).
I'll post some notes etc ASAP
Best approach to indexing stemmed and unstemmed fulltext in Drupal?
A common desire with Apache Solr search servers is to get the "best of both" stemming and not stemming terms, indexing both the original term and the stem with something like SnowballPorterFilterFactory. Stemming matches grammatical variations, while indexing the original boosts exact matches to rank higher than near matches, and protects against awkward cases where after stemming, the original term no longer matches.
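The "best of both" idea can be sketched in a few lines of Python: keep the original tokens and the stems in two separate fields, then score exact matches higher than stem-only matches. In Solr this is usually approximated with two fields that use different analyzers and are queried with different boosts. Everything below is a hypothetical illustration: `toy_stem` is a crude stand-in and not the Snowball algorithm, and the boost values are assumed.

```python
# Assumed boost values for illustration; real deployments tune these.
EXACT_BOOST = 2.0
STEM_BOOST = 1.0


def toy_stem(term):
    """Crude stand-in for a real stemmer: strip a few common English suffixes."""
    for suffix in ("ing", "ed", "es", "s"):
        if term.endswith(suffix) and len(term) - len(suffix) >= 3:
            return term[:-len(suffix)]
    return term


def index_document(text):
    """Index both the original tokens and their stems, in separate fields."""
    tokens = text.lower().split()
    return {"exact": set(tokens), "stemmed": {toy_stem(t) for t in tokens}}


def score(doc, query):
    """Sum boosts per query term: exact matches add more than stem-only matches."""
    total = 0.0
    for term in query.lower().split():
        if term in doc["exact"]:
            total += EXACT_BOOST
        if toy_stem(term) in doc["stemmed"]:
            total += STEM_BOOST
    return total


if __name__ == "__main__":
    exact_doc = index_document("indexing content")
    near_doc = index_document("indexed content")
    # The document containing the literal query term should rank first.
    print(score(exact_doc, "indexing"), score(near_doc, "indexing"))
```

Because the exact field is checked independently of the stemmed one, a term still matches its own original form even when the stemmer would map it somewhere unexpected.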