Lucene is a fabulous indexer, Nutch is a superb web crawler, and Solr can tie them together and offer world-class searching. This group discusses the various projects and efforts being made to integrate these technologies with Drupal.
The ApacheSolr module integrates Drupal with the Apache Solr search platform. Solr search can be used as a replacement for core content search and boasts both extra features and better performance. Among the extra features is the ability to have faceted search on facets ranging from content author to taxonomy to arbitrary CCK fields.
Drupal projects that already provide some level of integration with Lucene and/or Nutch:
Anyone working on the nutch module for 6?
Hi there,
Is anyone working on getting the nutch module working for Drupal 6? Does anyone know of other avenues to get full-text document search (.pdf, .doc, etc.) in Drupal 6?
Thanks!
Multisite Search using ApacheSolr module
Hi,
Can anyone let me know if it is possible to index and search multiple Drupal and non-drupal websites using the ApacheSolr module?
If not please let me know of any other way that this could be achieved.
Thanks
Problem while implementing Lucene
Hi all,
I am facing a problem while implementing Lucene search on my site. Can anybody help me out? I added the Zend Framework for Lucene search to my Drupal site, but I still get the following error when adding the Search Lucene API module.
"The required Zend Framework components of Search Lucene API are not installed. (Currently using Zend Framework components not installed) "
Please comment with your solution and with the site where I can get the components for Search Lucene API.
Solr RDF Support
Overview
This project is about adding RDF Support to the popular ApacheSolr module in the form of a Solr RDF contrib module. The module should be able to read an RDF class specification and automatically generate the necessary mapping to a Solr server, provide the capability to search resources with that type and also generate facets based on its properties. It would even be possible to build the existing Node search capabilities completely on top of this mechanism! But in any case you could also add arbitrary other types like users or taxonomy terms, or resources from other websites altogether.
Search results display
Hi,
I am a newbie to Drupal. I am facing a few issues with the apachesolr module. I am using the apachesolr-6.x-1.0-beta5 module and Solr 1.3.
1) When I search for any content, the results are displayed only as links, not as teasers.
2) When I search for any title, it displays empty results.
3) The "Spell check" is not working for me after checking the checkbox on the apachesolr settings page.
Are there any settings that I need to change? Please help me.
the old opensearchclient module
Just wanted to write to get the views of all here on the opensearchclient module.
I've recently been handed the task of porting a hacked version (was done previously, not by me) of the opensearchclient module which is needed for a site I'm upgrading to D6. The hacked version has been ported, and I noticed the original code has been abandoned.
Help backport ApacheSolr D6 to D5
Hi Everyone,
Today I took a couple hours and attempted a backport of ApacheSolr D6 to D5. This will bring all of the cool features that D6 has to D5. The patch needs work. It doesn't yet work. If anyone has time to chip in and work on the indexing of nodes (currently not working) that'd be great. With a little momentum I think we can do the backport relatively quickly, after which keeping the two in sync will be much easier.
http://drupal.org/node/337735#comment-1240921
Using Apachesolr module as an API
Just want to share my experience with using the apachesolr module as an API. I use apachesolr module purely as interface between my own drupal modules and the Solr instance. Part of the reason to do so is that I use a different schema.xml, compared to the one provided with the apachesolr module.
For the single technical detail, read this issue.
Thanks a lot to the developers for creating the apachesolr module and making it nicely extensible!
Running Solr as a service on Debian
I was looking for a way to run Solr as a service on Debian. eZ Publish CMS has some interesting scripts in its svn repository: http://svn.ez.no/svn/extensions/ezfind/ezp4/trunk/extension/ezfind/bin/s...
http://svn.ez.no/svn/extensions/ezfind/ezp4/trunk/extension/ezfind/java/
I used the solr script from the first link together with the solr.sh in the second, followed the instructions in the solr script and it works well.
Does anybody have another or a better solution?
Does the order of fields in the search query matter in Solr?
Query 1:
field1:value1 field2:value2
Query 1 reversed:
field2:value2 field1:value1
Does it make a difference? Would it be better for performance to put the field that yields the smallest sub-result set first? Or does Solr/Lucene handle this all by itself?
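To make the result-set side of this question concrete, here is a toy inverted index in Python. It is only a sketch: the fields, values and document ids are made up, and it does not claim anything about scoring or about how Solr/Lucene actually schedules clause evaluation. It just shows that, for plain boolean matching, the set of matching documents is the same whichever clause is written first.

```python
from functools import reduce

# Toy inverted index: (field, value) -> set of matching document ids.
# All fields, values and ids here are hypothetical.
INDEX = {
    ("field1", "value1"): {1, 2, 3, 5, 8},
    ("field2", "value2"): {2, 3},
}


def parse(query):
    """Split 'field:value field:value' into (field, value) clauses."""
    return [tuple(clause.split(":", 1)) for clause in query.split()]


def match_any(query):
    """OR semantics: union of the posting sets of all clauses."""
    postings = (INDEX.get(clause, set()) for clause in parse(query))
    return reduce(set.union, postings, set())


def match_all(query):
    """AND semantics: intersect the posting sets, smallest set first."""
    postings = sorted((INDEX.get(clause, set()) for clause in parse(query)), key=len)
    if not postings:
        return set()
    return reduce(set.intersection, postings)


query = "field1:value1 field2:value2"
query_reversed = "field2:value2 field1:value1"
print(match_any(query) == match_any(query_reversed))
print(match_all(query) == match_all(query_reversed))
```

Note that the toy sorts the posting sets by size itself before intersecting, so the order in which the clauses were typed has no influence on the work done. Whether a real engine does something comparable is exactly the open question in this post.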
Search through buddy nodes
Has anyone ever tried, or succeeded, to configure Drupal + Solr to search through the nodes of buddies, that is, to integrate it with buddylist or any other user relationship module? Would it even be possible?
Awesome jQuery + Solr integration
Check out this awesome jQuery/Solr library that David Peterson pointed me to: http://solrjs.solrstuff.org/test/reuters/
Searching over multiple (heterogeneous) indexes
(repost from http://drupal.org/node/296198 by drunkenmoney)
While implementing the attachment indexing mechanism, we (febbraro, robertDouglass and I) stumbled across a problem: how to store the attachment text?
It would be easy to just append it to the "text" field, or add a new, multi-valued field, or both. But then it would be impossible to tell at search time where the term occurred, which, unfortunately, is a requirement, since the attachments should appear directly in the search results, not just links to the nodes containing them.
MySQL configuration
Our database server configuration:
4GB RAM
600GB Hard Disk
Xeon Processor, 1.3 GHz
We are barely able to support 100 concurrent users! What are we doing wrong?
I know I need to configure mysql_query cache, mysql_limit_size and table_cache. But what formula should we use, and how do we go about checking the values?
Below are the details of our my.ini file.
[mysqld]
datadir=/database/data
socket=/var/lib/mysql/mysql.sock
set-variable=max_connections=2000
set-variable = max_allowed_packet=64M
default-storage-engine = innodb
log-bin=/database/data/mysql-bin
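One commonly quoted rule of thumb for the "formula" asked about above is to check that the global buffers plus max_connections times the per-connection buffers fit into RAM. Below is a minimal Python sketch of that arithmetic. Only max_connections=2000 and the 4GB of RAM come from this post; every buffer size in the sketch is a placeholder assumption, to be replaced with the values the server reports (for example via SHOW VARIABLES), and the result is a rough worst case, not a measurement.

```python
MB = 1024 * 1024


def worst_case_memory(global_buffers, per_connection_buffers, max_connections):
    """Rough upper bound in bytes: global buffers + per-connection buffers * connections."""
    per_connection = sum(per_connection_buffers.values())
    return sum(global_buffers.values()) + max_connections * per_connection


# Placeholder sizes (assumptions for illustration, NOT the poster's settings).
global_buffers = {
    "innodb_buffer_pool_size": 1024 * MB,
    "key_buffer_size": 64 * MB,
    "query_cache_size": 64 * MB,
}
per_connection_buffers = {
    "sort_buffer_size": 2 * MB,
    "read_buffer_size": 1 * MB,
    "read_rnd_buffer_size": 1 * MB,
    "join_buffer_size": 1 * MB,
}

# max_connections and RAM size are taken from the post above.
estimate = worst_case_memory(global_buffers, per_connection_buffers, max_connections=2000)
ram = 4 * 1024 * MB
print(f"worst case: {estimate // MB} MB of {ram // MB} MB RAM")
```

With these placeholder numbers the estimate comes to 11152 MB against 4096 MB of RAM, which suggests that a max_connections of 2000 may be worth revisiting before tuning the individual caches.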
Benchmarking in Drupal
Drupal and MySQL are located on two different servers:
Configuration: Drupal server: Dual core processor, 4GB RAM
MySQL Server: Xeon Processor, 4GB RAM
MySQL Enterprise edition 5
In an exercise to pre-populate the database with 1 million records, record insertions into the tables are very slow because the requests are getting queued up in the database. Insertion is done using a special tool that records our Drupal application and plays back the scripts in a loop to populate the database.
Please see the data below, collected by the team.
Benchmark Drupal 6 search and Solr Search
Could you validate our scenarios and the server configurations? We will share the results with the community soon.
BENCHMARK SCENARIOS:
The Scenarios:
Drupal 6.0 Search + Statistics + simul. Cron + replication
Solr Search + Statistics + simul. Cron + replication
Drupal 6.0 Search/ Solr Search with Statistics + replication
Drupal 6.0 Search/ Solr Search + simul. Cron + replication
Server Configuration:
Drupal Site and DB are to be placed in two Dual core servers with 4 GB RAM each
Software details: Linux - RHEL 5, Apache 2.2.3, MySQL Enterprise 5, PHP 5.1.6
ApacheSolr Alpha 3 released
With the help of Drunken Monkey and many others we've come up with an Alpha 3 of the ApacheSolr module. In addition to lots of bugfixes and a performance improvement, there is a new feature: you can set the number of facets per block. Check it out:
http://drupal.org/project/apachesolr
Search for a large Job portal
I need some clarification on the best search algorithms to use. I work with Netlink Technologies. We are currently planning to architect a large job portal in Drupal. We have convinced our organization to use Drupal 6.0 and create custom nodes and modules. We are also planning to benchmark the different search options that we could adopt.
For search, we are trying to understand ApacheSolr and Sphinx search.
Do you think we are proceeding in the right direction? Will Drupal + Solr be a scalable option for a large job portal?
Shyamala
Tech Head
New questions and observations
I hate to mess up this nice groups page with newbie issues but looks like I will.
The ApacheSolr module looks to be very cool but it's not what I need. It's one thing to backend all of Drupal with Solr but it's something altogether different (I think) to integrate Drupal with an existing Solr implementation.
ApacheSolr Search and other 3rd party Drupal search options
This is a BoF discussion that is planned for August 24, 2008 at 16:30 - 17:45 as a part of the FrOSCon conference in Sankt Augustin, Germany.
- What improvements have gone into Drupal search as a result of the Minnesota Search Sprint?
- ApacheSolr demonstration
- Xapian search and Drupal.org
Search is cool. Come talk about it.





