Lucene, Nutch and Solr

We encourage users to post events happening in the community to the community events group on https://www.drupal.org.
This group should probably have more organizers. See documentation on this recommendation.

Lucene is a fabulous indexer, Nutch is a superb web crawler, and Solr can tie them together and offer world class searching. This group discusses the various projects and efforts being made to integrate these technologies with Drupal.

The ApacheSolr module integrates Drupal with the Apache Solr search platform. Solr search can be used as a replacement for core content search and boasts both extra features and better performance. Among the extra features is the ability to have faceted search on facets ranging from content author to taxonomy to arbitrary CCK fields.

Drupal projects that already provide some level of integration with Lucene and/or Nutch:

cilefen's picture

Solr Nutch Search Sandbox Project Updated to Integrate with Common Schema

Hello all:

Based on our discussion last month on IRC, I reconfigured this sandbox project as a few Nutch settings that creates an index compatible with the common schema for the apachesolr module.

http://drupal.org/sandbox/cilefen/1858412

The purpose is ad-hoc crawling and indexing, but searching within Drupal and the results are integrated with the Drupal node results.

This is for Nutch 1.x only at this stage.

Read more
cilefen's picture

Solr Nutch Search Sandbox Project Added

Hi All,

I just added "Solr Nutch Search", a sandbox project.

http://drupal.org/sandbox/cilefen/1858412

I welcome your feedback. Let me know if it is good enough for a full project, in which case I could use a co-maintainer.

-Chris McCafferty

Read more
Nick_vh's picture

Drupal Search and Solr office hours

Start: 
2012-12-05 16:00 - 17:00 UTC
Event type: 
Online meeting (eg. IRC meeting)

Drupal Search has a great ecosystem of modules to integrate with technologies such as Solr. However, it needs more vision and direction to grow and be a great platform where other developers feel comfortable with and are able to make the right decisions. Also We are convinced that if we all come together and talk, get some decisions and actually get to work on a regular basis we can come up with a solution for Drupal that kick a**!

Read more
gaurav-varshney's picture

Searchapi integration with searchapi solr.

i am using searchapi module and serachapi solr module. i have set up solr successfully.
i have setup 2 different instance in one server for two different content types(A,B);
and indexes are created using it.

Read more
niccolox's picture

Nutch 2.1, Solr 4.0 etc

the latest version of Nutch 2.1 seems to work quite nicely with Solr 4.0 and am wondering if others have tried sending results to Search API and / or Apache Solr Search Drupal modules ?

there are lots of possibilities with integrating web-crawls into Drupal views, searches etc

Nutch 2.1 / Solr 4.0 (Gora+Mysql) running using this tutorial
http://nlp.solutions.asia/?p=180

Nutch 2.1 + Aegir BOA?
http://drupal.org/node/1851318

Drupal Nutch module and 2.1?
http://drupal.org/node/1851324

Drupal Elastic Search module and Nutch
http://drupal.org/node/1851064

Read more
evdrupal's picture

Newer version of View3 + Solr article

Hi.

I've installed Solr 3.6.1 and apachesolr (7.x-1.0-rc3) and apachesolr_attachments (7.x-1.2) modules.
It's all running on a Tomcat 7 server.

I can run searches and get the results in a standard page.

I'm working on getting my site's Search results into a Drupal View.

I've installed apachesolr_views module (7.x-1.0-beta1), and found this:

http://www.acquia.com/blog/views-3-apache-solr-acquia-drupal-future-search

It's pretty old and what I'm seeing in that walkthrough is different than what I see on my system.

Read more
vivekisontrack's picture

"Updating" facets after search results have been modified in a hook?

I am using the hook_apachesolr_process_results to modify the result set returned by Solr search. However, the enabled filters/facets do not reflect the modifications to the result set. How can I ensure that the facets reflect the changes to the results?

Thanks!

Read more
aleada's picture

Drupal multisite installation, multicore apache solr and apache solr multisite search

Hi all,
I am new in apache solr technology and drupal integration.
I wan't to ask if is it possible to create the following configuration

Base site ( multisite search including results from site1 and site2 separate indexes )
www.example.com 

subdomain  site1 ( multi site installation with diff db + apache solr core1 index)
site1.example.com

subdomain  site2 ( multi site installation with diff db + apache solr core2 index)
site2.example.com

As i know the main concept for multisite search is the following configuration

Read more
Anonymous's picture

SOLR facet alter

I have a site that is using SOLR search with just a couple facets. The filter by type facet works for my needs out of the box without much adjustment. However, I need to change the markup of the 'sort by' facet to be a 'select' list rather than a unordered list to match the design.

I have been searching for a solution to this, but have yet to find one. The site is in D7 w apachesolr 7.x-1.0-beta16.

Any help would be greatly appreciated.

-Jeff

Read more
dropbydrop's picture

How to install solr 3 in ubuntu?

How can I install solr 3 in latest ubuntu manually?
- fixed -

Read more
mimilamite's picture

Question for event search engine

Hi

I've been looking for times for a system to do my project :
It is an event search engine. Quite common thing.
But here in France, very few event sources have rss, xml, or anything that
permit event syndication.

Here's the idea : index given websites, or even scan paper programs with
OCR, input into the engine,
And it sorts out itself what is the start date, end date, where the event
takes place (city name), The title and the description of the event.
Then it makes a list out of these events for people to go out.

Read more
pinkonomy's picture

How to replicate the Search of this site?

Hi,
I want to replicate the search functions of this site http://www.zocdoc.com/ .

1)First,on the front page I want to have a drop down list to select from,as on the site the
"Find a doctor or dentist specialist" drop down select list
2)At the bottom of the page I want to filter by taxonomy terms for example By City etc.
3)By name as here http://www.zocdoc.com/directory .Eg with a search box.

How can I replicate these 3 kind of searches?
Thanks in advance,I would be grateful for every answer

Read more
sethhill's picture

Desiring help with Solr, Nutch, Facet API

Mathematic Arts is a Drupal development firm in Milwaukee. We recently developed a web site for a research library in Drupal 7, and implemented Solr and Nutch for the search facility. We are using Facet API to filter search results based on a few simple criteria, but would like to do some more complicated filters and to improve the user experience.

For example:

  • Have a facet like content type, but that aggregates many of the general content types that are meaningless to a user. For example:
Read more
haojiang's picture

introduce a high performance free hosting (for drupal)--sae---a GAE like China project , i just finished deploy drupal 6 there

ok , example first to prove that i am telling the truth : http://trackself.sinaapp.com/ just feel its speed!

[background:]
1.SAE: sina app engine , belong to sina.com (China), it's a service like google app engine ,but provide PHP hosting and mysql hosting and more! well , i think it's better than GAE, because i only know php but not python/java. It declare itself cloud service. And better than GAE, it use SVN.
2.Since it offer php and mysql hosting , drupal can be hosted. But a little hack to install.php , nothing more hack

Read more
pwolanin's picture

Feedback wanted for apachesolr beta9 (moving towards RC1)

Thanks in large part to the talented Nick_vh who is interning with Acquia as the final stage of his master's degree, we've been making progress on a number of outstanding issues for apachesolr 7.x. If you are using the module, please take a look at http://drupal.org/node/1293570 which lists remaining issues to be included in the next beta release (possibly the end of this week).

The patch to improve custom search pages includes a number of re-arrangements for the UI, so please try it out: http://drupal.org/node/1294846

Read more
cpliakas's picture

Bye Bye, Search Lucene API!

Search has been a passion of mine for some time now. In many ways the Search Lucene API module was my introduction to Drupal and helped me understand and love the platform. Therefore I am somewhat sad to announce that Search Lucene API has reached "end of life".

Read more
Lex van Sonderen's picture

Drupal Tech Talk 2.0

Start: 
2011-09-22 17:00 - 20:00 Europe/Amsterdam
Organizers: 
Event type: 
User group meeting

After the first super-nerdy Drupal Tech Talk in Rotterdam, Hoppinger, Proteon and Triquanta kindly invite you to the second Drupal Tech Talk on September 22nd at the Proteon offices in Delft, the Netherlands. The Drupal Tech Talk is a meetup specifically targeted at Drupal developers in Belgium and the Netherlands. The talks and sessions dive deep technically. You can count on poorly designed slides with lots of code and risky live demos.

Read more
Anonymous's picture

Connecting to external systems

I've been asked to look at a system for a museum who have an existing collection database they want to be able to search using Solr from Drupal 7. I've used Solr in quite a few projects and always periodically imported the data but at the moment they'd rather just connect the two via an as-yet-undefined interface. Is this a possible reality or should I stick to my guns re periodic import? I prefer import as you don't have to worry if the connection goes down, plus Drupal 'understands' nodes, plus I want to index it with Solr, which I know how to do easily with Drupal!

thx

Read more
kmoll's picture

Keeping Facet Counts when filtering query

I am using Apachesolr module with Drupal 6. I have only one facet so far on my searching and that is the node type. I have a list of links that is supposed to filter the query. When I put in a search term, I get all the matching results and the list of links with the correct counts for each node type. However, when I click a node type it returns just the result for that node type, but now all the counts in my link list have turned to 0, except for that node type which shows the correct amount, I have it set a $_REQUEST['filter'] parameter.

Read more
picxelplay's picture

5 sites on the same machine using one java machine?

I have 5 independent sites on one machine. I also have another machine running just Java. How can I get all 5 sites to use the same 1 java machine for Solr? Would I install Solr 5 times for each of the sites? Would each Solr instance be installed in each of the sites /home directory; instead of say /usr/local/share. When the apachesolr.modules are installed on each site, how would I have it recognize just it's own instance and not the other 4? Or is there a better way to do this?

Read more
Subscribe with RSS Syndicate content

Lucene, Nutch and Solr

Group organizers

Group categories

Projects

Group notifications

This group offers an RSS feed. Or subscribe to these personalized, sitewide feeds:

Hot content this week