Lucene, Nutch and Solr

Lucene is a fabulous indexer, Nutch is a superb web crawler, and Solr can tie them together and offer world-class searching. This group discusses the various projects and efforts being made to integrate these technologies with Drupal.

The ApacheSolr module integrates Drupal with the Apache Solr search platform. Solr search can be used as a replacement for core content search and boasts both extra features and better performance. Among the extra features is the ability to have faceted search on facets ranging from content author to taxonomy to arbitrary CCK fields.

Drupal projects that already provide some level of integration with Lucene and/or Nutch:

sethhill's picture

Desiring help with Solr, Nutch, Facet API

Mathematic Arts is a Drupal development firm in Milwaukee. We recently developed a web site for a research library in Drupal 7, and implemented Solr and Nutch for the search facility. We are using Facet API to filter search results based on a few simple criteria, but would like to do some more complicated filters and to improve the user experience.

For example:

  • Have a facet like content type, but that aggregates many of the general content types that are meaningless to a user. For example:
1 comment

Introducing high-performance free hosting for Drupal: SAE, a GAE-like project in China (I just finished deploying Drupal 6 there)

OK, an example first to prove that I am telling the truth: http://trackself.sinaapp.com/ Just feel its speed!

[background:]
1. SAE: Sina App Engine, which belongs to sina.com (China), is a service like Google App Engine, but it provides PHP hosting, MySQL hosting, and more. I think it's better than GAE because I only know PHP, not Python or Java. It describes itself as a cloud service. Another advantage over GAE: it uses SVN.
2. Since it offers PHP and MySQL hosting, Drupal can be hosted there. It needs a small hack to install.php and nothing more.

17 comments
pwolanin's picture

Feedback wanted for apachesolr beta9 (moving towards RC1)

Thanks in large part to the talented Nick_vh who is interning with Acquia as the final stage of his master's degree, we've been making progress on a number of outstanding issues for apachesolr 7.x. If you are using the module, please take a look at http://drupal.org/node/1293570 which lists remaining issues to be included in the next beta release (possibly the end of this week).

The patch to improve custom search pages includes a number of re-arrangements for the UI, so please try it out: http://drupal.org/node/1294846

1 comment

Bye Bye, Search Lucene API!

Search has been a passion of mine for some time now. In many ways the Search Lucene API module was my introduction to Drupal and helped me understand and love the platform. Therefore I am somewhat sad to announce that Search Lucene API has reached "end of life".

2 comments
Lex van Sonderen's picture

Drupal Tech Talk 2.0

Start: 
2011-09-22 17:00 - 20:00 Europe/Amsterdam
Organizers: 

After the first super-nerdy Drupal Tech Talk in Rotterdam, Hoppinger, Proteon and Triquanta kindly invite you to the second Drupal Tech Talk on September 22nd at the Proteon offices in Delft, the Netherlands. The Drupal Tech Talk is a meetup specifically targeted at Drupal developers in Belgium and the Netherlands. The talks and sessions dive deep technically. You can count on poorly designed slides with lots of code and risky live demos.

1 comment
stevepurkiss's picture

Connecting to external systems

I've been asked to look at a system for a museum who have an existing collection database they want to be able to search using Solr from Drupal 7. I've used Solr in quite a few projects and always periodically imported the data but at the moment they'd rather just connect the two via an as-yet-undefined interface. Is this a possible reality or should I stick to my guns re periodic import? I prefer import as you don't have to worry if the connection goes down, plus Drupal 'understands' nodes, plus I want to index it with Solr, which I know how to do easily with Drupal!

thx

12 comments
kmoll's picture

Keeping Facet Counts when filtering query

I am using the Apachesolr module with Drupal 6. So far my search has only one facet, the node type, and a list of links that is supposed to filter the query. When I enter a search term, I get all the matching results and the list of links with the correct counts for each node type. However, when I click a node type, it returns just the results for that node type, and all the counts in my link list turn to 0, except for the selected node type, which shows the correct amount. Each link sets a $_REQUEST['filter'] parameter.
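
One approach that may apply here is Solr's tagging and excluding of filters: the filter query is given a tag, and the facet field is told to ignore that tag when counting, so the other node types keep their counts. The sketch below only builds the raw Solr request parameters in Python to show the idea. The field name `type`, the tag name `typefilter`, and the function name are assumptions for illustration, and it does not show how the Apachesolr module itself would need to be changed.

```python
from urllib.parse import urlencode


def build_facet_params(keywords, selected_type=None, facet_field="type"):
    """Build Solr request parameters for a keyword search with one facet.

    When a node type is selected, the filter is tagged and the facet
    excludes that tag, so counts for the other types are still computed
    against the unfiltered keyword results.
    """
    params = [("q", keywords), ("facet", "true"), ("facet.mincount", "1")]
    if selected_type is None:
        # No filter yet: plain facet on the field.
        params.append(("facet.field", facet_field))
    else:
        tag = "typefilter"  # hypothetical tag name
        # Tag the filter query ...
        params.append(("fq", "{!tag=%s}%s:%s" % (tag, facet_field, selected_type)))
        # ... and exclude the tagged filter when counting this facet.
        params.append(("facet.field", "{!ex=%s}%s" % (tag, facet_field)))
    return params


if __name__ == "__main__":
    # Query string that would be appended to the Solr select URL.
    print(urlencode(build_facet_params("drupal", selected_type="story")))
```

The search results are still restricted to the selected node type; only the facet counts are computed without that restriction.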

2 comments
picxelplay's picture

5 sites on the same machine using one java machine?

I have 5 independent sites on one machine. I also have another machine running just Java. How can I get all 5 sites to use the same Java machine for Solr? Would I install Solr 5 times, once for each site? Would each Solr instance be installed in each site's /home directory instead of, say, /usr/local/share? When the apachesolr module is installed on each site, how would I have it recognize just its own instance and not the other 4? Or is there a better way to do this?
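
One arrangement that may fit this setup is a single Solr installation on the Java machine running several cores (Solr's multicore feature), one core per site, with each site's apachesolr settings pointing at its own core URL. The Python sketch below only illustrates the resulting one-URL-per-site mapping. The host name, the core names, and the default port 8983 are assumptions, not values from the post.

```python
def core_urls(solr_host, sites, port=8983):
    """Map each site name to its own core URL on one shared Solr server."""
    if len(set(sites)) != len(sites):
        raise ValueError("each site needs a distinct core name")
    # One core per site: http://<host>:<port>/solr/<core>
    return {
        site: "http://%s:%d/solr/%s" % (solr_host, port, site)
        for site in sites
    }


if __name__ == "__main__":
    # Hypothetical host and site names.
    sites = ["site1", "site2", "site3", "site4", "site5"]
    for site, url in core_urls("search.example.com", sites).items():
        print(site, "->", url)
```

Because every site talks to a different core, the indexes stay separate even though only one Solr server is installed.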

4 comments
Beanjammin's picture

Index OG members?

I have a site that is based on Open Atrium and currently using solr for site search. I would like to be able to add organic group members to the search index so that I can search for users within a particular OG. I would appreciate suggestions on how best to approach this.

Solr is set up via the Chapter 3 open atrium features module here http://features.chapterthree.com/openatrium-apachesolr-search/1-0-0, which uses apachesolr, apachesolr_og, apachesolr_search, and apachesolr_nodeaccess. In addition, Open Atrium uses nodeprofile so users' profiles are already in the index.

What I have considered doing is adding users' profile nodes to the same groups that they belong to. This would then make them searchable by OG, however my concern is that I would like everyone to be able to search for users across all OGs, including private OGs, and this would result in permission issues.

I would really appreciate any suggestions. Thanks.


AJAX Guided Form Facets

I am using Drupal 6.22 / module apacheSolr 1.5 / Java based Solr 3.1
I would like to extend the default solr search form to include location input and guided facets in an expanded state (displayed via AJAX as part of the form, before any search results are displayed):

1) Location input (postal code field and slider to set proximity range connects to Solr Geo-spatial)
http://wiki.apache.org/solr/SpatialSearch
http://thedrupalblog.com/geospatial-apache-solr-searching-drupal-6-upgra...

2) AJAX Guided facets - based on term user types, with autosuggest module enabled.
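
For the location input in item 1, the Solr spatial search documentation linked above describes a `geofilt` filter that restricts results to a radius around a point. A minimal Python sketch of building such a filter query follows, assuming the postal code has already been translated to a latitude and longitude by some geocoding step outside the sketch. The field name `location` and the function name are hypothetical.

```python
def geofilt_fq(lat, lon, radius_km, field="location"):
    """Build a Solr filter query limiting results to a radius around a point.

    lat/lon: the centre point (e.g. looked up from the postal code).
    radius_km: the distance chosen with the proximity slider.
    field: the spatial field in the Solr schema (assumed name).
    """
    return "{!geofilt sfield=%s pt=%s,%s d=%s}" % (field, lat, lon, radius_km)


if __name__ == "__main__":
    # Example centre point and a 25 km slider value (illustrative numbers).
    print(geofilt_fq(43.04, -87.91, 25))
```

The returned string would be sent as an `fq` parameter alongside the normal keyword query.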

7 comments · 1 attachment

Introducing Facet API

The Problem

The search community is fragmented. The problem stems from a core search module that doesn't facilitate third party backends, so each project is forced to solve similar problems in slightly different ways. Each contributed module has its own isolated sub-community, which is detrimental to Drupal as a whole.

4 comments

ApacheSolr Live Search / Nodereference Autocomplete

I'm looking for a way to implement live search results in a drop down menu. Visually it would look and act almost identical to apachesolr_autocomplete, however reporting the top 5 search results instead of search suggestions.

I'm hoping that an existing module is out there, although I've scoured as much as I can. I would like to develop the module myself, but the project I'm on simply doesn't have the budget for it :(

Thanks for any advice!

3 comments

Roadmap for 7.x-1.x and 6.x-* for Apache Solr module

transferred from http://drupal.org/node/1090080

Please make your notes or changes inline, preferably to reflect IRC or other more real-time discussions.

We had a BoF and many discussions at DrupalCon Chicago 2011; here's the takeaway:

  • Abandon any attempt to keep the schema in sync with 6.x-1.x and remove most or all node-specific fields. Mostly done at http://drupal.org/node/1088208

  • get some Views integration asap (volunteers?)
  • support multiple sorts
  • Move filter URL params from ?filters= to multiple ?f[]= params
  • Improve UI w.r.t. settings per server
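
To make the roadmap item about moving from `?filters=` to multiple `?f[]=` parameters concrete, here is a small Python sketch of the two URL shapes. The filter strings `type:story` and `tid:5` are hypothetical examples, and the assumption that the old style packs all filters into one space-separated value is mine, not something stated in the roadmap.

```python
from urllib.parse import urlencode, parse_qsl


def legacy_query(filters):
    """Old style: every filter packed into one ?filters= value."""
    return urlencode({"filters": " ".join(filters)})


def multi_query(filters):
    """New style: one ?f[]= parameter per filter."""
    return urlencode([("f[]", f) for f in filters])


if __name__ == "__main__":
    filters = ["type:story", "tid:5"]  # hypothetical filter strings
    print(legacy_query(filters))
    print(multi_query(filters))
    # Parsing the new style gives each filter back as its own value.
    print(parse_qsl(multi_query(filters)))
```

With one parameter per filter, no delimiter is needed inside the value, so a filter value that itself contains a space does not have to be split apart again on the receiving side.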
    atomicjeep's picture

    Solr Multisite Search

    Hi There,
    This regards http://drupal.org/project/apachesolr_multisitesearch
    I'm trying to enable Multisite Search Facets (based on Taxonomies) for a few sites. The 'normal' multisite facets such as 'filter by site', 'current search' etc. appear fine, but facets based on Taxonomies do not appear no matter what I do. The facets are enabled and the blocks added to regions, caches cleared etc.

    Tested in Drupal Core 6.20 with the latest stable version of Apache Solr & Apache Solr Multisite Search

    Has anyone else successfully enabled this functionality?

    Any help greatly appreciated

    3 comments
    bhp's picture

    Setting the locale through the Drupal Solr API?

    We have a website which is mostly English, but some of the fields in some of our content types are in another language. This other language has different rules for alphabetization, etc. We'd like to be able to sort searches on these fields using the appropriate locale.

    You can do this in the Solr schema.xml file by adding the appropriate locale="..." attribute to the field definitions. I'm wondering whether there's a way to do this through the Drupal API, so that we don't need to modify the schema.xml file. Has anyone else looked into this idea?


    April 8-13 2011 Apache Solr Search Integration sprint goals

    Please sign up at http://groups.drupal.org/node/138324

    For real-time chat or to connect with sprinters, join #drupal-apachesolr in IRC

    There will be two Skype kick-off calls of about 30 min each on Friday, April 8. One at 8 am EDT and one at 3 pm 4 pm EDT (hopefully that will span enough timezones). Join IRC beforehand and share your Skype name there to be called in.

    At the outset, some possible high-level goals are:

  • integration with Facet API
  • integration with Views
  • UI improvements
  • Expanded test coverage

    1 comment

    Trying apachesolr views but which views version to use?

    Hey,

    I already created an issue but got no response. Is anyone here using apachesolr_views? If so, please tell me which version of Views you use (or which dev snapshot). I tried 6.x-3.x-dev and alpha3, but they report errors.

    regards Volkan

    pwolanin's picture

    Apache Solr Search Integration virtual sprint

    Start: 
    2011-04-08 (All day) - 2011-04-13 (All day) America/New_York

    We are planning a virtual sprint focused on implementing new features for the 7.x version, as well as stabilizing it and moving towards an RC release.

    Additional work may begin on a 6.x-3.x branch.

    Watch this space for links to discussions and other organizing posts within the "Sprints" group.

    Sign up for this event if you are interested in participating.

    6 comments

    Upgrading servers with Solr / MySQL

    Hello,

    I'm about to move from development to launch on a project that uses Solr to index around 800,000 nodes. At present it is on a dedicated server with 4 GB of RAM, along with about 15 other websites I host. It is clear that under load I am experiencing slowdowns in MySQL from a lack of memory, and from what I've read, the RAM allocated to Solr appears to be a major factor in how fast and responsive Solr searches are.

    What I'd like to know is

    • Should I be aiming to upgrade this server with more memory, or to launch a new box that is dedicated to mysql/solr/both?
    ygerasimov's picture

    Weird wildcard search with EDisMax handler

    I am using Solr 1.4.1 with EDisMax according to http://drupal.org/node/713142

    When I search for part of the word, I see results properly. (In my case the search is for *verhuiz*.)
    But when I search for a bigger part of the word, I get no results. (The next search is for *verhuizin*.)

    I know that the original word is "verhuizing" and that it does exist in my documents.

    Please find attached responses and details of the Solr.

    Can anyone advise what might be wrong?

    3 comments · 3 attachments