Lucene is a fabulous indexer, Nutch is a superb web crawler, and Solr can tie them together and offer world-class searching. This group discusses the various projects and efforts being made to integrate these technologies with Drupal.
The ApacheSolr module integrates Drupal with the Apache Solr search platform. Solr search can be used as a replacement for core content search and boasts both extra features and better performance. Among the extra features is the ability to have faceted search on facets ranging from content author to taxonomy to arbitrary CCK fields.
Drupal projects that already provide some level of integration with Lucene and/or Nutch:
Solr Nutch Search Sandbox Project Updated to Integrate with Common Schema
Hello all:
Based on our discussion last month on IRC, I reconfigured this sandbox project as a set of Nutch settings that create an index compatible with the common schema for the apachesolr module.
http://drupal.org/sandbox/cilefen/1858412
The purpose is ad-hoc crawling and indexing, with searching done within Drupal and the crawl results integrated with the Drupal node results.
This is for Nutch 1.x only at this stage.
Solr Nutch Search Sandbox Project Added
Hi All,
I just added "Solr Nutch Search", a sandbox project.
http://drupal.org/sandbox/cilefen/1858412
I welcome your feedback. Let me know if it is good enough for a full project, in which case I could use a co-maintainer.
-Chris McCafferty
Drupal Search and Solr office hours
Drupal Search has a great ecosystem of modules for integrating with technologies such as Solr. However, it needs more vision and direction to grow into a platform where other developers feel comfortable and are able to make the right decisions. We are also convinced that if we all come together and talk, make some decisions, and actually get to work on a regular basis, we can come up with a solution for Drupal that kicks a**!
Searchapi integration with searchapi solr.
I am using the searchapi module and the searchapi solr module. I have set up Solr successfully.
I have set up 2 different instances on one server for two different content types (A, B),
and the indexes have been created with them.
Nutch 2.1, Solr 4.0 etc
The latest version of Nutch, 2.1, seems to work quite nicely with Solr 4.0, and I am wondering if others have tried sending results to the Search API and/or Apache Solr Search Drupal modules?
There are lots of possibilities for integrating web crawls into Drupal views, searches, etc.
Nutch 2.1 / Solr 4.0 (Gora + MySQL), running using this tutorial:
http://nlp.solutions.asia/?p=180
Nutch 2.1 + Aegir BOA?
http://drupal.org/node/1851318
Drupal Nutch module and 2.1?
http://drupal.org/node/1851324
Drupal Elastic Search module and Nutch
http://drupal.org/node/1851064
Newer version of View3 + Solr article
Hi.
I've installed Solr 3.6.1 and apachesolr (7.x-1.0-rc3) and apachesolr_attachments (7.x-1.2) modules.
It's all running on a Tomcat 7 server.
I can run searches and get the results in a standard page.
I'm working on getting my site's Search results into a Drupal View.
I've installed apachesolr_views module (7.x-1.0-beta1), and found this:
http://www.acquia.com/blog/views-3-apache-solr-acquia-drupal-future-search
It's pretty old, and what I'm seeing in that walkthrough is different from what I see on my system.
"Updating" facets after search results have been modified in a hook?
I am using the hook_apachesolr_process_results to modify the result set returned by Solr search. However, the enabled filters/facets do not reflect the modifications to the result set. How can I ensure that the facets reflect the changes to the results?
Thanks!
Drupal multisite installation, multicore apache solr and apache solr multisite search
Hi all,
I am new to Apache Solr technology and Drupal integration.
I want to ask whether it is possible to create the following configuration:
Base site ( multisite search including results from site1 and site2 separate indexes )
www.example.com
subdomain site1 ( multi site installation with diff db + apache solr core1 index)
site1.example.com
subdomain site2 ( multi site installation with diff db + apache solr core2 index)
site2.example.com
As far as I know, the main concept for multisite search is the following configuration
SOLR facet alter
I have a site that is using SOLR search with just a couple of facets. The filter by type facet works for my needs out of the box without much adjustment. However, I need to change the markup of the 'sort by' facet to be a 'select' list rather than an unordered list to match the design.
I have been searching for a solution to this, but have yet to find one. The site is in D7 with apachesolr 7.x-1.0-beta16.
Any help would be greatly appreciated.
-Jeff
How to install solr 3 in ubuntu?
How can I install Solr 3 on the latest Ubuntu manually?
- fixed -
Question for event search engine
Hi
I've been looking for some time for a system to build my project:
It is an event search engine. Quite a common thing.
But here in France, very few event sources have RSS, XML, or anything that
permits event syndication.
Here's the idea: index given websites, or even scan paper programs with
OCR, and input them into the engine.
It then works out by itself the start date, the end date, where the event
takes place (city name), and the title and description of the event.
Then it makes a list of these events for people looking to go out.
How to replicate the Search of this site?
Hi,
I want to replicate the search functions of this site http://www.zocdoc.com/ .
1) First, on the front page I want to have a drop-down list to select from, like the site's
"Find a doctor or dentist specialist" drop-down select list.
2) At the bottom of the page I want to filter by taxonomy terms, for example by city.
3) By name, as here: http://www.zocdoc.com/directory, e.g. with a search box.
How can I replicate these 3 kinds of searches?
Thanks in advance; I would be grateful for every answer.
Desiring help with Solr, Nutch, Facet API
Mathematic Arts is a Drupal development firm in Milwaukee. We recently developed a web site for a research library in Drupal 7, and implemented Solr and Nutch for the search facility. We are using Facet API to filter search results based on a few simple criteria, but would like to do some more complicated filters and to improve the user experience.
For example:
- Have a facet like content type, but that aggregates many of the general content types that are meaningless to a user. For example:
Introducing high-performance free hosting for Drupal: SAE, a GAE-like project from China; I just finished deploying Drupal 6 there
OK, an example first to prove that I am telling the truth: http://trackself.sinaapp.com/ Just feel its speed!
[background:]
1. SAE: Sina App Engine, which belongs to sina.com (China), is a service like Google App Engine, but it provides PHP hosting, MySQL hosting, and more. I think it's better than GAE because I only know PHP, not Python/Java. It describes itself as a cloud service. And, better than GAE, it uses SVN.
2. Since it offers PHP and MySQL hosting, Drupal can be hosted there. It needs a small hack to install.php, and nothing more.
Feedback wanted for apachesolr beta9 (moving towards RC1)
Thanks in large part to the talented Nick_vh who is interning with Acquia as the final stage of his master's degree, we've been making progress on a number of outstanding issues for apachesolr 7.x. If you are using the module, please take a look at http://drupal.org/node/1293570 which lists remaining issues to be included in the next beta release (possibly the end of this week).
The patch to improve custom search pages includes a number of re-arrangements for the UI, so please try it out: http://drupal.org/node/1294846
Bye Bye, Search Lucene API!
Search has been a passion of mine for some time now. In many ways the Search Lucene API module was my introduction to Drupal and helped me understand and love the platform. Therefore I am somewhat sad to announce that Search Lucene API has reached "end of life".
Drupal Tech Talk 2.0
After the first super-nerdy Drupal Tech Talk in Rotterdam, Hoppinger, Proteon and Triquanta kindly invite you to the second Drupal Tech Talk on September 22nd at the Proteon offices in Delft, the Netherlands. The Drupal Tech Talk is a meetup specifically targeted at Drupal developers in Belgium and the Netherlands. The talks and sessions dive deep technically. You can count on poorly designed slides with lots of code and risky live demos.
Connecting to external systems
I've been asked to look at a system for a museum that has an existing collection database they want to be able to search using Solr from Drupal 7. I've used Solr in quite a few projects and have always periodically imported the data, but at the moment they'd rather just connect the two via an as-yet-undefined interface. Is this realistic, or should I stick to my guns on periodic import? I prefer import because you don't have to worry if the connection goes down, Drupal 'understands' nodes, and I want to index the data with Solr, which I know how to do easily with Drupal!
thx
Keeping Facet Counts when filtering query
I am using the Apachesolr module with Drupal 6. So far I have only one facet on my search, and that is the node type. I have a list of links that is supposed to filter the query. When I put in a search term, I get all the matching results and the list of links with the correct counts for each node type. However, when I click a node type, it returns just the results for that node type, but now all the counts in my link list have turned to 0, except for that node type, which shows the correct amount. I have it set a $_REQUEST['filter'] parameter.
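One approach that Solr itself offers for this situation is tagging the filter and excluding that tag when the facet is counted, so the counts for the other node types are computed as if the filter were not applied. The sketch below only builds the raw Solr request parameters; the field name `type`, the tag name `typefilter`, and the helper `build_facet_query` are assumptions for illustration, and it does not show how the Apachesolr module would need to be changed to send them.

```python
from urllib.parse import urlencode


def build_facet_query(keywords, selected_type=None, facet_field="type"):
    """Build a Solr query string that keeps facet counts for the filtered field.

    facet_field="type" and the tag name "typefilter" are assumptions,
    not values taken from the post above.
    """
    params = [
        ("q", keywords),
        ("facet", "true"),
        ("facet.mincount", "1"),
    ]
    if selected_type is None:
        # No filter active: a plain facet on the field is enough.
        params.append(("facet.field", facet_field))
    else:
        tag = "typefilter"
        # Tag the filter query so it can be referred to by name.
        params.append(("fq", "{!tag=%s}%s:%s" % (tag, facet_field, selected_type)))
        # Exclude the tagged filter when counting this facet.
        params.append(("facet.field", "{!ex=%s}%s" % (tag, facet_field)))
    return urlencode(params)


if __name__ == "__main__":
    print(build_facet_query("drupal", selected_type="story"))
```

With the filter tagged and excluded this way, the result list is still restricted to the selected node type, while the facet counts are calculated against the unfiltered keyword search.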
5 sites on the same machine using one java machine?
I have 5 independent sites on one machine. I also have another machine running just Java. How can I get all 5 sites to use the same Java machine for Solr? Would I install Solr 5 times, once for each of the sites? Would each Solr instance be installed in each site's /home directory instead of, say, /usr/local/share? When the apachesolr modules are installed on each site, how would I have each one recognize just its own instance and not the other 4? Or is there a better way to do this?
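A common alternative to five separate installs is a single Solr instance running in multicore mode, with one core per site, so that each site is pointed at its own core URL. The sketch below only illustrates that URL layout; the host name `solr.example.internal`, port 8983, the `/solr` path, and the core names `site1` to `site5` are all assumptions, not details from the post.

```python
def core_url(host, port, core, path="/solr"):
    """Return the base URL of one core on a multicore Solr server."""
    return "http://%s:%d%s/%s" % (host, port, path.rstrip("/"), core)


# Hypothetical core names, one per independent site.
SITES = ["site1", "site2", "site3", "site4", "site5"]

# Each site gets its own core (and therefore its own index) on the same server.
SOLR_URLS = {site: core_url("solr.example.internal", 8983, site) for site in SITES}

if __name__ == "__main__":
    for site, url in sorted(SOLR_URLS.items()):
        print(site, "->", url)
```

Each site would then be configured with only its own core URL, which keeps the five indexes separate while sharing one Java process.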