We need to implement html node titles with bbcode (as per http://drupal.org/node/28537) for a client site that's using ApacheSolr. The titles need to be displayed in search results with their html intact so the bbcode version has to get indexed. I need to tell Solr to ignore this code when trying to match queries. For example, if a user searches for "blue smurf" (with the quotation marks), and there's a node with the title "[strong]blue[/strong] smurf" in the index, Solr needs to recognise this as a match. I thought that this was what the PatternReplaceFilterFactory was for and addded the following to the query analyzer for the "text" fieldtype definitation:
<filter class="solr.PatternReplaceFilterFactory" pattern="[\/?(em|i|b|strong|u)]" replacement="" replace="all"/>
... but it's not doing the trick. Maybe I'm misunderstanding how this should work. Anyone familiar with this kind of thing who could give me some guidance?