Result grouping and faceting are in essence two different search features. Faceting counts the number of hits for specific field values matching the current query. Result grouping groups documents together with a common property and places these documents under a group. These groups are used as the hits in the search result. Usually result grouping […]
Result grouping made easier
Lucene has result grouping for a while now as a contrib in Lucene 3.x and as a module in the upcoming 4.0 release. In both releases the actual grouping is performed with Lucene Collectors. As a Lucene user you need to use various of these Collectors in searches. However these Collectors have many constructor arguments. […]
Query time joining in Lucene
Recently query time joining has been added to the Lucene join module in the Lucene svn trunk. The query time joining will be included in the Lucene 4.0 release and there is a possibility that it will also be included in Lucene 3.6. Lets say we have articles and comments. With the query time join […]
IndexDocValues – their applications
From a user’s perspective Lucene’s IndexDocValues is a bunch of values per document. Unlike Stored Fields or FieldCache, the IndexDocValues’ values can be retrieved quickly and efficiently as Simon Willnauer describes in his first IndexDocValues blog post. There are many applications that can benefit from using IndexDocValues for search functionality like flexible scoring, faceting, sorting, […]
Apache Solr – Grouping update
Apache Solr’s result grouping feature is now a widely used feature. The major drawback was that grouping (also known as sharding in Solr) was initially not supported for distributed searches. The good news is that recently distributed grouping has been added to Solr! It has been added as the trunk and the stable branch (branch3x). […]
SearchWorkings: Apache Solr – Grouping update
Apache Solr’s result grouping feature is now a widely used feature. The major drawback was that grouping was not supported for distributed searches which also know as sharding in Solr. The good news is that recently distributed grouping has been added to Solr! It has been added the trunk and the stable branch (branch3x). This […]
Search Result Grouping / Field Collapsing in Lucene / Solr
Grouping of search results or also known as field collapsing is often a requirement for search projects. As described earlier this functionality was added to Solr and happens to be one of the most wanted features in Solr. Recently result grouping was added to Lucene as contrib in Lucene 3.1 and a module in 4.0. […]
Indexing your Samba/Windows network shares using Solr
Many of JTeam’s clients want to search the content of their existing network shares as part of their Enterprise Search infrastructure. Over the last couple of years, more and more people are switching to Apache Lucene / Solr as their preferred, open source search solution. However, many still have the misconception that it is not […]
Language analysis comparable to Fast / Endeca for Solr
Good, solid language analysis is a very important asset for the quality of your search results. It is one of the features that for instance Microsoft Fast and Endeca are using as one of their unique selling points. However, you can get the same powerful analysis when using Apache Solr to implement your search.
Result grouping / Field Collapsing with Solr
In a number of search projects that I have done using Lucene and Solr there was a lot of almost identical data. From a user perspective, when searching the first result pages were full of documents that look very similar, for instance getting a full page of the same car model, where only the edition […]