Improve scroll search by using Lucene's IndexSearcher#searchAfter(...) #4940

martijnvg · 2014-01-29T13:12:52Z

Improve the regular scroll search by using Lucene's searchAfter, which allows subsequent scroll request to always have a priority queue size equal to the specified size in the first search request. (priority queue is used to collect the competitive hits that match with a query)

Currently the priority queue size grows with each subsequent scroll request with what has been specified in from of the first search request.

Note: scan scroll is unaffected by this issue, which already is a highly optimized search to fetch a large part or all docs from a cluster. Scan scroll forcefully sort the hits always by the Lucene docids, while with the regular scroll can now support any sort efficiently.

The text was updated successfully, but these errors were encountered:

nik9000 · 2014-01-29T13:30:58Z

Is the idea to be able to scroll a non-scan search?

martijnvg · 2014-01-29T13:59:15Z

This is already possible, the scroll parameter can also be used on non scan search requests.

nik9000 · 2014-01-29T14:15:00Z

Hey, neat. That is kinda documented on the scroll page but it is implied that scroll is a scan thing. So this enhancement will make it more efficient to scroll without scan?

martijnvg · 2014-01-29T14:36:13Z

Yes, this enhancement will make scroll without scan more efficient.

The memory usage will be improved from O(from+size) to O(size) and also collecting the competitive hits for a query will be improved from O(numHits + log(from+size)) to O(numHits + (log(size)). This improvement becomes really noticeable when scrolling deep into a result set.

…of regular search methods which rely on `from` for pagination. This prevents the creation of priority queues of `from + size`, instead the size of the priority queue will always be equal to `size`. Closes #4940

tlrx · 2014-03-21T13:17:36Z

nice, thanks for this optimisation!

ghost assigned martijnvg Jan 29, 2014

martijnvg mentioned this issue Jan 31, 2014

Improve scroll search by using IndexSearcher#searchAfter(...) #4968

Merged

clintongormley mentioned this issue Mar 14, 2014

OutOfMemoryError Java heap space while setting a high 'size' value for a query #5430

Closed

s1monw added v1.2.0 and removed v1.1.0 labels Mar 20, 2014

martijnvg closed this as completed in 947c5f6 Mar 21, 2014

jpountz mentioned this issue Oct 22, 2014

Search: Expose Lucene's searchAfter in the search API #8192

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve scroll search by using Lucene's IndexSearcher#searchAfter(...) #4940

Improve scroll search by using Lucene's IndexSearcher#searchAfter(...) #4940

martijnvg commented Jan 29, 2014

nik9000 commented Jan 29, 2014

Uh oh!

martijnvg commented Jan 29, 2014

Uh oh!

nik9000 commented Jan 29, 2014

Uh oh!

martijnvg commented Jan 29, 2014

Uh oh!

tlrx commented Mar 21, 2014

Uh oh!

Improve scroll search by using Lucene's IndexSearcher#searchAfter(...) #4940

Improve scroll search by using Lucene's IndexSearcher#searchAfter(...) #4940

Comments

martijnvg commented Jan 29, 2014

nik9000 commented Jan 29, 2014

Uh oh!

martijnvg commented Jan 29, 2014

Uh oh!

nik9000 commented Jan 29, 2014

Uh oh!

martijnvg commented Jan 29, 2014

Uh oh!

tlrx commented Mar 21, 2014

Uh oh!