Skip to content

Retrieving by BM25 becomes slower when there are many queries #1704

Closed Answered by lintool
namespace-Pt asked this question in Q&A
Discussion options

You must be logged in to vote

You're probably running out of memory. Since the SearchCollection implementation is multi-threaded, it keeps the hits in memory until all the queries are processed, and the writes out to disk all at once. This simplifies thread synchronization.

Try running on smaller batches of queries.

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@namespace-Pt
Comment options

@lintool
Comment options

@namespace-Pt
Comment options

Answer selected by lintool
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #1703 on December 15, 2021 14:38.