Thank you! The library currently supports only single-threaded indexing, but it would be great to add multi-threaded indexing in the future. I'm not yet sure what the best approach would be; I'd need to spend some time thinking about it. That said, 400 it/s seems quite slow. Even on Kaggle, it processes 17K/s: https://www.kaggle.com/code/xhlulu/benchmark-bm25s-msmarco Could you try running your code on Kaggle to see whether the bottleneck is in your processing code or in the server itself?
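One rough way to check where the time goes is to time each stage separately and compare documents per second. The sketch below uses only the standard library; `measure_throughput` and `simple_tokenize` are hypothetical names made up for illustration, not part of the library. You would pass in your own preprocessing function and your own indexing call as the stage.

```python
import time
from typing import Callable, List, Tuple


def measure_throughput(
    stage: Callable[[List[str]], object], docs: List[str]
) -> Tuple[object, float]:
    """Run `stage` on `docs` and return (result, documents per second)."""
    start = time.perf_counter()
    result = stage(docs)
    elapsed = time.perf_counter() - start
    # Guard against a zero elapsed time on very small inputs.
    rate = len(docs) / elapsed if elapsed > 0 else float("inf")
    return result, rate


def simple_tokenize(docs: List[str]) -> List[List[str]]:
    # Hypothetical stand-in for your own preprocessing step.
    return [doc.lower().split() for doc in docs]


# Toy corpus; replace with a sample of your real documents.
docs = ["the quick brown fox", "jumps over the lazy dog"] * 5000

tokens, rate = measure_throughput(simple_tokenize, docs)
print(f"tokenization: {rate:,.0f} docs/s")
```

If one stage is far slower than the others on the same sample, that stage is probably the bottleneck, whether or not indexing uses multiple threads.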
-
Hi there!
Thanks for the great library. It makes retrieval in Python very user-friendly. May I ask whether it would be possible to support indexing with multiple threads? As you can see from the screenshot, only one thread is being used right now. I would really love a way to speed things up!
Thanks