Upgrade Lucene to version 9.12.2#2971
Upgrade Lucene to version 9.12.2#2971suraj-subrahmanyan wants to merge 4 commits intocastorini:masterfrom
Conversation
|
Hi @suraj-subrahmanyan thanks for working on this! Do the prebuilt indexes (e.g., https://github.com/castorini/anserini/blob/master/docs/fatjar-regressions/fatjar-regressions-v1.2.2.md ) still work? Or will we have to completely rebuild indexes? |
|
The pre-built indexes are working correctly -- particularly, I was able to download pre-built indexes with |
|
hi @suraj-subrahmanyan I noticed that the HNSW scores change... which means we'll need to run regressions from scratch and manually verify/fix all scores... it's going to be a lot of work :( we'll have to do it all over again when we upgrade to Lucene 10... which makes me think... should we just bite the bullet and upgrade to Lucene 10 directly? wdyt? |
|
Oh, I realize that during the upgrade to the Lucene version, I had to fix test cases because of the vector ranking changed. Should've realized that likely applied to the rest of HNSW scores. I see your point, and I agree -- I don't think updating all the regressions for 9.12.2 is worth it for the minor benefits it introduced. Although Lucene version 10 seems relatively new, it might help restructure Anserini for the better, long-term? |
|
Closing, superseded by #3082 |
This PR addresses issue #2947.
Start: 9.9.1
9.9.1 --> 9.10.0
Error:
Caused by: GITHUB#13090.
Solution: Updated test output with new relevance scores and other metrics.
9.10.0 --> 9.11.0
Error:
Caused by: GITHUB#13288.
Solution: Updated imports and later the expected test output (similar to 9.9.1 --> 9.10.0).
9.11.0 --> 9.11.1
No errors or changes.
9.11.1 --> 9.12.0
Caused by: GITHUB#13469.
Solution: Refactored Lucene99Codec to Lucene912Codec and removed second parameter from FlatVectorsWriter.addField().
9.12.0 --> 9.12.2
No errors or changes.
mvn clean testresults in all tests passed.