Add rerank#1565

Merged
jperez999 merged 19 commits into NVIDIA:main from jperez999:add-rerank
Mar 11, 2026
Conversation

@jperez999
Collaborator

Description

Adds a reranker to be used in the batch example. Also fixes recall to work with the retriever object to get queries.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • If adjusting docker-compose.yaml environment variables, ensure they are mirrored in the Helm values.yaml file.

@jperez999 jperez999 self-assigned this Mar 11, 2026
@jperez999 jperez999 requested a review from a team as a code owner March 11, 2026 14:12
@jperez999 jperez999 requested a review from ChrisJar March 11, 2026 14:12
local_hf_cache_dir: Optional[Path] = None
local_hf_batch_size: int = 64
# Reranking -----------------------------------------------------------
reranker: Optional[str] = "nvidia/llama-nemotron-rerank-1b-v2"
Collaborator

The default here is the model name, which means any code that creates a Retriever() without explicitly setting reranker will download and load the 1B model. However, the docstring on line 58 says "Set to None to skip reranking (default)".

Should this default to None instead, so reranking is opt-in?

batch_pipeline.py handles this correctly with the --reranker bool flag; this is just about the dataclass default for direct Retriever() callers.
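To illustrate the suggestion, here is a minimal sketch of the opt-in default being proposed. The RetrieverConfig name and wants_rerank helper are hypothetical, not from the PR; only the field names and the model string come from the diff above.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Optional

# Hypothetical sketch of the config discussed above: with None as the
# default, reranking is opt-in and plain construction stays lightweight.
@dataclass
class RetrieverConfig:
    local_hf_cache_dir: Optional[Path] = None
    local_hf_batch_size: int = 64
    # Reranking -----------------------------------------------------------
    # Set to a model name, e.g. "nvidia/llama-nemotron-rerank-1b-v2",
    # to enable reranking; None (the default) skips it entirely.
    reranker: Optional[str] = None

    def wants_rerank(self) -> bool:
        # Hypothetical helper: only load a rerank model when one was named.
        return self.reranker is not None

# Default construction triggers no model download.
assert not RetrieverConfig().wants_rerank()
# Opting in is explicit.
assert RetrieverConfig(reranker="nvidia/llama-nemotron-rerank-1b-v2").wants_rerank()
```

This keeps the dataclass default consistent with the docstring while leaving the --reranker flag path unchanged.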

@jperez999 jperez999 merged commit 885978a into NVIDIA:main Mar 11, 2026
8 of 9 checks passed


3 participants