Skip to content

(retriever) use vLLM for nemotron-parse inference#1764

Merged
edknv merged 21 commits intoNVIDIA:mainfrom
edknv:edwardk/retriever-parse-vllm
Apr 4, 2026
Merged

(retriever) use vLLM for nemotron-parse inference#1764
edknv merged 21 commits intoNVIDIA:mainfrom
edknv:edwardk/retriever-parse-vllm

Conversation

@edknv
Copy link
Copy Markdown
Collaborator

@edknv edknv commented Apr 1, 2026

Description

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • If adjusting docker-compose.yaml environment variables have you ensured those are mimicked in the Helm values.yaml file.

@edknv edknv requested review from jdye64 and jperez999 April 2, 2026 16:26
@edknv edknv marked this pull request as ready for review April 2, 2026 16:26
@edknv edknv requested review from a team as code owners April 2, 2026 16:26
Copy link
Copy Markdown
Collaborator

@jperez999 jperez999 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let me know if those comments make sense. Otherwise looks good.

@edknv edknv merged commit 349ce96 into NVIDIA:main Apr 4, 2026
5 checks passed
@edknv edknv deleted the edwardk/retriever-parse-vllm branch April 4, 2026 01:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants