fix: bug where empty strings overwrite user/system messages by naymaraq · Pull Request #1406 · NVIDIA-NeMo/Skills

naymaraq · 2026-04-30T13:22:11Z

Summary by CodeRabbit

Bug Fixes
- Improved handling of system and user message overrides in prompt generation.
- Fixed behavior so explicit empty-string system messages remove any existing system prompt, and explicit user messages apply only when explicitly provided (not just truthy).

Signed-off-by: naymaraq <dkaramyan@nvidia.com>

coderabbitai · 2026-04-30T13:25:52Z

📝 Walkthrough

Walkthrough

OpenAI prompt-filling in GenerationTask.fill_prompt now applies user_message only when it is non-None and refines system_message overrides: None means no change, empty string removes a leading system message, and a non-empty string inserts or overwrites the leading system message.

Changes

Cohort / File(s)	Summary
Message Override Handling `nemo_skills/inference/generate.py`	Changed `user_message` and `system_message` override checks to `is not None`. `user_message` applies only when explicitly non-`None`. `system_message = ""` removes a leading system message; non-empty `system_message` overwrites first system message content or inserts a new leading system message.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~8 minutes

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main change: fixing a bug where empty strings were incorrectly overwriting user/system messages, which matches the core logic changes in fill_prompt.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch dkaramyan/fix-system-user-message-passing

_{Review rate limit: 9/10 reviews remaining, refill in 6 minutes.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

nemo_skills/inference/generate.py (1)

667-670: ⚡ Quick win

Add a regression test for system_message="" behavior on the OpenAI path.

The new branch at Lines 668-670 changes semantics (explicit empty string removes a leading system message). Please add a targeted test to lock this in and prevent regressions.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@nemo_skills/inference/generate.py` around lines 667 - 670, Add a regression
test that exercises the OpenAI dispatch path when self.cfg.system_message == ""
to ensure the leading system message is removed from data_point["messages"];
specifically, construct a data_point with a leading {"role":"system", ...}
message, set cfg.system_message to the empty string, invoke the code path that
processes messages for the OpenAI backend (the method that checks
self.cfg.system_message and pops data_point["messages"][0]), and assert the
leading system message has been removed while other messages remain unchanged;
mirror the existing test style for other model providers so this behavior is
locked in.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@nemo_skills/inference/generate.py`:
- Around line 667-672: The code that handles self.cfg.system_message currently
indexes data_point["messages"][0] without ensuring the list is non-empty,
causing an IndexError; update the logic in the block around
self.cfg.system_message so that before any access to data_point["messages"][0]
you check whether data_point["messages"] is non-empty and, if it is empty and
system_message is non-empty, insert the system message (using the same
{"role":"system","content": self.cfg.system_message} structure); similarly, when
system_message == "" ensure you only attempt to pop the first message if
data_point["messages"] is non-empty—adjust the branches around the existing pop
and insert calls to guard against empty lists.

---

Nitpick comments:
In `@nemo_skills/inference/generate.py`:
- Around line 667-670: Add a regression test that exercises the OpenAI dispatch
path when self.cfg.system_message == "" to ensure the leading system message is
removed from data_point["messages"]; specifically, construct a data_point with a
leading {"role":"system", ...} message, set cfg.system_message to the empty
string, invoke the code path that processes messages for the OpenAI backend (the
method that checks self.cfg.system_message and pops data_point["messages"][0]),
and assert the leading system message has been removed while other messages
remain unchanged; mirror the existing test style for other model providers so
this behavior is locked in.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 4abbd4f5-788f-4c73-8417-c26e2fee17df

📥 Commits

Reviewing files that changed from the base of the PR and between f57b173 and 3fa1961.

📒 Files selected for processing (1)

nemo_skills/inference/generate.py

Jorjeous

Lets add guard on empty messages list when system_message is non-empty

Other LGTM, will approve when above done

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Signed-off-by: naymaraq <dkaramyan@nvidia.com>

coderabbitai

🧹 Nitpick comments (1)

nemo_skills/inference/generate.py (1)

658-676: ⚡ Quick win

Add regression tests for these override semantics.

Please add focused tests for user_message="", system_message="", and system_message=None in the pure OpenAI path so this behavior doesn’t regress.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@nemo_skills/inference/generate.py` around lines 658 - 676, Add focused unit
tests exercising GenerationTask override semantics: 1) when cfg.user_message ==
"" ensure GenerationTask._set_message_text_content replaces the single user
message content with an empty string and that the code still enforces exactly
one user message; 2) when cfg.system_message == "" ensure an existing leading
system message is removed (messages.pop(0)); and 3) when cfg.system_message is
None ensure the existing system message is left unchanged. Implement these tests
against the pure OpenAI path (i.e., run the code path that uses GenerationTask
with the OpenAI client/stub) and assert the final data_point["messages"]
contents match the expected outcomes after _set_message_text_content and
_append_message_text_suffix logic has run.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@nemo_skills/inference/generate.py`:
- Around line 658-676: Add focused unit tests exercising GenerationTask override
semantics: 1) when cfg.user_message == "" ensure
GenerationTask._set_message_text_content replaces the single user message
content with an empty string and that the code still enforces exactly one user
message; 2) when cfg.system_message == "" ensure an existing leading system
message is removed (messages.pop(0)); and 3) when cfg.system_message is None
ensure the existing system message is left unchanged. Implement these tests
against the pure OpenAI path (i.e., run the code path that uses GenerationTask
with the OpenAI client/stub) and assert the final data_point["messages"]
contents match the expected outcomes after _set_message_text_content and
_append_message_text_suffix logic has run.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 7543d785-48ad-422a-8066-a0992edd349b

📥 Commits

Reviewing files that changed from the base of the PR and between 3fa1961 and d975926.

📒 Files selected for processing (1)

nemo_skills/inference/generate.py

Jorjeous

fixed

fix user/system message overriding with empty strings

3fa1961

Signed-off-by: naymaraq <dkaramyan@nvidia.com>

naymaraq requested a review from Jorjeous April 30, 2026 13:22

naymaraq changed the title ~~fix user/system message overriding with empty strings~~ fix: bug where empty strings overwrite user/system messages Apr 30, 2026

coderabbitai Bot reviewed Apr 30, 2026

View reviewed changes

Comment thread nemo_skills/inference/generate.py Outdated

Jorjeous requested changes May 1, 2026

View reviewed changes

Merge branch 'main' into dkaramyan/fix-system-user-message-passing

e0b75e9

Jorjeous added the run GPU tests label May 1, 2026

guard system_message against empty messages

d975926

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Signed-off-by: naymaraq <dkaramyan@nvidia.com>

coderabbitai Bot reviewed May 1, 2026

View reviewed changes

naymaraq added run GPU tests and removed run GPU tests labels May 1, 2026

Jorjeous self-requested a review May 4, 2026 08:43

Jorjeous approved these changes May 4, 2026

View reviewed changes

naymaraq enabled auto-merge (squash) May 4, 2026 12:08

naymaraq added run GPU tests and removed run GPU tests labels May 4, 2026

naymaraq added 2 commits May 4, 2026 17:57

Merge branch 'main' into dkaramyan/fix-system-user-message-passing

09b0049

Merge branch 'main' into dkaramyan/fix-system-user-message-passing

00f1376

naymaraq merged commit f21a52c into main May 4, 2026
5 checks passed

naymaraq deleted the dkaramyan/fix-system-user-message-passing branch May 4, 2026 15:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: bug where empty strings overwrite user/system messages#1406

fix: bug where empty strings overwrite user/system messages#1406
naymaraq merged 5 commits into
mainfrom
dkaramyan/fix-system-user-message-passing

naymaraq commented Apr 30, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Apr 30, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Jorjeous left a comment

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Jorjeous left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

naymaraq commented Apr 30, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Jorjeous left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Jorjeous left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

naymaraq commented Apr 30, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Apr 30, 2026 •

edited

Loading