[posttrain] Add Nemotron agentic SFT datasets by taivu1998 · Pull Request #6270 · marin-community/marin

taivu1998 · 2026-06-07T22:49:37Z

Add a rich multi-turn conversation adapter for agentic SFT rows so tool calls, tool observations, reasoning content, and top-level tools are preserved instead of being flattened to role/content only.
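To make the difference between the flattened and rich paths concrete, here is a minimal sketch, not the adapter code from this PR. The function names (`flatten_row`, `adapt_rich_row`) and the message field names (`tool_calls`, `tool_call_id`, `name`, `reasoning_content`, top-level `tools`) are assumptions modelled on the common OpenAI-style chat schema; the actual rows and adapter in the PR may differ.

```python
from typing import Any

# Hypothetical field names, assumed to follow an OpenAI-style chat schema.
RICH_MESSAGE_KEYS = ("tool_calls", "tool_call_id", "name", "reasoning_content")


def flatten_row(row: dict[str, Any]) -> dict[str, Any]:
    """Plain path: keep only role/content and drop everything else."""
    return {
        "messages": [
            {"role": m["role"], "content": m.get("content") or ""}
            for m in row["messages"]
        ]
    }


def adapt_rich_row(row: dict[str, Any]) -> dict[str, Any]:
    """Rich path: keep role/content plus any agentic fields that are present."""
    messages = []
    for m in row["messages"]:
        out: dict[str, Any] = {"role": m["role"], "content": m.get("content") or ""}
        for key in RICH_MESSAGE_KEYS:
            if m.get(key) is not None:
                out[key] = m[key]
        messages.append(out)

    adapted: dict[str, Any] = {"messages": messages}
    # Top-level tool definitions travel with the conversation when present.
    if row.get("tools"):
        adapted["tools"] = row["tools"]
    return adapted
```

With a row that contains an assistant tool call followed by a tool observation, `flatten_row` returns only role/content pairs, while `adapt_rich_row` also carries the tool call, the matching `tool_call_id`, the reasoning content, and the top-level `tools` list.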

Register nvidia/Nemotron-SFT-SWE-v3, nvidia/Nemotron-Cascade-SFT-SWE, nvidia/Nemotron-SFT-CUDA-v1, and split-specific nvidia/Nemotron-SFT-OpenCode-v1 views with pinned revisions. CUDA, SWE-v3, and OpenCode use the rich adapter; Cascade remains on the plain multi-turn path because its rows are ordinary messages plus task metadata.
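The registration described above could be pictured roughly as follows. This is only an illustrative sketch: `SftDatasetSpec`, `registry_key`, and `REGISTRY` are hypothetical names, not the repository's instruction dataset API, and the revision and split values are left as placeholders because the PR text does not state them. Only the dataset IDs and the rich/plain adapter assignment come from the description.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SftDatasetSpec:
    """Hypothetical registry entry; not the repository's actual config type."""
    hf_id: str
    revision: str            # pinned revision (placeholder below)
    adapter: str             # "rich" or "plain"
    split: Optional[str] = None


PINNED = "<pinned-revision>"  # placeholder: real revisions are not given in the PR text

SPECS = [
    SftDatasetSpec("nvidia/Nemotron-SFT-SWE-v3", PINNED, adapter="rich"),
    # Cascade rows are ordinary messages plus task metadata, so the plain path suffices.
    SftDatasetSpec("nvidia/Nemotron-Cascade-SFT-SWE", PINNED, adapter="plain"),
    SftDatasetSpec("nvidia/Nemotron-SFT-CUDA-v1", PINNED, adapter="rich"),
    # One entry per split-specific view; the split name here is a placeholder.
    SftDatasetSpec("nvidia/Nemotron-SFT-OpenCode-v1", PINNED, adapter="rich", split="<split-name>"),
]


def registry_key(spec: SftDatasetSpec) -> str:
    """Give split-specific views their own key so they can be selected separately."""
    return spec.hf_id if spec.split is None else f"{spec.hf_id}/{spec.split}"


REGISTRY = {registry_key(spec): spec for spec in SPECS}
```

Keying split-specific views separately means each OpenCode split can be mixed or weighted on its own, while the pinned revision keeps every entry reproducible.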

taivu1998 added 2 commits June 7, 2026 15:48

[posttrain] Add Nemotron agentic SFT datasets

d439faf

Register the SWE, Cascade, OpenCode, and CUDA Nemotron SFT datasets through the instruction dataset path. Add a rich multi-turn adapter so tool calls, reasoning content, and tool context survive transformation for agentic SFT rows.

[lint] Remove trailing whitespace in lint review skill

fe8a079

Fix the all-files trailing whitespace failure reported by the PR lint job so the branch can pass repository-wide pre-commit checks.

taivu1998 marked this pull request as ready for review June 8, 2026 00:45

taivu1998 wants to merge 2 commits into marin-community:main from taivu1998:tdv/nemotron-swe-opencode-cuda-sft
