Enable rolling inference and cutoff_frame functionalities #156

atmguille · 2025-07-16T11:14:52Z

Enable rolling inference for closed-loop based simulation. To support this functionality, cutoff_frame is also implemented, where frames before the cutoff_frame are kept constant during the denoising process

Signed-off-by: Jibin Varghese <[email protected]>

begining -> beginning

chore: update inference_utils.py

Minor fixes on typos and grammars

)

Co-authored-by: Jinwei Gu <[email protected]>

…idia-cosmos#28)

Signed-off-by: Jibin Varghese <[email protected]>

* feat: add post-training and custom-training support * feat: add separate model definitions supporting tp/sp for training; update configs * feat: add example Dataset class, add data augmentors, update config * feat: add example data class, add misc improvements to data loading and config, add script to convert ckpt to tp * fix: fix conflict in DiTEncoder * cleanup * feat: compelete README in examples/ for post/pre-training; update the main README * fix: multiple minor fixes on example dataset * fix: multiple minor fixes + improve example dataset performance * feat+fix: multiple fixes + refinements to README

…oid ignoring the usage of the prompt upsampler (nvidia-cosmos#43)

…mos#55) * [fead-cicd]: add-linting, formatting and check video tags * fix comments

* Adding a News section * Small fix

* replacing aegis with llamaGuard3 * updating moderation output parsing * adding llama3 licensing * linter fixes --------- Co-authored-by: Andrew Wang <[email protected]>

* minor update on how to deal with ema field in the ckpt loading * minor fix

…vidia-cosmos#60) * Add lidar & Hdmap feature to example dataloader Signed-off-by: yu chen <[email protected]> * hdmap/lidar ctrl posttrain config * ckpt loading fixes * fix input channel config to new convention * add short-video ctrlnet model, fix config for sample av post train * add 57 frame configs * adapt to new av-specific dataloader * add readme for postraining sample-av * fix typo * add back script * adapt to latest toolkit data format * resolve review items --------- Signed-off-by: yu chen <[email protected]> Co-authored-by: yu chen <[email protected]> Co-authored-by: Tianshi Cao <[email protected]>

…smos#73) * Add lidar & Hdmap feature to example dataloader Signed-off-by: yu chen <[email protected]> * hdmap/lidar ctrl posttrain config * ckpt loading fixes * fix input channel config to new convention * add short-video ctrlnet model, fix config for sample av post train * add 57 frame configs * adapt to new av-specific dataloader * add readme for postraining sample-av * fix typo * add back script * adapt to latest toolkit data format * resolve review items * update readme link for sample AV pyt example --------- Signed-off-by: yu chen <[email protected]> Co-authored-by: yu chen <[email protected]> Co-authored-by: Tianshi Cao <[email protected]>

* fix issue 82 * fix file header * fix lint --------- Co-authored-by: Jinwei Gu <[email protected]> Co-authored-by: Jinwei Gu <[email protected]>

* minor fix for the keypoint control inference * lint

* add spatial-temporal weight adding code * add robot augmentation examples * update spatial temporal processing code and example * add readme * recover * recover * add prompts and update readme * fix comments * visualization of the mp4 * update readme * linting

Signed-off-by: Jibin Varghese <[email protected]>

This MR adds two new features that improve cosmos-transfer1's support for AV use cases. 1. Cutoff frame : This allows the user to specify a cutoff frame to the input video, to demarcate the point from which cosmos-transfer generates subsequent frames. 2. Save Intermediaries : This allows the user to save the output of the intermediate diffusion steps as a video, and a combined (tiled) GIF. This helps understand diffusion models better. Signed-off-by: Jibin Varghese <[email protected]>

…ndled correctly

nvidia-cosmos and others added 30 commits March 2, 2025 07:23

Initial commit

d3dd39c

initial commit

97facc6

fix broken videos

04d4b0f

update website link

2e9a36e

Update Readme

b786096

Signed-off-by: Jibin Varghese <[email protected]>

chore: update inference_utils.py

cfb0e19

begining -> beginning

Merge pull request nvidia-cosmos#5 from eltociear/patch-1

a364090

chore: update inference_utils.py

Improvements and Bug Fixes

2824ccd

Improvements and Bug Fixes

fb0ce40

Improvements and Bug Fixes

78438cf

Update README and add transfer1 architecture diagram

2e7d324

Update README.md

8ed39c3

Minor fixes on typos and grammars

8eceb73

Merge pull request nvidia-cosmos#11 from zhe-thoughts/patch-1

9bd7946

Minor fixes on typos and grammars

Fix sample_av_hdmap_spec.json input video path (nvidia-cosmos#18)

9f1ec0c

Improvements and Bug Fixes

2ca75b9

[feat] Add License Header (nvidia-cosmos#15)

9f6588a

Allow user to specify between all, 7b and 7b_av models (nvidia-cosmos#16

cbf56b3

)

Remove third_party submodules (nvidia-cosmos#23)

672fba9

fix cp issue (nvidia-cosmos#26)

c450ef4

Co-authored-by: Jinwei Gu <[email protected]>

Update README.md with workflow section and model descriptions

35c2599

update examples README to include instructions on batch inference (nv…

d57d460

…idia-cosmos#28)

Update guardrails to it's own model (nvidia-cosmos#27)

fb3de85

Signed-off-by: Jibin Varghese <[email protected]>

fix bug for spatiotemporal control weight (nvidia-cosmos#33)

7d6e203

Update Readme (nvidia-cosmos#30)

007ac54

Signed-off-by: Jibin Varghese <[email protected]>

fix cp for t2v transfer model (nvidia-cosmos#36)

d90ba4f

Fix input_control path in sample_av_hdmap_spec.json (nvidia-cosmos#39)

679302d

Update README to use torchrun for all examples for consistency and av…

661149c

…oid ignoring the usage of the prompt upsampler (nvidia-cosmos#43)

[fead-cicd]: add-linting, formatting and check video tags (nvidia-cos…

7e1f88c

…mos#55) * [fead-cicd]: add-linting, formatting and check video tags * fix comments

zhe-thoughts and others added 30 commits April 25, 2025 17:25

Adding a News section (nvidia-cosmos#51)

73af7da

* Adding a News section * Small fix

[feat] Replacing aegis with Llama Guard 3 (nvidia-cosmos#41)

2c1ba3d

* replacing aegis with llamaGuard3 * updating moderation output parsing * adding llama3 licensing * linter fixes --------- Co-authored-by: Andrew Wang <[email protected]>

fix: contrl hint key parsing in the example dataset (nvidia-cosmos#69)

82b6a7d

* minor update on how to deal with ema field in the ckpt loading * minor fix

fix issue 82 (nvidia-cosmos#83)

e02dfcb

* fix issue 82 * fix file header * fix lint --------- Co-authored-by: Jinwei Gu <[email protected]> Co-authored-by: Jinwei Gu <[email protected]>

fix: control input key in keypointControl inference (nvidia-cosmos#85)

c42eb07

* minor fix for the keypoint control inference * lint

Update README.md to point to single control examples (nvidia-cosmos#86)

f8c0010

[feat] Enable Input Video In AV Transfer with Cutoff Frames

af7fee0

Signed-off-by: Jibin Varghese <[email protected]>

[feat] Enable Input Video In AV Transfer with Cutoff Frames

d424446

Signed-off-by: Jibin Varghese <[email protected]>

Track MP4 files with Git-LFS

7208104

Remove large files

ba87870

Track IPYNB files with Git-LFS

ce60d37

Add rolling (offline and online) inference funcrtionality

24c86a9

Merge cutoff_frame and intermediates functionality

0879cee

Merge rolling inference logic

ff3dd5a

Try adding mp4 files in merge

927c551

Merge branch 'nvidia-cosmos:main' into main

3a7f8fa

Provide different seed per chunk and assure control inputs idx are ha…

808b832

…ndled correctly

Merge branch 'main' of github.com:atmguille/cosmos-transfer1

d5088a8

Disable setting input_video to 0 in cutoff_frame

c0a4995

Rely on chunk conditioning instead of cutoff_frame functionality

e83bc89

Merge remote-tracking branch 'gitlab_private/av-improvements'

c049cfd

Simplify if clause

a26276d

Merge remote-tracking branch 'gitlab_private/av-improvements'

76d62ac

Merge branch 'nvidia-cosmos:main' into main

fcac400

Support changes in main branch an edge functionality

af7b3f1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Enable rolling inference and cutoff_frame functionalities #156

Enable rolling inference and cutoff_frame functionalities #156

Uh oh!

atmguille commented Jul 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

17 participants

Enable rolling inference and cutoff_frame functionalities #156

Are you sure you want to change the base?

Enable rolling inference and cutoff_frame functionalities #156

Uh oh!

Conversation

atmguille commented Jul 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

17 participants