@atmguille
Enable rolling inference for closed-loop simulation. To support this, cutoff_frame is also implemented: frames before cutoff_frame are kept constant during the denoising process.
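
For readers unfamiliar with the idea, here is a minimal NumPy sketch of what holding the frames before cutoff_frame constant during denoising could look like, and how that enables rolling inference. It is not the code in this PR: the function names, the `denoise_step` callable, the `(T, H, W)` array layout, and the choice to seed each new window with the last cutoff_frame frames of the previous one are all assumptions made for illustration.

```python
import numpy as np


def denoise_with_cutoff(noisy, context, cutoff_frame, denoise_step, num_steps):
    """Run a toy denoising loop while pinning the first `cutoff_frame` frames.

    `denoise_step(x, step)` is a hypothetical stand-in for one step of a real
    sampler; it must return an array with the same shape as `x`.
    """
    x = noisy.copy()
    for step in range(num_steps):
        # Overwrite the conditioned frames so the sampler always sees them unchanged.
        x[:cutoff_frame] = context[:cutoff_frame]
        x = denoise_step(x, step)
    # Restore the pinned frames once more so the output contains them exactly.
    x[:cutoff_frame] = context[:cutoff_frame]
    return x


def rolling_inference(initial_frames, num_windows, window_len, cutoff_frame,
                      denoise_step, num_steps, rng):
    """Generate a long clip window by window (assumed rolling scheme).

    Each window keeps its first `cutoff_frame` frames fixed; those frames are
    taken from the tail of the previous window (or from `initial_frames`).
    """
    if not 0 < cutoff_frame < window_len:
        raise ValueError("cutoff_frame must be in (0, window_len)")
    seed = initial_frames[-cutoff_frame:]
    frame_shape = initial_frames.shape[1:]
    output = [seed]
    for _ in range(num_windows):
        context = np.zeros((window_len,) + frame_shape)
        context[:cutoff_frame] = seed
        noisy = rng.standard_normal(context.shape)
        window = denoise_with_cutoff(noisy, context, cutoff_frame,
                                     denoise_step, num_steps)
        # Only the frames after the cutoff are newly generated.
        output.append(window[cutoff_frame:])
        seed = window[-cutoff_frame:]
    return np.concatenate(output, axis=0)
```

In a closed-loop setting, the seed frames for each window would presumably come from the simulator's latest observations rather than purely from the previous output; the sketch keeps everything self-contained to show only the pinning and windowing.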

nvidia-cosmos and others added 30 commits March 2, 2025 07:23
Signed-off-by: Jibin Varghese <[email protected]>
begining -> beginning
Signed-off-by: Jibin Varghese <[email protected]>
* feat: add post-training and custom-training support

* feat: add separate model definitions supporting tp/sp for training; update configs

* feat: add example Dataset class, add data augmentors, update config

* feat: add example data class, add misc improvements to data loading and config, add script to convert ckpt to tp

* fix: fix conflict in DiTEncoder

* cleanup

* feat: complete README in examples/ for post/pre-training; update the main README

* fix: multiple minor fixes on example dataset

* fix: multiple minor fixes + improve example dataset performance

* feat+fix: multiple fixes + refinements to README
…mos#55)

* [fead-cicd]: add-linting, formatting and check video tags

* fix comments
zhe-thoughts and others added 30 commits April 25, 2025 17:25
* Adding a News section

* Small fix
* replacing aegis with llamaGuard3

* updating moderation output parsing

* adding llama3 licensing

* linter fixes

---------

Co-authored-by: Andrew Wang <[email protected]>
* minor update on how to deal with ema field in the ckpt loading

* minor fix
…vidia-cosmos#60)

* Add lidar & Hdmap feature to example dataloader

Signed-off-by: yu chen <[email protected]>

* hdmap/lidar ctrl posttrain config

* ckpt loading fixes

* fix input channel config to new convention

* add short-video ctrlnet model, fix config for sample av post train

* add 57 frame configs

* adapt to new av-specific dataloader

* add readme for post-training sample-av

* fix typo

* add back script

* adapt to latest toolkit data format

* resolve review items

---------

Signed-off-by: yu chen <[email protected]>
Co-authored-by: yu chen <[email protected]>
Co-authored-by: Tianshi Cao <[email protected]>
…smos#73)

* Add lidar & Hdmap feature to example dataloader

Signed-off-by: yu chen <[email protected]>

* hdmap/lidar ctrl posttrain config

* ckpt loading fixes

* fix input channel config to new convention

* add short-video ctrlnet model, fix config for sample av post train

* add 57 frame configs

* adapt to new av-specific dataloader

* add readme for post-training sample-av

* fix typo

* add back script

* adapt to latest toolkit data format

* resolve review items

* update readme link for sample AV pyt example

---------

Signed-off-by: yu chen <[email protected]>
Co-authored-by: yu chen <[email protected]>
Co-authored-by: Tianshi Cao <[email protected]>
* fix issue 82

* fix file header

* fix lint

---------

Co-authored-by: Jinwei Gu <[email protected]>
Co-authored-by: Jinwei Gu <[email protected]>
* minor fix for the keypoint control inference

* lint
* add spatial-temporal weight adding code

* add robot augmentation examples

* update spatial temporal processing code and example

* add readme

* recover

* recover

* add prompts and update readme

* fix comments

* visualization of the mp4

* update readme

* linting
This MR adds two new features that improve cosmos-transfer1's support for AV use cases.

1. Cutoff frame: This allows the user to specify a cutoff frame in the input video, marking the point from which cosmos-transfer generates subsequent frames.
2. Save intermediaries: This allows the user to save the output of the intermediate diffusion steps as a video and as a combined (tiled) GIF, which makes the diffusion process easier to inspect.

Signed-off-by: Jibin Varghese <[email protected]>